IN VIVO PRODUCTION OF LONG DOUBLE STRANDED RNA UTILIZING VLPS
20180291066 ยท 2018-10-11
Assignee
Inventors
Cpc classification
C12N7/00
CHEMISTRY; METALLURGY
C12N2795/18142
CHEMISTRY; METALLURGY
International classification
Abstract
Methods and compositions for producing, isolating and purifying virus-like particles (VLPs) containing long sense and antisense RNA molecules, including RNA molecules predicted to fold into double-stranded regions longer than the inner diameter of the VLP capsids are disclosed.
Claims
1. A virus-like particle (VLP) comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least two RNA sequences one of which is the reverse complement of the other and each is longer than 35 nucleotides.
2. The VLP of claim 1 in which the number of complementary nucleotides in each RNA strand is larger than the ratio of the internal diameter of the capsid expressed in nanometers and 0.28.
3. The VLP of claim 1 wherein the two RNA sequences are part of the same RNA strand.
4. The VLP of claim 1 wherein the two RNA sequences are part of RNA strands not covalently linked to each other.
5. A virus-like particle (VLP) comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least one substantially completely complementary sequence of dsRNA (SRP) longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5.
6. The VLP of claim 5 in which the number of base pairs in the SRP is larger than the ratio between the internal diameter of the capsid expressed in nanometers and 0.28.
7. The VLP of claim 5 wherein the SRP is formed within RNA sequences contained the same RNA strand.
8. The VLP of claim 5 wherein the SRP is formed within RNA sequences not covalently linked to each other.
9. A nucleic acid construct encoding a capsid protein and a transcribed sequence, the transcribed region further comprising two regions, wherein one region comprises the reverse complement of the other, such that the transcribed sequence is capable of forming a double stranded RNA of at least 35 base pairs, such transcribed sequence further comprising a pac sequence.
10. A host cell comprising the nucleic acid construct of claim 9.
11. A nucleic acid construct encoding a capsid protein and a transcribed sequence, the transcribed region further comprising two regions, wherein one region comprises the reverse complement of the other, such that the transcribed region comprises at least one SRP longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5.
12. A host cell comprising the nucleic acid construct of claim 11.
Description
DESCRIPTION OF THE DRAWINGS
[0017]
DETAILED DESCRIPTION OF THE INVENTION
[0018] While the present invention is capable of being embodied in various forms, the description below of several embodiments is made with the understanding that the present disclosure is to be considered as an exemplification of the invention, and is not intended to limit the invention to the specific embodiments illustrated. Headings are provided for convenience only and are not to be construed to limit the invention in any manner. Embodiments illustrated under any heading may be combined with embodiments illustrated under any other heading.
[0019] The use of numerical values in the various ranges specified in this application, unless expressly indicated otherwise, are stated as approximations as though the minimum and maximum values within the stated ranges were both preceded by the word about. In this manner, slight variations above and below the stated ranges can be used to achieve substantially the same results as values within the ranges. As used herein, the terms about and approximately when referring to a numerical value shall have their plain and ordinary meanings to one skilled in the pertinent art at issue. Also, the disclosure of ranges is intended as a continuous range including every value between the minimum and maximum values recited as well as any ranges that can be formed by such values. This includes ranges that can be formed that do or do not include a finite upper and/or lower boundary. This also includes ratios that are derivable by dividing a given disclosed numeral into another disclosed numeral. Accordingly, the skilled person will appreciate that many such ratios, ranges, and ranges of ratios can be unambiguously derived from the data and numbers presented herein and all represent various embodiments of the present invention.
[0020] A wide variety of conventional techniques and tools in chemistry, biochemistry, molecular biology, and immunology are employed and available for practicing the methods and compositions described herein, are within the capabilities of a person of ordinary skill in the art and well described in the literature. Such techniques and tools include those for generating and purifying VLPs including those with a wild type or a recombinant capsid together with the cargo molecule(s), and for transforming host organisms and expressing recombinant proteins and nucleic acids as described herein. See, e.g., Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); and Ausubel, et al, Current Protocols in Molecular Biology, Greene Publ. Assoc., Wiley-Interscience, NY (1995). The disclosures in each of which are herein incorporated by reference.
[0021] Processes for preparing VLPs from genetically engineered bacterial host cells such as E. coli comprising expression systems are well known to those skilled in the art. For example, such systems are described in U.S. Patent Publication No. 20140302593 A1, the contents of which are incorporated herein by reference. In one aspect, the present methods relate to E. coli host cells comprising expression systems, the expression systems comprising nucleotide sequence encoding capsid proteins and a long sense and antisense RNA strand operably linked to an inducible promoter such that the capsid protein and the long sense and antisense RNA strand is expressed in the host cells when the promoter is induced. Introduction of the plasmid into the E. coli host cell can be accomplished by any of several standard molecular biology techniques such as those described in Davis, et al., Basic Methods in Molecular Biology, Elsevier Science Publishing Co., Inc., N.Y. (1986) and Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., (1989), including, without limitation, calcium phosphate transfection, microinjection, electroporation, conjugation, infection and the like. Similarly, any system or vector suitable to maintain, propagate or express polynucleotides and/or express a polypeptide in a host may be used to practice the present invention. For example, the appropriate DNA sequence may be inserted into a vector such as a plasmid by standard techniques.
A. Definitions
[0022] As used herein, the term cargo molecule refers to any oligonucleotide, polypeptide or peptide molecule, which is or may be enclosed by a capsid.
[0023] As used herein, the term oligonucleotide refers to a short polymer of at least two nucleotides. An oligonucleotide may be an oligodeoxyribonucleotide (DNA) or an oligoribonucleotide (RNA), and encompasses short RNA molecules such as but not limited to siRNA, shRNA, sshRNA, and miRNA.
[0024] As used herein, the term peptide refers to a polymeric molecule which minimally includes at least two amino acid monomers linked by peptide bond, and preferably has at least about 10, and more preferably at least about 20 amino acid monomers, and no more than about 60 amino acid monomers, preferably no more than about 50 amino acid monomers linked by peptide bonds. For example, the term encompasses polymers having about 10, about 20, about 30, about 40, about 50, or about 60 amino acid residues.
[0025] As used herein, the term polypeptide refers to a polymeric molecule including at least one chain of amino acid monomers linked by peptide bonds, wherein the chain includes at least about 70 amino acid residues, preferably at least about 80, more preferably at least about 90, and still more preferably at least about 100 amino acid residues. As used herein the term encompasses proteins, which may include one or more linked polypeptide chains, which may or may not be further bound to cofactors or other proteins. The term protein as used herein is used interchangeably with the term polypeptide.
[0026] As used herein, the term variant with reference to a molecule is a sequence that is substantially similar to the sequence of a native or wild type molecule. With respect to nucleotide sequences, variants include those sequences that may vary as to one or more bases, but because of the degeneracy of the genetic code, still encode the identical amino acid sequence of the native protein. Variants include naturally occurring alleles, and nucleotide sequences which are engineered using well-known techniques in molecular biology, such as for example site-directed mutagenesis, and which encode the native protein, as well as those that encode a polypeptide having amino acid substitutions. Generally, nucleotide sequence variants of the invention have at least 40%, at least 50%, at least 60%, at least 70% or at least 80% sequence identity to the native (endogenous) nucleotide sequence. The present disclosure also encompasses nucleotide sequence variants having at least about 85% sequence identity, at least about 90% sequence identity, at least about 85%, 86%, 87%, 88%, 89%, 90% 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%.
[0027] Sequence identity of amino acid sequences or nucleotide sequences, within defined regions of the molecule or across the full-length sequence, can be readily determined using conventional tools and methods known in the art and as described herein. For example, the degree of sequence identity of two amino acid sequences, or two nucleotide sequences, is readily determined using alignment tools such as the NCBI Basic Local Alignment Search Tool (BLAST) [Altschul et al., J. Mol. Biol. 215(3):403-10 (1990)], which are readily available from multiple online sources. Algorithms for optimal sequence alignment are well known and described in the art, including for example in Smith, T. F. and M. S. Waterman, Adv. Appl. Math. 2:482-9 (1981) and Pearson, W. R. and D. J. Lipman, Proc. Natl. Acad. Sci. (U.S.A.) 85: 2444-8 (1988). Algorithms for sequence analysis are also readily available in programs such as blastp, blastn, blastx, tblastn and tblastx. For the purposes of the present disclosure, two nucleotide sequences may be also considered substantially identical when they hybridize to each other under stringent conditions. Stringent conditions include high hybridization temperatures and low salt hybridization buffers which permit hybridization only between nucleic acid sequences that are highly similar. Stringent conditions are sequence-dependent and will be different in different circumstance, but typically include a temperature at least about 60 C., which is about 10 C. to about 15 C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Salt concentration is typically about 0.02 molar at pH 7.
[0028] As used herein with respect to a given nucleotide sequence, the term conservative variant refers to a nucleotide sequence that encodes an identical or essentially identical amino acid sequence as that of a reference sequence. Due to the degeneracy of the genetic code, whereby almost always more than one codon may code for each amino acid, nucleotide sequences encoding very closely related proteins may not share a high level of sequence identity. Moreover, different organisms have preferred codons for many amino acids, and different organisms or even different strains of the same organism, e.g., E. coli strains, can have different preferred codons for the same amino acid. Thus, a first nucleotide acid sequence which encodes essentially the same polypeptide as a second nucleotide acid sequence is considered substantially identical to the second nucleotide sequence, even if they do not share a minimum percentage sequence identity, or would not hybridize to one another under stringent conditions. Additionally, it should be understood that with the limited exception of ATG, which is usually the sole codon for methionine, any sequence can be modified to yield a functionally identical molecule by standard techniques, and such modifications are encompassed by the present disclosure. As described herein below, the present disclosure specifically contemplates protein variants of a native protein, which have amino acid sequences having at least 15%, at least 16%, at least 21%, at least 40%, at least 41%, at least 52%, at least 53%, at least 56%, at least 59% or at least 86% sequence identity to a native nucleotide sequence.
[0029] The degree of sequence identity between two amino acid sequences may be determined using the BLASTp algorithm [Karlin, S. and S. F. Altschul, Proc. Natl. Acad. Sci. USA 87:2264-8 (1990). The percentage of sequence identity is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the amino acid sequence in the comparison window may comprise additions or deletions (i.e. gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which an identical amino acid occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
[0030] One of skill will recognize that polypeptides may be substantially similar, in that an amino acid may be substituted with a similar amino acid residue without affecting the function of the mature protein. Polypeptide sequences which are substantially similar share sequences as noted above, except that residue positions, which are not identical may have conservative amino acid changes. Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. Preferred conservative amino acid substitution groups include: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine.
[0031] A nucleic acid encoding a peptide, polypeptide or protein may be obtained by screening selected cDNA or genomic libraries using a deduced amino acid sequence for a given protein. Conventional procedures using primer extension procedures, as described for example in Sambrook, et al, (1989) can be used to detect precursors and processing intermediates.
[0032] EINVERTED is publicly available software for identifying inverted repeats (stem-loops) in a nucleotide sequence. EINVERTED can be used to obtain various parametric scores for SPC dsRNA stems. Since EINVERTED doesn't take into account GU pairs it systematically underestimates the actual potential length of some SPC dsRNAs. To account for this, we have used the following protocol which we have named XEINVERTED, to obtain more accurate identification of the extent of SPC regions in an RNA strand.
[0033] First, EINVERTED identifies SPC sections of an RNA strand. Second, we evaluate immediately adjacent flanking sections for additional GU, AT and GC pairs. Third, we expand the identified SPC dsRNA stem to include such flanking sections. Fourth, we calculate the maximum possible score for the expanded SPC dsRNA stem. Fifth, we accept the expanded SPC ds RNA stem if the maximum possible score is higher than the EINVERTED score and XEINVERTED score for shorter SPC dsRNA stems, otherwise it is rejected. This ensures that more stable, but shorter, SPC dsRNA structures are properly recognized. The number of bases incorporated within an SPC region is referred to as the EINVERTED length or XEINVERTED length, depending on the method used to calculate that particular SPC region.
[0034] SRP is RNA capable of adopting a conformation comprising a substantially completely complementary stretch of dsRNA and is defined here as any RNA comprising 2 ssRNA sequences which, if occurring within the same RNA strand, has an EINVERTED length or an XEINVERTED length of at least 46 bp and an EINVERTED score or an XEINVERTED score greater than 50 when using the following calculational parameters:
[0035] Match score for AU & GC pairs=3
[0036] Match score for GU pairs=1
[0037] Mismatch score=4
[0038] Gap penalty=5
[0039] Minimum score threshold=50
[0040] Maximum extent of repeats=3,000
[0041] Pan, et al., [Pan, et al., FEBS Journal 279(7): 1198-208 (2012)] describe encapsidation of sense and antisense RNA strands comprising SEQ ID NO.: 1 (P146 of Table 2, p. 1205 of Pan et al.), each containing a single SRP packaged into MS2. SEQ ID NO.: 1 represents an example of how a particular sequence can produce significantly different EINVERTED and XEINVERTED lengths and scores. Analysis of SEQ ID NO.: 1 with the parameters given above produces an EINVERTED length of 42 and EINVERTED score of 66. Applying the XEINVERTED protocol to SEQ ID NO.: 1, based on the same EINVERTED parameters produces an XEINVERTED length of 46 and XEINVERTED score of 85.
[0042] As described above, EINVERTED identified the two SPC strands within SEQ ID NO.: I as SEQ ID NO.:2, ACAUGAGGAUCACCCAUGUAGCUCUGAGAACUG AAUUCCAUGGGU on the sense strand, as well as the matching sequence SEQ ID NO.: 3, ACCUGUGAAAUUCAGUUCUUCAGCUACAUGAGGAUCACCCAUGU on the antisense strand.
[0043] XEINVERTED analysis reveals two additional GU matches, as well as an additional GC match, which results in expanding the two SPC strands within SEQ ID NO.: 1 as SEQ ID NO.: 4, CUACAUGAGGAUCACCCAUGUAGCUCUGAGAACU GAAUUCCAUGGGUU on the sense strand, as well as the matching sequence SEQ ID NO.: 5, GACCUGUGAAAUUCAGUUCUUCAGCUACAUGAGGAUCACC CAUGUGG on the antisense strand.
[0044] Thus, although EINVERTED provides a convenient tool for determining the basic parameters of SPC dsRNA, additional steps such as those described in the XEINVERTED protocol may reveal that a particular SPC dsRNA has significantly different parameters. This is important because the length of an SPC dsRNA is particularly relevant to determining whether it can be packaged within a VLP or not. In some cases EINVERTED may underestimate the length of an SPC dsRNA since it discriminates against non-canonical base pairing that is likely to occur in dsRNA formation.
[0045] SRPs is defined as more than one SRP.
B. VLPS Enclosing RNA Capable of Adopting at Least One SRP Configuration
[0046] The methods and compositions described herein are the result in part of the appreciation that certain viral capsids can be prepared and/or used in novel manufacturing and purification methods to improve commercialization procedures for SRPs
[0047] The capsid may be a wild type capsid or a mutant capsid derived from a wild type capsid, provided that the capsid encloses a SRP.
[0048] The present disclosure encompasses a composition comprising: a plurality of virus-like particles each comprising a wild type viral capsid and at least one SRP in the wild type viral capsid. Subsequently the cargo molecules can be readily harvested from the capsids. Accordingly, such compositions are highly desirable for all applications where high purity and/or high production efficiency is required.
[0049] A capsid can self-assemble from capsid protein in the presence of nucleotide cargo, such as an oligoribonucleotide. In non-limiting example, a capsid as described herein may enclose a target heterologous RNA strand, such as for example a target heterologous RNA strand containing a total of between 1,800 and 2,248 ribonucleotides, including the 19-mer pac site from Enterobacteriophage MS2, such RNA strand transcribed from a plasmid separate from a plasmid coding for the capsid proteins, as described by Wei, et al., J. Clin. Microbiol. 46:1734-40 (2008), or transcribed from the same plasmid as the plasmid coding for the capsid proteins.
[0050] RNA interference (RNAi) is a phenomenon mediated by short RNA molecules such as siRNA molecules, which can be used for selective suppression of a target gene of interest, and has multiple applications in biotechnology and medicine. For example, short RNA molecules can be employed to target a specific gene of interest in an organism (the target host) to obtain a desirable phenotype. Longer dsRNA molecules, for example SRPs, can also be used for RNAi purposes. RNA molecules, including siRNA and SRPs, are however easily degraded by ubiquitous enzymes called RNAses. Capsids, such as those described herein, protect encapsidated RNA from enzymatic degradation.
[0051] VLPs containing RNA strands capable of forming SRPs, as described herein comprise any substantially perfectly complementary dsRNA longer than 42 bp identified by EINVERTED, or longer than 46 bp when identified using XEINVERTED, in which at least one of the strands comprises a sequence capable of hybridizing with RNA or DNA present in the target host at least 19 nt long. Such RNA or DNA present in the target host can be, or result from, target host genomic DNA or can be viral, bacterial or fungal sequences present in the target host before, during or after the SRPs are delivered to the target host. Examples of SRPs are pre-miRNAs, pre-siRNAs, miRNAs, certain hairpin-forming mRNAs, sections of dsRNA viruses and sections of replication intermediates of ssRNA viruses.
[0052] Such compositions are much simpler, less expensive and more reliably manufactured than current alternatives for RNA delivery.
[0053] VLPs as described herein may be assembled by any available method(s) which produces a VLP with an assembled capsid encapsidating one or more cargo molecule(s). For example, capsids and cargo molecules may be co-expressed in any expression system. Recombinant DNA encoding one or more capsid proteins, one or more cargo molecule(s) can be readily introduced into the host cells, e.g., bacterial cells, plant cells, yeast cells, fungal cells, and animal cells (including insect and mammalian cells) by transfection with one or more expression vectors by any procedure useful for introducing such a vector into a particular cell, and stably transfecting the cell to yield a cell which expresses the recombinant sequence(s).
[0054] The host cell is preferably of eukaryotic origin, e.g., plant, mammalian, insect, yeast or fungal sources, but non-eukaryotic host cells may also be used. Suitable expression systems include but are not limited to microorganisms such as bacteria {e.g. E. coli) transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing the coding sequences for the VLP elements. In non-limiting example, E. coli is a suitable expression system for VLPs comprising the MS2 capsid protein.
[0055] The present disclosure expressly contemplates plant cells which have been transformed using a nucleic acid construct as described herein, and which expresses a capsid coat protein and cargo molecule. Means for transforming cells including plant cells and preparing transgenic cells are well known in the art. Vectors, plasmids, cosmids, YACs (yeast artificial chromosomes) and DNA segments can be used to transform cells and will as generally recognized include promoters, enhancers, and/or polylinkers. Transgenic cells specifically contemplated include transgenic plant cells including but not limited to cells obtained from corn, soybean, wheat, vegetables, grains, legumes, fruit trees, and so on, or any plant which would benefit from introduction of a VLP as described herein. Also contemplated are plants, plant tissue obtained from cells transformed as described herein, and the seed or progeny of the plant or plant tissue.
[0056] Expression of assembled VLPs can be obtained for example by constructing at least one expression vector including sequences encoding all elements of the VLP. Sometimes two vectors are used, a first vector which includes a sequence encoding the cargo molecule(s); and a second vector which includes a sequence encoding the capsid protein. In an exemplary process for generating exemplary VLPs comprising SRPs, one vector may be expressed in the host cell for generation of the VLP, as further detailed in the Examples. Methods and tools for constructing such expression vector containing the coding sequences and transcriptional and translational control sequences are well known in the art. Vector(s) once constructed are transferred to the host cells also using techniques well known in the art, and the cells then maintained under culture conditions for a time sufficient for expression and assembling of the VLPs to occur, all using conventional techniques. The present disclosure thus encompasses host cells containing any such vectors, and cells which have been transformed by such vectors, as well as cells containing the VLPs.
[0057] When the VLPs have been expressed and assembled in the host cell they may be isolated and purified using any method known in the art for virus purification. For example, the cells can be lysed using conventional cell lysis techniques and agents. VLPs in the cell lysate following hydrolysis can be removed and purified using conventional protein isolation techniques.
[0058] Purification of capsids, VLPs or proteins may also include methods generally known in the art. For example, following capsid expression and purification of capsids, VLPs or proteins may include for example at least one liquid-liquid extraction step, at least one fractional precipitation step, at least one ultrafiltration step, or at least one crystallization step. A liquid-liquid extraction may comprise for example use of an immiscible non-aqueous non-polar solvent, such as but not limited to benzene, toluene, hexane, heptane, octane, chloroform, dichloromethane, or carbon tetrachloride. Purifying may include at least one crystallization step.
[0059] Following purification, the capsid can be opened to obtain the cargo molecule, which may be a protein or polypeptide, a peptide, or a nucleic acid molecule as described herein. Capsids can be opened using any one of several possible procedures known in the art, including for example heating in an aqueous solution above 50 C.; repeated freeze-thawing; incubating with denaturing agents such as formamide; by incubating with one or more proteases; or by a combination of any of these procedures.
[0060] VLPs according to the present disclosure and as used in any of the methods and processes, thus encompass those comprising a capsid protein having at least 15%, 16%, 21%, 40%, 41%, 52%, 53%, 56%, 59% or at least 86% sequence identity with the amino acid sequence of wild type Enterobacteriophage MS2 capsid protein.
[0061] The specialized VLPs described herein can be used in research and development and in industrial manufacturing facilities to provide improved yields, since the purification processes used in both settings have the same matrix composition. Having such same composition mainly depends on using the same cell line in both research and development and manufacturing processes.
C. Preferred Embodiments
[0062] The following are among the preferred embodiments of the invention.
[0063] In one embodiment, the invention is a method for producing VLPs containing heterologous RNA sequences capable of forming sense and antisense RNA strand regions longer than the internal diameter of the VLP capsid.
[0064] In another embodiment, the invention is a virus-like particle (VLP) comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least two RNA sequences one of which is the reverse complement of the other and each is longer than 35 nucleotides.
[0065] In yet another embodiment, the VLP comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least two RNA sequences one of which is the reverse complement of the other, the number of complementary nucleotides in each RNA strand is larger than the ratio of the internal diameter of the capsid expressed in nanometers and 0.28.
[0066] In yet another embodiment, the VLP comprises a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least two RNA sequences one of which is the reverse complement of the other, the two RNA sequences are part of the same RNA strand.
[0067] In yet another embodiment, the VLP comprises a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least two RNA sequences one of which is the reverse complement of the other, the two RNA sequences are part of RNA strands not covalently linked to each other.
[0068] In still another embodiment the invention is a virus-like particle (VLP) comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least one SRP longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the following calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5.
[0069] In yet another embodiment, the VLP comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least one SRP longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the following calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5, the number of base pairs in the SRP is larger than the ratio between the internal diameter of the capsid expressed in nanometers and 0.28.
[0070] In yet another embodiment, the VLP comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least one SRP longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the following calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5, the SRP is formed within RNA sequences contained the same RNA strand.
[0071] In yet another embodiment, the VLP comprising a capsid enclosing at least one heterologous cargo molecule and at least one packing sequence, wherein the heterologous cargo molecule comprises at least one SRP longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the following calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5, the SRP is formed within RNA sequences not covalently linked to each other.
[0072] In still another embodiment, the invention comprises a nucleic acid construct encoding a capsid protein and a transcribed sequence, the transcribed region further comprising two regions, wherein one region comprises the reverse complement of the other, such that the transcribed sequence is capable of forming a double stranded RNA of at least 35 base pairs, such transcribed sequence further comprising a pac sequence.
[0073] In yet another embodiment, the invention comprises a host cell comprising a nucleic acid construct encoding a capsid protein and a transcribed sequence, the transcribed region further comprising two regions, wherein one region comprises the reverse complement of the other, such that the transcribed sequence is capable of forming a double stranded RNA of at least 35 base pairs, such transcribed sequence further comprising a pac sequence.
[0074] In still another embodiment, the invention comprises a nucleic acid construct encoding a capsid protein and a transcribed sequence, the transcribed region further comprising two regions, wherein one region comprises the reverse complement of the other, such that the transcribed region comprises at least one SRP longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the following calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5.
[0075] In yet another embodiment, the invention comprises a host cell comprising a nucleic acid construct encoding a capsid protein and a transcribed sequence, the transcribed region further comprising two regions, wherein one region comprises the reverse complement of the other, such that the transcribed region comprises at least one SRP longer than 46 base pairs and an EINVERTED score or XEINVERTED score greater than 50 when using the following calculation parameters: match score for AU and GC pairs=3, match score for GU pairs=1, mismatch score=4 and gap penalty=5.
Example 1
Production of VLPs Containing a Single Stranded RNA Comprising Sense and Antisense ErkA Sequence Flanked by PAC Sequences in a Single Capsid
[0076] Plasmid pAPSE10114 (SEQ ID NO.: 6) encodes a T7 promoter sequence upstream of the MS2 coat protein (SEQ ID NO: 7) followed by a multiple cloning site flanked by two distinct MS2 19-mer RNA hairpins to enhance packing of the target sequence (the target sequence site). The pac site found 5 of the target sequence site has the wild-type sequence ACAUGAGGAUUACCCAUGU and the C-PAC site found 3 of the target sequence site is a modified pac site with a higher affinity for binding the MS2 coat protein with the sequence ACAUGAGGAUCACCCAUGU. A T7 terminator is also encoded in the vector to stop transcription of the RNA. This construct is inserted into the SmaI restriction site of plasmid pBR322.
[0077] Cloning of sense and antisense 180 bp fragments of ErkA (SEQ ID NO: 8) from Drosophila melanogaster (GenBank Accession NM_001300706 positions 156-335) were performed by PCR amplification of the region of interest and subsequent ligation into pAPSE10114 (SEQ ID NO: 6) to form pAPSE10131 (SEQ ID NO: 9) which contains the ErkA sense and antisense strands connected by a 40 bp linker.
[0078] Chemically competent HTE115 (DE3) cells were transformed with pAPSE10131, and selected for ampicillin resistant transformants. For VLP production these transformants were grown at 37 C. in 100 mL LB medium containing ampicillin. When the culture density reached OD.sub.600 0.8, isopropyl -D-thiogalactopyranoside (Gold Biotech, St. Louis, Mo.) was added to a final concentration of 1 mM. Cells were harvested 4 hours post-induction by centrifugation at 3,000 g at 4 C. for 10 minutes. Cells were stored at 4 C. until time of lysis.
Example 2
[0079] Purification of VLPs by Enzymatic Treatment and Fractional Precipitation with Ammonium Sulfate
[0080] Purification of VLPs was conducted as follows. One half of the cell pellet obtained from Example 1 was resuspended in approximately 10 volumes of 20 mM Tris-HCl, pH 7.0, containing 10 mM NaCl and sonicated to lyse the cells. Cell debris was removed by centrifugation at 16,000 g. The sample was further processed by addition of Benzonase Nuclease (Sigma Aldrich, St. Louis, Mo.) was added to a final concentration of 100 units per mL and incubated at 37 C. for two hours. Proteinase K was then added to final concentration of 150 micrograms per mL and incubated at 37 C. for an additional three hours. At this point the VLP sample was ready for fractional ammonium sulfate precipitation.
[0081] Fractional precipitation of VLPs was conducted as follows. A saturated ammonium sulfate solution was prepared by adding ammonium sulfate to water to a final concentration of 4.1 M. The saturated ammonium sulfate was added to the enzymatically treated VLPs to a final concentration of 186 mM (approximately a 1:22 dilution) and placed on ice for two hours. Unwanted precipitate was cleared from the lysate by centrifugation at 16,000 g. The sample was then subjected to a second precipitation by the addition of 155 mg of dry ammonium sulfate directly to each mL of cleared lysate. The sample was vortexed and incubated on ice for two hours. The precipitate was spun down at 16,000 g and the solid precipitate was kept and resuspended in one tenth the original volume of 20 mM Tris-HCl, pH 7.0, containing 10 mM NaCl.
Example 3
VLPs Produce Double-Stranded RNA Resistant to RNAse a Treatment
[0082] VLPs comprising MS2 capsids obtained from 100 mL of culture of Example 1 and purified as described in Example 2 were used to produce dsRNA as follows. The amount of capsid used for the experiment was quantified by measuring A.sub.260 and by protein assay using a BCA Assay (Pierce BCA Protein Assay Kit, Thermo Fisher Scientific, Rockford, Ill.). Protein concentration was determined to be 28.50 microgram/microliter. The A.sub.260 value was 123. To test the ability for the RNA inside the capsid to form double-stranded RNA the following amounts of capsids were used: 171, 342, and 684 micrograms.
[0083] Incubation of 40 microliters of the capsids in 10 mM Tris-HCl, 300 mM NaCl, pH 7.0 at 75 C. for 5 minutes released the RNA into solution and which was subsequently allowed to cool to room temperature at a rate of 4 C. per minute. Once this was done the released RNA was treated with 0.400 nanograms of RNAse A (Ambion, Life Technologies, Thermo Fisher Scientific, Rockford, Ill.) and incubated at 37 C. for 40 minutes. To deactivate the RNAse 0.001 units of Proteinase K from Tritirachium album was added and incubated at 37 C. for 40 minutes. The resultant RNA that was protected from RNAse A cleavage by virtue of being in double stranded form, was run on a 1.25% TAE-agarose gel stained with ethidium bromide. See
Example 4
Production of VLP Preparations, One Comprising a Single Stranded RNA ErkA Sense Strand and Another the Complementary ErkA Antisense Strand, Each Flanked by PAC Sequences in Separate Capsids
[0084] A 180 bp fragment of ErkA (SEQ ID NO: 8) from Drosophila melanogaster (GenBank Accession NM_001300706 156-335) was amplified by PCR and subsequently ligated into the AsiSI and PmeI restriction sites of pAPSE10114 (SEQ ID NO: 6) to form pAPSE10120 (SEQ ID NO: 10) which contains the ErkA fragment in the sense orientation relative to the T7 promoter. A separate construct, pAPSE10121 (SEQ ID NO: 11) was similarly produced containing the ErkA fragment in the anti-sense orientation relative to the T7 promoter.
[0085] Chemically competent HTE115 (DE3) cells were separately transformed with pAPSE10120 and pAPSE10121 and individually selected for ampicillin resistant transformants. For capsid production each of these transformants were grown separately at 37 C. in 100 mL LB medium containing ampicillin. When the culture density reached OD.sub.600 0.8, isopropyl -D-thiogalactopyranoside (Gold Biotech, St. Louis, Mo.) was added to a final concentration of 1 mM. Cells were harvested 4 hours post-induction by centrifugation at 3,000 g at 4 C. for 10 minutes. Cells were stored at 4 C. until time of lysis.
Example 5
VLPs of Example 4 are Capable of Producing Double-Stranded RNA Resistant to RNAse a Nuclease Treatment
[0086] VLPs comprising MS2 capsids obtained in Example 4 were purified as described in Example 2 and used to produce dsRNA by the following method. The amount of capsid used for the experiment was quantified by A.sub.260 and by BCA Assay (Pierce BCA Protein Assay Kit, Theinio Fisher Scientific, Rockford, Ill.). Based on these values equal amounts of VLPs from pAPSE10120 and pAPSE10121 were combined to a total of 30 micrograms of encapsidated RNA in a total reaction volume of 40 microliters.
[0087] As an alternative to Trizol extraction of the nucleic acid encapsulated in the VLPs, the following procedure was used to heat-crack the VLPs. Incubation of 40 microliters of the capsids in 10 mM Tris-HCl, 300 mM NaCl, pH 7.0 at 75 C. for 5 minutes released the RNA into solution, and then cooled to room temperature at a rate of 4 C. per minute. Once this was done the released RNA was treated with 0.400 nanograms of RNAse A (Ambion, Life Technologies, Thermo Fischer Scientific, Rockford, Ill.) and incubated at 37 C. for 40 minutes. To deactivate the RNAse A, 0.001 units of Proteinase K from Tritirachium album was added and the solution incubated at 37 C. for an additional 40 minutes. The remaining RNA protected from RNAse A digestion was run on a 1.25% TAE-agarose gel stained with ethidium bromide. See
[0088] The results clearly indicate that when the two VLPs, sense and antisense, are heat-cracked together the single-stranded RNA they contain may form a dsRNA product that corresponds to the 180 bp RNAse A resistant molecule. The same dsRNA product can also be formed from VLPs which encode the sense and antisense strands within a single RNA molecule encapsidated within a VLP. This is a surprising result since a 180 bp dsRNA is predicted to have a length of about 40 nanometers, which is too long to fit within the approximately 20 nanometer diameter interior cavity of MS2. This indicates that the RNA is incorporated into the capsid in a manner which allows for packing of SRPs of significantly larger size in a single capsid.
[0089] Those skilled in the art will understand that plasmids similar to pAPSE10120, pAPSE10121 and pAPSE10131 may easily be constructed to produce sense and anti-sense RNA products of any particular sequence or length by substituting different DNA sequences cloned in place of the ErkA specific sequences described in this Example. Furthermore, those of skill in the art will also understand that producing RNA strands entirely lacking pac site sequences can be accomplished by deleting, or not incorporating, such sequences from expression plasmid pAPSE10114. For example, plasmid pAPSE10114 lacking the sequences between 908-926 and 11199-1137 provides a suitable expression platform to produce RNA molecules lacking pac site sequences.
Example 6
[0090] In-Vitro Production of MS2 Capsids Self-Assembled in Presence of 30 bp dsRNA Flanked by MS2 PAC Sequences
[0091] Thirty bp dsRNA flanked by MS2 pac sequences can be produced in-vitro by simultaneous transcription of 30 nucleotides sense and antisense strands, each having an MS2 pac site at the 3 end. MS2 coat protein is prepared and MS2 capsids are then allowed to spontaneously assemble into VLPs in the solution containing the annealed RNA strands as described in Ashley, et al., [Ashley, et al., ACSNano 6(3):2174-88 (2012)].
[0092] The in-vitro assembled MS2 VLPs are purified by treatment with 100 units of Benzonase Nuclease (Sigma Aldrich, St. Louis, Mo.) and incubation at 37 C. for two hours. One hundred and fifty micrograms of Proteinase K is added and incubated at 37 C. for an additional three hours. At this point the VLPs may be purified by ammonium sulfate precipitation as described in Example 2.
[0093] The purified MS2 capsids are incubated in 10 mM Tris-HCl, 300 mM NaCl, pH 7.0 at 75 C. for 5 minutes to release the RNA into solution and allowed to cool to room temperature at a rate of 4 C. per minute. Once cooled, the released RNA is treated with 0.400 nanograms of RNAse A (Ambion, Life Technologies, Thermo Fischer Scientific, Rockford, Ill.) and incubated at 37 C. for 40 minutes. To deactivate the RNAse A 0.001 units of Proteinase K from Tritirachium album is added and incubated at 37 C. for 40 minutes. The resultant RNA protected from RNAse A cleavage by virtue of being in a double stranded form is electrophoresed on a 1.25% TAE-agarose gel stained with ethidium bromide. A strong band of about 30 bp dsRNA indicates that double-stranded RNA was successfully packaged and purified.
Example 7
[0094] In-Vitro Production of MS2 Capsids Self-Assembled in Presence of 180 bp dsRNA Flanked by MS2 PAC Sequences
[0095] One hundred and eighty bp dsRNA flanked by MS2 pac sequences can be produced in-vitro by simultaneous transcription of 180 nucleotides sense and antisense strands, each having an MS2 pac site at the 3 end. MS2 coat protein is prepared and MS2 capsids are then allowed to spontaneously assemble into VLPs in the solution containing the annealed RNA strands as described in Example 6.
[0096] The in-vitro assembled MS2 VLPs are purified by treatment with 100 units of Benzonase Nuclease (Sigma Aldrich, St. Louis, Mo.) and incubation at 37 C. for two hours. One hundred and fifty micrograms of Proteinase K is added and incubated at 37 C. for an additional three hours. At this point the VLPs may be purified by ammonium sulfate precipitation as described in Example 2.
[0097] The purified MS2 capsids are incubated in 10 mM Tris-HCl, 300 mM NaCl, pH 7.0 at 75 C. for 5 minutes to release the RNA into solution and allowed to cool to room temperature at a rate of 4 C. per minute. Once this is done the released RNA is treated with 0.400 nanograms of RNAse A (Ambion, Life Technologies, Thermo Fischer, Rockford, Ill.) and incubated at 37 C. for 40 minutes. To deactivate the RNAse A, 0.001 units of Proteinase K from Tritirachium album is added and the solution incubated at 37 C. for an additional 40 minutes. The resultant RNA protected from RNAse A cleavage by virtue of being in a double stranded form is electrophoresed on a 1.25% TAE-agarose gel stained with ethidium bromide. Absence of a band of about 180 bp dsRNA indicates that double-stranded RNA was not successfully packaged and purified.
[0098] These results indicate that the methods described here are suitable for packaging and purifying long sense and antisense RNA strand molecules of a type not previously reported in the literature. Such methods facilitate inexpensive large scale RNA synthesis of complex RNA molecules with long stretches of complementary sequence capable of forming double-stranded structures.