Compositions and methods for transient gene therapy with enhanced stability
11753434 · 2023-09-12
Assignee
Inventors
Cpc classification
A61K31/7088
HUMAN NECESSITIES
A61K35/17
HUMAN NECESSITIES
C12N15/101
CHEMISTRY; METALLURGY
A61K48/00
HUMAN NECESSITIES
C12Q2525/185
CHEMISTRY; METALLURGY
C07H21/00
CHEMISTRY; METALLURGY
C12Q2525/173
CHEMISTRY; METALLURGY
A61K31/7105
HUMAN NECESSITIES
C12Q2525/173
CHEMISTRY; METALLURGY
C12Q2525/185
CHEMISTRY; METALLURGY
A61K31/7115
HUMAN NECESSITIES
International classification
C07H21/00
CHEMISTRY; METALLURGY
A61K35/17
HUMAN NECESSITIES
Abstract
The present invention provides circularized RNA and methods of making, purifying, and using same.
Claims
1. A method for producing a chimeric antigen receptor (CAR) T-cell, the method comprising transfecting a T cell with a circularized nucleic acid, wherein the circularized nucleic acid comprises from 5′ to 3′: a 5′ imperfect complement-reverse complement (iCRC) sequence; a 5′ untranslated region (UTR) sequence; an RNA sequence that comprises an open reading frame; a 3′ UTR sequence; and a 3′ iCRC sequence; wherein the 5′ iCRC sequence and the 3′ iCRC sequence have the following characteristics: one or more nucleotide mismatches such that the 5′ iCRC sequence and the 3′ iCRC are not 100% complementary; an annealing temperature (T.sub.a) above about 16° C.; a melting temperature (T.sub.m) below about 37° C.; and wherein the 5′ terminus of the nucleic acid is ligated to its 3′ terminus, thereby producing a circularized nucleic acid, wherein the RNA sequence encodes a chimeric antigen receptor.
2. The method of claim 1, wherein the CAR is specific for an antigen comprising ErbB2 (HER2/neu), carcinoembryonic antigen (CEA), epithelial cell adhesion molecule (EpCAM), epidermal growth factor receptor (EGFR), EGFR variant III (EGFRvIII), CD19, CD20, CD30, CD40, disialoganglioside GD2, ductal-epithelial mucine, gp36, TAG-72, glycosphingolipids, glioma-associated antigen, beta-human chorionic gonadotropin, alphafetoprotein (AFP), lectin-reactive AFP, thyroglobulin, RAGE-1, MN-CA IX, human telomerase reverse transcriptase, RU1, RU2 (AS), intestinal carboxyl esterase, mut hsp70-2, M-CSF, prostate specific antigen (PSA), PAP, NY-ESO-1, LAGA-la, p53, prostein, PSMA, surviving and telomerase, prostate-carcinoma tumor antigen-1 (PCTA-1), MAGE, ELF2M, neutrophil elastase, ephrin B2, CD22, insulin growth factor (IGF1)-I, IGF-II, IGFI receptor, mesothelia, a major histocompatibility complex (MHC) molecule presenting a tumor-specific peptide epitope, 5T4, ROR1, Nkp30, NKG2D, tumor stromal antigens, the extra domain A (EDA) or extra domain B (EDB) of fibronectin or the A1 domain of tenascin-C(TnC A1), fibroblast associated protein (fap), CD3, CD4, CD8, CD24, CD25, CD33, CD34, CD133, CD138, CTLA-4, B7-1 (CD80), B7-2 (CD86), endoglin, a major histocompatibility complex (MHC) molecule, BCMA (CD269, TNFRSF 17), HIV specific antigen, an EBV-specific antigen, a CMV-specific antigen, a Lasse Virus-specific antigen, or an Influenza Virus-specific antigen.
3. The method of claim 1, wherein the RNA is expressed in vivo.
4. The method of claim 1, wherein the nucleic acid comprises A nucleotides, U nucleotides, G nucleotides, and C nucleotides, and wherein one or more of the following conditions apply: one or more of the A nucleotides are modified adenosine analogs; one or more of the U nucleotides are modified uridine analogs; one or more of the G nucleotides are modified guanosine analogs; or one or more of the C nucleotides are modified cytidine analogs.
5. The method of claim 4, wherein one or more of the following conditions apply: all of the A nucleotides are modified; all of the U nucleotides are modified; all of the G nucleotides are modified; or all of the C nucleotides are modified.
6. The method of claim 4, wherein one or more of the following conditions apply: approximately half of the A nucleotides are modified; approximately half of the U nucleotides are modified; approximately half of the G nucleotides are modified; or approximately half of the C nucleotides are modified.
7. The method of claim 4, wherein said modified nucleotide analogs are selected from the group consisting of N.sup.6-methyladenosine, 5-methylcytidine, pseudouridine, 2-thiouridine, N′-methylpseudouridine, and thienoguanosine.
8. The method of claim 1, further comprising converting the 5′ triphosphate of the nucleic acid into a 5′ monophosphate prior to the ligation step.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
DETAILED DESCRIPTION OF THE INVENTION
(16) The invention provides circularized nucleic acids (e.g. RNA), compositions comprising circularized nucleic acids, methods of circularizing nucleic acids, and methods of using circularized nucleic acids. The nucleic acids, compositions, and methods are based upon the previous observation that circularization is more dependent on the proximity and availability of the free ends of the RNA than the size of the RNA construct. Specifically, the present invention is an improvement of the inventions described in WO 2016/197121, the contents of which is incorporated by reference in its entirety.
(17) RNA-based therapy affords benefits of gene therapy while remaining transient. Because RNA may be used as a transient, cytoplasmic expression system, RNA-based therapies can be applied in quiescent and/or slowly proliferating cells (i.e., muscle cells and hepatocytes). However, the instability of RNA, which is largely attributable to exonuclease-mediated degradation, has limited the clinical translation of RNA. In particular, the majority of RNA is degraded by exonucleases acting at both ends or at one end of the molecule after deadenylation and/or decapping. The sub-optimal stability of linear RNA remains an unresolved issue hindering the feasibility of RNA-based therapies. The majority of efforts to stabilize RNA have focused on linear RNA and modification thereof.
(18) Linear RNA is prone to exonuclease degradation from the 5′ to 3′ end and from the 3′ to 5′ end, whereas circularized RNA transcripts have increased serum stability and/or intracellular stability, at least in part because there are no ends available to serve as substrates for exonucleases. However, there are currently no effective methods for producing or purifying large-scale circularized RNA suitable for therapeutic purposes, particularly for sequences that are longer than 0.5 kb.
(19) In contrast, the invention described in WO 2016/197121 possessed several new and advantageous features overcoming prior disadvantages encountered with other methods of creating circularized RNA. Specifically, the invention described in WO 2016/197121 had the following advantages: 1) an optimized method for generating circularized RNA in higher yields than previously obtained; 2) circularized RNA encoding therapeutic proteins; 3) circularized RNA having improved stability (in solution, in cells, and in vivo); 4) longer circularized RNA molecules than previously obtained; 5) use of circularized RNA for therapeutic gene transfer into cells; 6) use of circularized RNA for improved vaccination; and 7) use of circularized non-coding RNA for binding to endogenous target RNAs and/or RNA-binding proteins.
(20) The invention described in WO 2016/197121 was based upon the identification of motifs in the 5′ and 3′ untranslated regions of the transcript that enhance circularization efficiency and/or enable and enhance cap-independent translation. Specifically, it was described that complement-reverse complement (CRC) sequence motifs together with random nucleotides (e.g., nucleotides of a tail sequence) at the 5′ and 3′ ends of a desired RNA facilitates enzymatic circularization of RNA. While CRC sequence motifs greatly enhanced circularization efficiency, an unexpected problem is that the double stranded regions caused by the CRC sequence motifs can be recognized by the pattern recognition receptors of the immune system that consequently lead to decreased translation. To solve this problem, it was discovered that the addition of one or more point mutations into the CRC sequence motif can reduce the melting temperature such that the double stranded motif is intact during ligation temperatures to facilitate the circularization of the RNA but disassociates at body temperature. This provides the benefit of bringing the two ends of the mRNA molecule together (with a retarded off-rate) during ligation while bypassing the immune recognition and translation inhibition the CRC sequence motif can induce once the circular mRNA is introduced into cells.
(21) Circularized RNA
(22) The present invention is based upon 5′ and 3′ motifs that allow highly efficient enzymatic circularization of RNA. Specifically, complement-reverse complement (CRC) sequence motifs with one or more point mutations (i.e., nucleotide mismatches). These modified CRC motifs are referred to herein as imperfect CRCs or iCRCs.
(23) Accordingly, the invention provides a nucleic acid (DNA or RNA) comprising a 5′ imperfect complement-reverse complement (iCRC) sequence; a 5′ untranslated region (UTR) sequence; an RNA sequence (e.g., an open reading frame); a 3′ untranslated region (UTR) sequence; and a 3′ imperfect CRC sequence along with random nucleotides on the distal ends of the imperfect CRC motifs (e.g., 5′ and 3′ tail sequences). For example, in some embodiments, the invention provides a nucleic acid (DNA or RNA) comprising in 5′ to 3′ order: a 5′ iCRC sequence (e.g., a 5′ tail sequence and a 5′ sequence that hybridizes to the 3′ iCRC motif under ligation reaction conditions); a 5′ UTR sequence; an RNA sequence (e.g., an open reading frame); a 3′ UTR sequence; and a 3′ iCRC sequence (e.g., a 3′ sequence that hybridizes to the 5′ iCRC motif under ligation reaction conditions and a 3′ tail sequence). The RNA sequence may be an RNA sequence capable of being translated into a polypeptide; the RNA sequence may comprise an open reading frame; the RNA sequence may be a non-coding RNA, e.g., an RNA that is a reverse complement of an endogenous RNA, i.e., an mRNA, a miRNA, a tRNA, an rRNA, or a lncRNA; or the RNA sequence may be capable of binding to an RNA-binding protein (RBP). When the RNA sequence binds an RBP, the nucleic acid of the present invention prevents the RBP from binding to its canonical linear RNA binding partner.
(24) In a nucleic acid, the 5′ iCRC sequence and/or the 3′ iCRC sequence has one, two, three, four, five, six, seven, eight, nine, ten, or more nucleotide mismatches. In some embodiments, the number of mismatches in the base-pairing of the 5′ iCRC sequence and the 3′ iCRC sequence are such that the sequences are at least 70% and less than 100% complementary. For example, in some embodiments, the 5′ and 3′ iCRC sequences are between 70% and 95%, between 70% and 90%, between 70% and 80%, between 75% and 95%, between 75% and 90%, between 80% and 95%, or between 80% and 90% complementary.
(25) The one or more nucleotide mismatches are such that the 5′ iCRC sequence and the 3′ iCRC are not 100% complementary. The mismatches result in the 5′ iCRC and the 3′ iCRC having an annealing temperature (T.sub.a) less than 25° C. and/or a melting temperature (T.sub.m) greater than 25° C. Preferably, the T.sub.a is above the ligation temperature of about 16° C. and the T.sub.m is below body temperature (about 37° C.). To ensure adequate conditions—that is, that the 5′ iCRC and 3′ iCRC are predominantly annealed at a temperature at which ligation can occur and that the 5′ iCRC and 3′ iCRC are melted when introduced into the body—one preferably operates comfortably within a temperature range of between 16° C. and 37° C. For example, as mentioned above, selecting a T.sub.m of 25° C. will ensure that the 5′ iCRC and 3′ iCRC will be preferentially melted when introduced into the body and exposed to a temperature above 25° C., and that the 5′ iCRC and 3′ iCRC will be preferentially annealed during a ligation reaction performed at 20° C. Accordingly, in some embodiments, ideal melting temperatures for iCRC constructs are between 20° C. and 34° C., more preferably between 23° C. and 30° C., or between 25° C. and 28° C.
(26) Algorithms and methods for calculating T.sub.m are well known in the art and include, without limitation, methods of experimentally determining T.sub.m (e.g., by measuring the absorbance change of the oligonucleotide sequence with its complement as a function of temperature, and determining the halfway point on a plot of absorbance versus time) and methods of theoretically determining Tm (e.g., the nearest neighbors method, as described in: Freier S M, Kierzek R, Jaeger J A, Sugimoto N, Caruthers M H, Neilson T, & Turner D H (1986). Improved free-energy parameters for predictions of RNA duplex stability. Proc Natl Acad Sci, 83, 9373-9377; and Breslauer K J, Frank R, Blocker H, & Marky L A (1986). Predicting DNA duplex stability from the base sequence. Proc Natl Acad Sci, 83, 3746-3750). Additional methods of determining Tm are known in the art, e.g., as described in: Dumousseau M., Rodriguez N., Juty N., Le Novère N. (2012) MELTING, a flexible platform to predict the melting temperatures of nucleic acids. BMC Bioinformatics, 13: 101; on-line at: https://www.ebi.ac.uk/biomodels/tools/melting/; Kibbe W A. ‘OligoCalc: an online oligonucleotide properties calculator’. (2007) Nucleic Acids Res. 35(webserver issue): May 25; and on-line at: http://biotools.nubic.northwestern.edu/OligoCalc.html.
(27) The 5′ or 3′ iCRC sequence comprises 10 to 50 nucleotides, e.g., 10, 20, 30, 40, or 50 nucleotides. In some embodiments, the 5′ and/or 3′ iCRC sequence is selected from Table 1:
(28) TABLE-US-00001 TABLE 1 Imperfect CRC sequence list with predicted Tm′s falling between 25-37° C. SEQ SEQ Melting 5′ CRC (5′.fwdarw.3′ ID 3′CRC (5′.fwdarw.3′ ID CRC # Muta- Temp # Orientation) NO Orientation) NO Length tion (C.) 1 GCACGAATTGCACAA 1 ACTCGAAAGAACAGA 8 30 6 37 TCGGTACGTTCGAGT ATGTACAAATCGTGC 2 GTTACGTACCAACAC 2 ACCGAAGGCATTAAA 9 30 6 36.31 GTTATTGCCGTCGGT GTGATGGAACATAAC 3 GTTACGTACCAACAC 2 ACCGTCGGAAATGAC 10 30 7 35.45 GTTATTGCCGTCGGT GTATTGATTCGTAAC 4 GTTACGTACCAACAC 2 ACCGTCGGAAATCAC 11 30 6 34.97 GTTATTGCCGTCGGT GTATTGATAGGTAAC 5 GCACGAATTGCACAA 1 ACTCAAAAGTAACGA 12 30 6 34.4 TCGGTACGTTCGAGT ATGTGAAATTAGTGC 6 GTTACGTACCAACAC 2 ACCGACAGCAACAAC 13 30 5 33.71 GTTATTGCCGTCGGT CTGCTGGTACATAAC 7 GCACGAATTGCACAA 1 ACTCAAGCGTACTGA 14 30 7 33.48 TCGGTACGTTCGAGT GTGTGGAACTAGTGC 8 GTTACGTACCAACAC 2 ACCGCCGGAAATGAC 15 30 6 32.98 GTTATTGCCGTCGGT GTATTGATAGGTAAC 9 GTTACGTACCAACAC 2 ACCGTCGGAAATTAA 16 30 6 32.42 GTTATTGCCGTCGGT GTATTGATACGTAAC 10 GCACGAATTGCACAA 1 ACTCAAACATAACGA 17 30 6 32.4 TCGGTACGTTCGAGT TAGTGCAAATAGTGC 11 GCACGAATTGCACAA 1 ACTCAAGCGTACGGA 18 30 7 30.8 TCGGTACGTTCGAGT ATGTGGAAATGGTGC 12 GCACGAATTGCACAA 1 ACTCCAACATACAGA 19 30 6 30.45 TCGGTACGTTCGAGT TAGTGCAAATAGTGC 13 GTTACGTACCAACAC 2 ACCGCCGGAAATCAC 20 30 6 30.37 GTTATTGCCGTCGGT GTATTGATAGGTAAC 14 GCACGAATTGCACAA 1 ACTCAAACATACAGA 21 30 6 29.49 TCGGTACGTTCGAGT TAGTGCAAATAGTGC 15 GTTACGTACCAACAC 2 ACCGACAGCAACAAC 22 30 6 27.9 GTTATTGCCGTCGGT TTGCTGCTATGTAAC 16 GCACGAATTGCACAA 1 ACTCAAAAGTAACGA 23 30 7 26.94 TCGGTACGTTCGAGT ATGTGAAAATAGTGC 17 GTTACGTACCAACAC 2 ACCGACAGCAACAAC 24 30 5 25.53 GTTATTGCCGTCGGT CTGCTGCTATGTAAC 18 GCACGAATTGCACAA 3 ACCGATTGAGCTATA 25 20 3 36.04 TCGGT CGTGC 19 TGGCTGCACGAATTG 4 TTGTACAATTCATGCA 26 20 2 36.2 CACAA GCCA 20 GCACGAATTGCACAA 3 ACCGATTGTCCAATCC 27 20 2 35.35 TCGGT GTGC 21 TGGCTGCACGAATTG 4 TTGTGGAATCCGTGG 28 20 3 35.25 CACAA AGCCA 22 GTACGTGGCTGCACG 5 CAATACGTGCCGCCA 29 20 3 35.06 AATTG GGTAC 23 GCACGAATTGCACAA 3 ACCGGTTGTGAAATT 30 20 3 34.54 TCGGT GGTGC 24 TGGCTGCACGAATTG 4 TTGTCCAATTCCTGCA 31 20 2 33.31 CACAA GCCA 25 GCACGAATTGCACAA 3 ACCGCTTATGCACTTC 32 20 3 32.87 TCGGT GTGC 26 GTACGTGGCTGCACG 5 CAATTAGTGAAGCCT 33 20 3 31.99 AATTG CGTAC 27 GCACGAATTGCACAA 3 ACCGACTGTGCCATTG 34 20 3 31.8 TCGGT GTGC 28 GCACGAATTGCACAA 3 ACCGTTTGTTCAATTT 35 20 3 30.61 TCGGT GTGC 29 GCACGAATTGCACAA 3 ACCGGTTGTACAATCC 36 20 3 29.33 TCGGT GTGC 30 GCACGAATTGCACAA 3 ACCGGTTGTCCAATCC 37 20 3 27.42 TCGGT GTGC 31 GCACGAATTGCACAA 3 ACCGAATGAGCTATA 38 20 4 26.57 TCGGT CGTGC 32 GTACGTGGCTGCACG 5 CAATTCATGCATCCAG 39 20 3 26.36 AATTG GTAC 33 TGGCTGCACGAATTG 4 TTGTACACTTCATGCA 40 20 3 26.28 CACAA GCCA 34 GTACGTGGCTGCACG 5 CAATACGTGGAGACA 41 20 4 25.69 AATTG AGTAC 35 GCACGAATTGCACAA 6 TTGTCCAATTCGTGC 42 15 1 26.15 36 TGGCTGCACGAATTG 7 CAATACGTGCAGCCA 43 15 1 36.17 37 GCACGAATTGCACAA 6 TTGTGCAATGCGTGC 44 15 1 35.47 38 TGGCTGCACGAATTG 7 CAATTCGAGCAGCCA 45 15 1 35.19 39 GCACGAATTGCACAA 6 TTGTGSAATTCGTGC 46 15 1 34.39 40 GCACGAATTGCACAA 6 TTGTGCAATACGTGC 47 15 1 34.13 41 GCACGAATTGCACAA 6 TTGTGCAAATCGTGC 48 15 1 33.93 42 TGGCTGCACGAATTG 7 CAATTAGTGCAGCCA 49 15 1 33.39 43 TGGCTGCACGAATTG 7 CAATTCGTGAAGCCA 50 15 1 33.21 44 TGGCTGCACGAATTG 7 CANTCCGTGCAGCCA 51 15 1 31.93 45 GCACGAATTGCACAA 6 TTGTGCAATTAGTGC 52 15 1 31.44 46 GCACGAATTGCACAA 6 TTGTGCGAGTCGTGC 53 15 2 30.22 47 TGGCTGCACGAATTG 7 CAATTCGTACAGCCA 54 15 1 30.08 48 GCACGAATTGCACAA 6 TTGTGCCASTCGTGC 55 15 2 27.94 49 TGGCTGCACGAATTG 7 CAATTCGAGCTGCCA 56 15 2 27.34 50 TGGCTGCACGAATTG 7 CAATTCGCGCTGCCA 57 15 2 25
(29) The nucleic acid further includes a random nucleotide sequence (e.g., a tail sequence) at the 5′ end and the 3′ end. The 5′ random nucleotide sequence (e.g., the 5′ tail sequence) is upstream of the 5′ iCRC sequence, and the 3′ random nucleotide sequence (e.g., the 3′ tail sequence) is downstream of the 3′ iCRC sequence.
(30) The 5′ tail sequence and the 3′ tail sequence are present in the constructs provided herein to facilitate ligation. As would be understood by a person of ordinary skill in the art, ligation (e.g., 5′ to 3′ end ligation) will proceed more rapidly if the ends are within a ligatable distance relative to one another provided that they are not hybridized. As iCRC sequences are hybridized, ligation between a nucleotide of each sequence is not favored. Therefore, flexible ends are provided as tails to permit joining of so-called “free” ends to one another.
(31) In practice, it has been found that these free ends are approximately the same length and substantially non-complementary. In some embodiments, each tail sequence can comprise between 10 and 20 nucleotides if both are of approximately the same length. One can of course achieve the same outcome using tails of differing lengths in a single construct, e.g., by having one short tail and a longer tail having a degree of flexibility sufficient to place the ends in a ligatable proximity relative to one another. The arrangements are too numerous to list individually, but the following lists examples of 5′ and 3′ tail sequences.
(32) In some embodiments, it is preferred that the 5′ tail sequence and the 3′ tail sequence do not hybridize under ligation reaction conditions. Accordingly, in some embodiments, appropriate tail sequences and iCRC sequences should be such that, at the desired ligation temperature, the 5′ and 3′ tail sequences will be preferentially melted while the 5′ and 3′ iCRC sequences will be preferentially annealed. It should be appreciated, however, that ligation temperature considerations may be inconsequential where the 5′ and 3′ tail sequences are substantially non-complementary.
(33) In some embodiments, a DNA template used for in vitro transcription of the constructs described herein includes the 5′ and 3′ tail sequences. Many in vitro transcription methodologies known in the art utilize one of several RNA polymerase enzymes (e.g., T7 RNA polymerase, T3 RNA polymerase, and SP6 RNA polymerase) that require at least one G nucleotide at the 5′-most base position to function as a priming nucleotide to initiate transcription. As such, in some embodiments, the 5′ tail sequence comprises a G nucleotide at the 5′-most base position. With certain of these polymerases, it has been observed that efficiency of transcription is increased greatly when two or three G nucleotides are present at the 5′-most base positions. Accordingly, in some embodiments, the 5′ tail sequence comprises between one and three G nucleotides at the 5′-most base positions.
(34) In some embodiments, the 5′ tail sequence is a sequence of Formula (I):
5′-G.sub.X1-N.sub.X2—
(35) (I), wherein:
(36) each G is independently an unmodified or chemically modified G nucleotide;
(37) each N is independently an unmodified or chemically modified nucleotide;
(38) X1 is an integer from 1 to 3, inclusive; and
(39) X2 is an integer from 1 to 25, inclusive.
(40) In some embodiments, X2 is at least 3; and each N at base positions 1-3 is independently an unmodified or chemically modified A or T/U nucleotide.
(41) In some embodiments, X2 is at least 6; and each N at base positions 4-6 is independently an unmodified or chemically modified C, G, or A nucleotide.
(42) In some embodiments, each N is an unmodified or chemically modified A nucleotide.
(43) In some embodiments, each N is an unmodified or chemically modified C nucleotide.
(44) In some embodiments, the 5′ tail sequence is a sequence of Formula (II):
—[N].sub.X3-3′
(45) (II), wherein:
(46) each [N] is independently an unmodified or chemically modified nucleotide; and
(47) X3 is an integer from 1 to 30, inclusive.
(48) In some embodiments, X3 is at least 3; and each [N] at base positions 1-3 is independently an unmodified or chemically modified C or G nucleotide.
(49) In some embodiments, X3 is at least 6; and each [N] at base positions 4-6 is independently an unmodified or chemically modified A or T/U nucleotide.
(50) In some embodiments, X3 is at least 9; and each [N] at base positions 4-9 is independently an unmodified or chemically modified A or T/U nucleotide.
(51) In some embodiments, each [N] is an unmodified or chemically modified A nucleotide.
(52) In some embodiments, each [N] is an unmodified or chemically modified C nucleotide.
(53) In some embodiments, a 5′ and 3′ tail sequence is selected from Table 2:
(54) TABLE-US-00002 TABLE 2 Tail Sequence List (Regions that flanks CRC motif) SEQ SEQ ID ID # 5′ Tail NO 3′ Tail NO Length 1 GGGAATCGAC 58 CGGAATATAG 70 10 2 GGGAAAAAAA 59 AAAAAAAAAA 71 10 3 GGAAAAAAAA 60 AAAAAAAAAA 71 10 4 GAAAAAAAAA 61 AAAAAAAAAA 71 10 5 GCCCCCCCCC 62 CCCCCCCCCC 72 10 6 GCCCCCCCCC 62 AAAAAAAAAA 71 10 7 GGGAATCGACTACAG 63 CGGAATATAGAAGCA 73 15 8 GGGAAAAAAAAAAAA 64 AAAAAAAAAAAAAAA 74 15 9 GGAAAAAAAAAAAAA 65 AAAAAAAAAAAAAAA 74 15 10 GAAAAAAAAAAAAAA 66 AAAAAAAAAAAAAAA 74 15 11 GCCCCCCCCCCCCCC 67 CCCCCCCCCCCCCCC 75 15 12 GCCCCCCCCCCCCCC 67 AAAAAAAAAAAAAAA 74 15 13 GGGAATCGACTACAG 68 CGGAATATAGAAGCA 76 20 GAGGA TAAGA 14 GGGAAAAAAAAAAAA 69 AAAAAAAAAAAAAAA 77 20 AAAAA AAAAA 15 GGAAAAAAAA 60 AAAAAAAAAAAAAAA 77 20 AAA AA 16 GAAAAAAAAA 61 AAAAAAAAAAAAAAA 77 20 AAAAA 17 GCCCCCCCCC 62 CCCCCCCCCCCCCCC 78 20 CCCCC 18 GCCCCCCCCC 62 AAAAAAAAAAAAAAA 77 20 AAAAA
(55) Each random nucleotide sequence (e.g., a tail sequence) is between about 5 and 50 nucleotides, e.g., 10, 15, 20, or 25 nucleotides.
(56) Rather than having random nucleotide sequences, a nucleic acid may have one or two polyA sequences, with the polyA sequences being upstream of a 5′ CRC and/or downstream of a 3′ CRC and at the nucleic acid's end(s).
(57) Each polyA sequence is between about 5 and 50 nucleotides, e.g., 10, 15, 20, 25, or 30 nucleotides.
(58) Preferred 5′ or 3′ iCRC sequences are exemplified in
(59) The 5′ UTR is any UTR known in the art. For example, the 5′ UTR is polyAx30, polyAx120, HCV, CrPV, EMCV, or polyAx30-EMCV. Preferably, the 5′ UTR is EMCV. Any known 3′ UTR may be used in the present invention; examples include HbB1-PolyAx10, HbB 1, HbB 1x2, or an Elastin-derived 3′ UTR (e.g., a motif from the Elastin 3′ UTR). Preferably, the 3′ UTR is an Elastin-derived 3′ UTR. Multiple tandem copies (e.g., 2, 3, 4, or more) of a UTR may be included in a nucleic acid (e.g., more than one copy of a motif from the Elastin 3′ UTR and more than one copy of the EMCV 5′ UTR). As used herein, the number after an “x” in a UTR's name refers to the number of copies of the UTR (or motif thereof). As an example, an Elastin 3′ UTR (or a motif thereof) that is repeated twice is referred to as Elastinx2 and an Elastin 3′ UTR (or a motif thereof) that is repeated three times is referred to as Elastinx3.
(60) In accordance with the invention, very large target RNA sequences are able to be circularized. In a typical circularized RNA, an open reading frame would encode a single therapeutic protein. In other circularized RNA, however, the open reading frame can encode two or more therapeutic proteins. For example, therapeutically active peptides are intended to be encoded by an open reading frame of an RNA sequence provided herein. Additionally, very large polypeptides are intended to be encoded, for example, those requiring the encoding RNA sequence to be between 15 and 10000 or more nucleotides in length. More typically, the RNA sequence is between 15 and 6000 nucleotides in length, e.g., between 30 and 5000, between 50 and 4000, between 100 and 3000, between 200 and 3000, between 400 and 3000, between 600 and 3000, between 800 and 2000, between 900 and 2000, or between 1000 and 2000.
(61) In some instances, the RNA sequence encodes a much longer molecule, such as a chimeric protein, which would require a much longer open reading frame. In some embodiments, chimeric proteins can include two or more (e.g., 2, 3, 4, 5, or more) therapeutic proteins which can be encoded in a single RNA sequence. However, in some instances, the RNA sequence can encode a relatively small molecule, such as a polypeptide or a therapeutic RNA molecule that does not require translation to provide a therapeutic benefit. As such, the 5′ and 3′ motifs identified by the inventors allow any size target RNA to be circularized. The RNA sequence is at least 15, 30, 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000, 10000, or more nucleotides in length.
(62) The RNA (e.g., mRNA) sequence may encode any protein of interest, for example the target RNA encodes for a hormone, an antibody such as scFv, single-domain antibody (also known as a nanobody), cytokine, intracellular protein, extracellular protein, tumor-associated antigen, chimeric antigen receptor, bacterial antigen, viral antigen, transposase, nuclease, or transcription factor. The RNA may encode a therapeutic polypeptide, e.g., preproinsulin, hypocretin, human growth hormone, leptin, oxytocin, vasopressin, factor VII, factor VIII, factor IX, erythropoietin, G-CSF, alpha-galactosidase A, iduronidase, N-acetylgalactosamine-4-sulfatase, FSH, DNase, tissue plasminogen activator, glucocerebrosidase, interferon alpha, interferon beta, interferon gamma, or IGF-1. The translated protein would have endogenous post-translational modifications and could be retained intracellularly or secreted. The RNA sequence may encode a polypeptide that comprises an epitope for presentation by an antigen presenting cell. The polypeptide may lead to improved (e.g., more efficient and greater quantity) T cell priming, as determined by increased production of IFN-γ, including by proliferating cells.
(63) The RNA sequence may be an RNA that is a reverse complement of an endogenous RNA, i.e., an mRNA, a miRNA, a tRNA, an rRNA, or a lncRNA; by “endogenous” is meant an RNA that is naturally transcribed by a cell. An RNA sequence that is a reverse complement may be referred to as a “non-coding RNA” since it does not encode a polypeptide. When an RNA sequence of the present invention binds an endogenous RNA, the endogenous RNA's function may be blocked or reduced; for example, when the endogenous RNA is an miRNA, the RNA sequence of the present invention prevents the miRNA from binding to its target mRNAs.
(64) The RNA sequence may be capable of binding to an RNA-binding protein (RBP). When the RNA sequence binds an RBP, the nucleic acid of the present invention prevents the RBP from binding to its canonical linear RNA binding partner. Non-limiting examples of RBPs are found at the World Wide Web (www) at rbpdb.ccbr.utoronto.ca.
(65) A circularized nucleic acid will have greater stability (i.e., more resistant to degradation or enzymatic digestion) than a nucleic acid that has a similar sequence (e.g., identical or non-identical) but is non-circularized. The circularized nucleic acid will have greater stability in solution. A circularized nucleic acid will have greater stability in a cell, whether in vitro or in vivo (i.e., in an animal). By “greater stability” is meant a stability increase of 0.01%, 1%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, 1000%, 2000%, 3000%, 4000%, 5000%, 6000%, 7000%, 8000%, 9000%, 10000%, or more or any percentage therebetween. For example, a greater (as defined above) fraction of the starting amount of circularized nucleic acid will remain in a solution or a cell after a certain amount of time when under identical conditions (e.g., temperature and presence/absence of digestive enzymes) than a corresponding non-circularized nucleic acid.
(66) A circularized nucleic acid may provide greater polypeptide translation (e.g., more polypeptide product and more efficient synthesis) relative to a nucleic acid that has a similar sequence (e.g., identical or non-identical) but is non-circularized. Specifically, the circularized nucleic acid according to the present invention provides greater polypeptide translation (e.g., more polypeptide product and more efficient synthesis) relative to a non-circularized nucleic acid having a similar sequence (e.g., identical or non-identical) but has CRC sequence motifs that are 100% complementary (i.e., no nucleotide mismatches). By “greater polypeptide translation” is meant an increase of 0.01%, 1%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, 1000%, 2000%, 3000%, 4000%, 5000%, 6000%, 7000%, 8000%, 9000%, 10000%, or more or any percentage therebetween in the amount of polypeptide produced. For example, a greater (as defined above) number of polypeptides will be synthesized from a molecule of circularized nucleic acid than from a corresponding non-circularized nucleic acid or a nucleic acid circularized using CRC sequence motifs that are 100% complementary.
(67) A nucleic acid may comprise an internal ribosome entry site (IRES). Exemplary IRES sequences are listed at the World Wide Web at iresite.org. Preferably, the IRES is an encephalomyocarditis virus (EMCV) IRES.
(68) A nucleic acid of the present invention may be in a cell (e.g., in vitro or in vitro in a non-human mammal). Non-limiting examples of cells include T cells, B cells, Natural Killer cells (NK), Natural Killer T (NKT) cells, mast cells, eosinophils, basophils, macrophages, neutrophils, dendritic cells, mesenchymal cells, endothelial cells, and epithelial cells.
(69) A circularized nucleic acid of the present invention may be included in a composition, e.g., a pharmaceutical composition suitable for administration to a subject, e.g., a mammal, including a human. The composition may include both a circularized nucleic acid of the present invention and a nucleic acid having a similar sequence (e.g., identical or non-identical) but is non-circularized or a nucleic acid having a dissimilar sequence.
(70) Methods for Circularizing RNA
(71) In some embodiments, the present application provides methods for preparing a circular nucleic acid (e.g., a circular mRNA). For example,
(72) Following in vitro transcription, RNA product 102 comprises a 5′-triphosphate. In some embodiments, it can be desirable to convert the 5′-triphosphate to a 5′-monophosphate. For example, 5′-monophosphate groups are generally preferable to 5′-triphosphates for efficient 5′ to 3′ ligation reactions. Accordingly, as shown, the 5′-triphosphate of RNA product 102 is converted to 5′-monophosphate RNA 103. In some embodiments, the conversion to 5′-monophosphate is accomplished chemically. In some embodiments, the conversion to 5′-monophosphate is accomplished enzymatically.
(73) In some embodiments, the conversion to 5′-monophosphate is accomplished in a two-step process comprising: (A) dephosphorylation; and (B) phosphorylation. In some embodiments, this two-step process can be accomplished enzymatically, e.g., through the activity of a phosphatase in (A) to remove the triphosphate and a kinase in (B) to add a 5′-monophosphate. Suitable phosphatase and kinase enzymes are known in the art and described elsewhere herein.
(74) In some embodiments, the conversion to 5′-monophosphate is accomplished in a single-step process using a single enzyme that catalyzes the conversion of a 5′-polyphosphate to a 5′-monophosphate. For example, in some embodiments, the single enzyme is an enzyme whose activity cleaves a pyrophosphate group from a 5′-triphosphate, such as RppH or apyrase. Following 5′-phosphate modification, 5′-monophosphate RNA is enzymatically ligated (e.g., using a T4 RNA ligase) to produce mixed population 104 that comprises circular RNA and remnant linear RNA.
(75) It is often desirable to obtain sufficiently pure circular RNA that minimizes or eliminates remnant linear RNA of mixed population 104. For example, remnant linear RNA and any other by-products and impurities can detrimentally affect use of the circular RNA as a therapeutic—e.g., through induction of innate immune response. Additionally, RNA molecules having longer sequences and/or extensive secondary structure can diminish ligation efficiency such that the amount of remnant linear RNA exceeds that of the circular RNA. Accordingly, aspects of the present application relate to methods of preparing purified circular RNA 105 by purifying mixed population 104 comprising linear and circular nucleic acids.
(76) In some embodiments, mixed population 104 is purified by gel chromatography. In some embodiments, mixed population 104 is purified by column chromatography. In some embodiments, mixed population 104 is purified by HPLC. In some embodiments, mixed population 104 is purified by ion-pair reversed-phase HPLC. In some embodiments, mixed population 104 is purified by (i) contacting mixed population 104 to a purification column; (ii) eluting purified circular nucleic acid 105 by passing a liquid through the purification column; and (iii) collecting an eluate comprising purified circular nucleic acid 105. In some embodiments, the purification column comprises a stationary phase having a plurality of microspheres. In some embodiments, the plurality of microspheres comprise a polystyrene-divinylbenzene copolymer.
(77) Circular RNA purified in accordance with the techniques described herein, in some embodiments, is in a preparation that is substantially free of linear RNA. For example, the circular RNA may have trace amounts of linear RNA, which would not be expected to illicit a detrimental immune response in a therapeutic application. In some embodiments, a circular RNA that has been purified by a method described herein is in a composition comprising trace amounts of its linear form up to about 15% of its linear form (or fragments thereof. In some embodiments, the circular RNA composition comprises between about 0.1% and 10%, between about 0.5% and 5%, between about 0.5% and 1%, between about 1% and 5%, between about 0.1% and 1%, between about 0.1% and 0.5%, between about 0.01% and 0.1%, or between about 0.05% and 0.1% of its linear form (or fragments thereof).
(78)
(79) As shown, mixed population 114 is subjected to an enzymatic digestion using an exonuclease that selectively degrades remnant linear RNA of mixed population 114 to produce digested mixed population 115 comprising circular RNA and impurities (e.g., degradation products, exonuclease, remnant linear RNA). While it is appreciated that RNase R is used in this example, many exonucleases suitable for degrading remnant linear RNA are known in the art and described elsewhere herein. Digested mixed population 115 is subjected to HPLC purification to obtain purified circular RNA 116.
(80)
(81) As shown, mixed population 124 is subjected to a poly(A) tailing reaction to produce tailed mixed population 125 comprising circular RNA and tailed remnant linear RNA. The activity of poly(A) polymerase requires a free 3′ terminal end for polyadenylation to occur. In this example, circular RNA cannot be modified by polyadenylation because the enzymatic ligation is accomplished by 5′ to 3′ end ligation such that these ends are unavailable for modification by poly(A) polymerase.
(82) In some embodiments, selective polyadenylation of remnant linear RNA increases the efficiency of its subsequent degradation in a mixed sample. For example, as shown, tailed mixed population 125 is subjected to an enzymatic digestion using an exonuclease (e.g., RNase R) that selectively degrades remnant linear RNA of tailed mixed population 125 to produce digested mixed population 126 comprising circular RNA and impurities (e.g., degradation products, exonuclease, remnant linear RNA).
(83) As described elsewhere herein (see, e.g., Example 9 and
(84) While it is appreciated that steps of polyadenylation and RNase R digestion are used in the example process illustrated in
(85) For example, a selective modification of a molecule can be made for the purpose of affecting its movement on a column relative to the unmodified molecule. Such modifications can include, by way of example and not limitation, size modifications and charge modifications which increase chromatographic separation of one RNA form relative to the other, and capture moiety modifications which permit selective capture of one form over the other.
(86) Size modifications can be made to permit separation based on a difference in size between one RNA form relative to another, e.g., by size exclusion chromatography and other purification techniques which discriminate based on size. Examples of size modifications include selectively increasing the size of one RNA form relative to another. This can be accomplished by any number of means known to a practitioner, including selectively ligating one or more molecules to a linear RNA (e.g., through the action of a poly(A) polymerase), selectively ligating a nucleic acid to a linear RNA (e.g., through the action of a ligase), selectively coupling a protein element to either RNA form (e.g., by chemical coupling means), and selectively annealing a nucleic acid to either form (e.g., using an oligonucleotide that anneals across the splice junction of a circular form). Size modifications further include those which selectively decrease the size of one RNA form relative to another. Typically, such methods can involve some form of degradation and would therefore involve selective modification of the linear RNA form. For example, selective degradation can be accomplished by means which discriminate based on the available free ends of the linear form (e.g., enzymatic degradation via an exonuclease).
(87) Charge modifications can be made to permit separation based on a difference in molecular charge of one form relative to another, e.g., by ion chromatography or by electrophoresis. This can be accomplished by selectively ligating or annealing a charged molecule to one RNA form. As one example, nucleic acids are generally negatively charged under neutral conditions. Therefore, selectively ligating or annealing a nucleic acid to one form of RNA would be expected to decrease the overall charge of that form. As should be appreciated, such modifications would invariably result in the modified form being more attracted to a positive charge, thereby providing a means of separation by charge.
(88) Capture moiety modifications can be made to permit selective capture of one form over another. A process for selective capture can generally involve: (i) selectively modifying either circular or linear RNA of a mixed population with a capture moiety; and (ii) capturing the capture moiety-modified RNA by contacting the mixed population with a binding partner of the capture moiety. For example, in some embodiments, a capture moiety is annealed or ligated to one form of RNA in a mixed population, and a binding partner of the capture moiety is contacted to the mixed population.
(89) The capture moiety can be in the form of a capture moiety-modified nucleotide, for example, which can be ligated to one or both terminal ends of a linear RNA (e.g., polyadenylating linear RNA in the presence of biotin-modified adenosine). Alternatively, a capture moiety-modified primer could be designed to preferably anneal to one RNA form under a particular set of conditions. By way of example and not limitation, a capture moiety-modified primer comprising a sequence that is complementary to a region bridging the splice junction of a circular RNA could be used to preferably anneal to—and capture—the circular RNA.
(90) Additional capture moiety modifications include ligating one or more nucleotides to the linear form, and capturing the linear form using an oligonucleotide probe that selectively anneals to the one or more nucleotides ligated thereon. For example, polyadenylation of a linear form produces a poly(A) tail, which functions as a capture sequence for binding to a poly(T) nucleic acid probe.
(91) A capture moiety and/or binding partner can comprise, for example, biotin, avidin, streptavidin, digoxigenin, inosine, avidin, GST sequences, modified GST sequences, biotin ligase recognition (BiTag) sequences, S tags, SNAP-tags, enterokinase sites, thrombin sites, antibodies or antibody domains, antibody fragments, antigens, receptors, receptor domains, receptor fragments, or combinations thereof.
(92) Capture can be performed using any of a variety of techniques known in the art. For example, a capture moiety comprising an affinity purification tag could permit capture via passage through an affinity column. and preferably bound by the solid phase. Alternatively, a capture moiety comprising a charged moiety could permit capture by electrophoretic means or column chromatography. As an additional example, a capture moiety comprising a paramagnetic bead could permit capture by applying a magnetic field to the sample.
(93) The nucleic acid comprising the RNA sequence to be circularized can be produced by methods known in the art.
(94) For example, primers can be designed to generate PCR templates suitable for in vitro transcription (IVT), for example by T7, T3, or S6 RNA polymerase. Preferably, the primers are designed with the following motifs:
(95) Forward primer: {RNA polymerase promoter sequence-5′-(random nucleotides, e.g., of a tail sequence)-(5′ iCRC sequence)-(desired 5′ UTR)-(1.sup.st 20 nucleotides of desired RNA CDS)-3′}
(96) Reverse primer: {5′-(random nucleotides, e.g., of a tail sequence)-(3′iCRC sequence)-(reverse complement of desired 3′ UTR)-(reverse complement of last 20 nucleotides of desired RNA CDS)-3′}
(97) Circularized RNA is produced by transcription of the PCR products generated with the above primers, or another set of primers, to produce RNA. Circularized RNA may also be produced by transcription of a plasmid or a fragment thereof to produce RNA. The synthesized RNA is then treated to produce a 5′ monophosphate RNA. For example, 5′ monophosphate RNA is produced by treating the RNA with RNA 5′ pyrophosphohydrolase (RppH) or an ATP diphosphohydrolase.
(98) The 5′ monophosphate RNA is then enzymatically circularized for example with an RNA ligase such as T4 RNA ligase.
(99) A nucleic acid of the present invention, which is non-circularized, may be circularized by ligating its 5′ terminus to its 3′ terminus. Ligating may be enzymatic, e.g., by a ligase. Preferably, the ligase is T4 RNA ligase.
(100) Prior to ligation, a non-circularized nucleic acid is contacted with a phosphatase, e.g., RNA 5′ pyrophosphohydrolase (RppH) or an ATP diphosphohydrolase, to produce a 5′ monophosphate RNA. Alternately, a non-circularized nucleic acid is contacted with a phosphatase, e.g., Antarctic Phosphatase, Shrimp Alkaline Phosphatase, and Calf Intestinal Phosphatase, and then contacted with a kinase, e.g., Polynucleotide Kinase.
(101) A nucleic acid may undergo multiple (e.g., two, three, four, five, or more) rounds of ligation, thereby ensuring that the majority of nucleic acids, in a sample, is circularized, e.g., about 100%, about 90%, about 80%, about 70%, about 60%, about 51%, or any amount therebetween.
(102) Optionally, non-circularized (i.e., linear) RNA is removed using an exonuclease to digest the linear RNA, e.g., RNase R, Exonuclease T, λ Exonuclease, Exonuclease I, Exonuclease VII, T7 Exonuclease, or XRN-1. Preferably, the exonuclease is RNase R and/or XRN-1.
(103) Methods for Purifying Circularized RNA
(104) The established method for isolating circular mRNA (Beaudry, 1995) is limited in terms of yield and the size of RNA that can be isolated. Most significantly, the mRNA isolated from a denaturing PAGE gel is not suitable for translation.
(105) Accordingly, another aspect of the invention provides a novel method of isolating pure circular mRNA. The unpurified circular mRNA sample is exposed to poly(A) polymerase, followed by HPLC. Exonuclease-mediated degradation of residual linear mRNA may be performed prior to HPLC. This method (e.g., HPLC) has the added benefit of removing impurities present in in vitro transcribed mRNA samples.
(106) Specifically, prior to HPLC, the unpurified circular mRNA sample is treated with a polyadenylase that adds a ˜100-200 nucleotide poly(A) tail to linear mRNA only, as circular mRNA does not have a free 3′ end to which the enzyme could make additions. This method allows efficient separation of the circular and linear forms of mRNA when the samples are run on an RNAsep HPLC column.
(107) Alternatively, another aspect of the invention provides a second novel method of isolating pure circular mRNA. The unpurified circular mRNA sample is exposed to a second nucleic acid of different length along with a splint and a ligase, followed by HPLC.
(108) Specifically, prior to HPLC, the unpurified circular mRNA sample is treated with a ligase in the presence of a nucleic acid that is considerably longer or shorter than the sequence that was circularized. The second sequence may be DNA or RNA. The splint across the first and second sequence may be DNA or RNA. The ligase may be a T4 RNA ligase. The second sequence may have a length that is at least 100 nucleotides different than the first sequence. The second sequence may have a length that is between about 100 and 5000 nucleotides different than the first sequence. For example, the second sequence may have a length that is between 200 and 4000, between 300 and 3000, between 400 and 2500, between 500 and 2000, between 600 and 2500, between 700 and 2000, between 800 and 1500, between 900 and 1000 nucleotides different than the first sequence. The second sequence may have a length that is 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1500, 2000, 2500, 3000, 4000, or 5000 nucleotides different than the first sequence. The ligase will ligate the second sequence to linear mRNA only, as circular mRNA does not have free 5′ or 3′ ends to which the enzyme could make additions. While the splint should enrich for ligation between the first and second sequences over ligation between two or more molecules of the second sequence, use of a second sequence that contains a 5′ hydroxyl will render it incompetent to ligation such that the second sequence will have to be ligated via its 3′ end to a 5′ monophosphate on the residual linear molecules of the first sequence. Performance of an additional round of ligation with ligase may even increase the yield of circular RNA. The splint may have a length of 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, or 200 nucleotides. This method allows efficient separation of the circular and linear forms of mRNA when the samples are run on an RNAsep HPLC column.
(109) Methods of Using Circularized RNA
(110) The circularized RNA produced according to the methods of the invention are useful in gene therapy. In particular, the circularized RNA is useful for protein replacement therapy or in the production of RNA-based vaccines for an array of antigens. For example, the circularized RNA (e.g., mRNA) can encode tumor-associated antigens useful as cancer vaccines. In another aspect, the circularized RNA (e.g., mRNA) can encode a bacterial or viral antigen to prevent or alleviate a symptom of a bacterial or viral infection, e.g., as a vaccine. Additional embodiments include use of circularized RNA for use in cancer immunotherapies, infectious disease vaccines, genome engineering, genetic reprogramming, and protein-replacement/supplementation therapies.
(111) Alternatively, the circularized RNA (e.g., mRNA) can encode a chimeric antigen receptor and be used to create a chimeric antigen receptor T cell useful in immunotherapy. Chimeric antigen receptors (CARs) comprise binding domains derived from natural ligands or antibodies specific for cell-surface antigens, genetically fused to effector molecules such as the TCR alpha and beta chains, or components of the TCR-associated CD3 complex. Upon antigen binding, such chimeric antigen receptors link to endogenous signaling pathways in the effector cell and generate activating signals similar to those initiated by the TCR complex. A CAR typically has an intracellular signaling domain, a transmembrane domain, and an extracellular domain.
(112) The transmembrane and/or intracellular domain may include signaling domains from CD8, CD4, CD28, 4-1BB, OX40, ICOS, and/or CD3-zeta. The transmembrane domain can be derived either from a natural or from a synthetic source. The transmembrane domain can be derived from any membrane-bound or transmembrane protein.
(113) The transmembrane domain may further include a stalk region positioned between the extracellular domain (e.g., extracellular ligand-binding domain) and the transmembrane domain. The term “stalk region” used herein generally means any oligo- or polypeptide that functions to link the transmembrane domain to the extracellular ligand-binding domain. In particular, stalk region are used to provide more flexibility and accessibility for the extracellular ligand-binding domain. A stalk region may comprise up to 300 amino acids, preferably 10 to 100 amino acids and most preferably 25 to 50 amino acids. Stalk region may be derived from all or part of naturally occurring molecules, such as from all or part of the extracellular region of CD8, CD4, or CD28, or from all or part of an antibody constant region. Alternatively, the stalk region may be a synthetic sequence that corresponds to a naturally occurring stalk sequence, or may be an entirely synthetic stalk sequence. In a preferred embodiment said stalk region is a part of human CD8 alpha chain.
(114) The signal transducing domain or intracellular signaling domain of the CAR of the invention is responsible for intracellular signaling following the binding of extracellular ligand binding domain to the target resulting in the activation of the immune cell and immune response. In other words, the signal transducing domain is responsible for the activation of at least one of the normal effector functions of the immune cell in which the CAR is expressed. For example, the effector function of a T cell can be a cytolytic activity or helper activity including the secretion of cytokines. Thus, the term “signal transducing domain” refers to the portion of a protein which transduces the effector signal function signal and directs the cell to perform a specialized function. Signal transduction domain comprises two distinct classes of cytoplasmic signaling sequence, those that initiate antigen-dependent primary activation, and those that act in an antigen-independent manner to provide a secondary or co-stimulatory signal. Primary cytoplasmic signaling sequence can comprise signaling motifs which are known as immunoreceptor tyrosine-based activation motifs of ITAMs. ITAMs are well defined signaling motifs found in the intracytoplasmic tail of a variety of receptors that serve as binding sites for syk/zap70 class tyrosine kinases. Examples of ITAM used in the invention can include as non-limiting examples those derived from TCR zeta, FcR gamma, FcR beta, FcR epsilon, CD3 gamma, CD3 delta, CD3 epsilon, CD5, CD22, CD79a, CD79b and CD66d. In a preferred embodiment, the signaling transducing domain of the CAR can comprise the CD3 zeta signaling domain, or the intracytoplasmic domain of the Fc epsilon RI beta or gamma chains.
(115) The CAR may further include one or more additional costimulatory molecules positioned between the transmembrane domain and the intracellular signaling domain, to further augment potency. Examples of costimulatory molecules include CD27, CD28, CD8, 4-1BB (CD137), OX40, CD30, CD40, PD-1, ICOS, lymphocyte function-associated antigen-1 (LFA-1), CD2, CD7, LIGHT, NKG2C, B7-H3 and a ligand that specifically binds with CD83 and the like. In some embodiments the intracellular signaling domain contains 2, 3, 4, or more costimulatory molecules in tandem.
(116) The extracellular domain may include an antibody such as a Fab, a scFV, or a single-domain antibody (sdAb also known as a nanobody) and/or may include another polypeptide described herein. In a preferred embodiment, said extracellular ligand-binding domain is a single chain antibody fragment (scFv) comprising the light (VL) and the heavy (VH) variable fragment of a target antigen specific monoclonal antibody joined by a flexible linker. Other binding domain than scFv can also be used for predefined targeting of lymphocytes, such as camelid single-domain antibody fragments (which are examples of an sdAb) or receptor ligands, antibody binding domains, antibody hypervariable loops or CDRs as non-limiting examples.
(117) As non-limiting examples, the antigen of the CAR can be a tumor-associated surface antigen, such as ErbB2 (HER2/neu), carcinoembryonic antigen (CEA), epithelial cell adhesion molecule (EpCAM), epidermal growth factor receptor (EGFR), EGFR variant III (EGFRvIII), CD19, CD20, CD30, CD40, disialoganglioside GD2, ductal-epithelial mucine, gp36, TAG-72, glycosphingolipids, glioma-associated antigen, beta-human chorionic gonadotropin, alphafetoprotein (AFP), lectin-reactive AFP, thyroglobulin, RAGE-1, MN-CA IX, human telomerase reverse transcriptase, RU1, RU2 (AS), intestinal carboxyl esterase, mut hsp70-2, M-CSF, prostate specific antigen (PSA), PAP, NY-ESO-1, LAGA-1a, p53, prostein, PSMA, surviving and telomerase, prostate-carcinoma tumor antigen-1 (PCTA-1), MAGE, ELF2M, neutrophil elastase, ephrin B2, CD22, insulin growth factor (IGF1)-I, IGF-II, IGFI receptor, mesothelia, a major histocompatibility complex (MHC) molecule presenting a tumor-specific peptide epitope, 5T4, ROR1, Nkp30, NKG2D, tumor stromal antigens, the extra domain A (EDA) and extra domain B (EDB) of fibronectin and the A1 domain of tenascin-C (TnC A1) and fibroblast associated protein (fap); a lineage-specific or tissue specific antigen such as CD3, CD4, CD8, CD24, CD25, CD33, CD34, CD133, CD138, CTLA-4, B7-1 (CD80), B7-2 (CD86), endoglin, a major histocompatibility complex (MHC) molecule, BCMA (CD269, TNFRSF 17), or a virus-specific surface antigen such as an HIV-specific antigen (such as HIV gp120); an EBV-specific antigen, a CMV-specific antigen, a HPV-specific antigen, a Lasse Virus-specific antigen, an Influenza Virus-specific antigen as well as any derivate or variant of these surface markers.
(118) A circularized nucleic acid of the present invention may encode a CAR and may be transfected or infected into a T-cell using any technique known in the art. A T cell that expresses the CAR is referred to as a chimeric T cell receptor cell (CART). The CART will express and bear on the cell surface membrane the chimeric antigen receptor encoded by the RNA sequence of a circularized nucleic acid of the present invention.
(119) The present invention includes a nucleic acid encoding a CAR, methods for preparing a nucleic acid encoding a CAR, compositions comprising a nucleic acid encoding a CAR, methods for producing a CART, methods for treating a diseases using a CART, an isolated CART, and non-human mammals comprising a CART.
(120) Any of the herein-described aspects or embodiments can be combined with any other aspect or embodiment described herein.
Definitions
(121) The term “nucleotide” refers to a ribonucleotide or a deoxyribonucleotide or modified form thereof, as well as an analog thereof. Nucleotides include species that comprise purines, e.g., adenine, hypoxanthine, guanine, and their derivatives and analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives and analogs.
(122) Nucleotide analogs include nucleotides having modifications in the chemical structure of the base, sugar and/or phosphate, including, but not limited to, 5-position pyrimidine modifications, 8-position purine modifications, modifications at cytosine exocyclic amines, and substitution of 5-bromo-uracil; and 2′-position sugar modifications, including but not limited to, sugar-modified ribonucleotides in which the 2′-OH is replaced by a group such as an H, OR, R, halo, SH, SR, NH2, NHR, NR2, or CN, wherein R is an alkyl moiety as defined herein. Nucleotide analogs are also meant to include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 2′-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, phosphorothioates and peptides.
(123) Modified bases refer to nucleotide bases such as, for example, adenine, guanine, cytosine, thymine, and uracil, xanthine, inosine, and qucuosine that have been modified by the replacement or addition of one or more atoms or groups. Some examples of types of modifications that can comprise nucleotides that are modified with respect to the base moieties, include but are not limited to, alkylated, halogenated, thiolated, aminated, amidated, or acetylated bases, individually or in combination. More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 6-methyladenine, 6-methylguanine, N,N,-dimethyladenine, 2-propyladenine, 2-propylguanine, 2-aminoadenine, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2-amino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 1-methyladenosine, 2-methyladenosine, 3-methylcytidine, 6-methyluridine, 2-methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5-methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza-adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, other thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any O- and N-alkylated purines and pyrimidines such as N6-methyladenosine, 5-methylcarbonylmethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, pyridine-2-one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines and guanines, 5-substituted uracils and thymines, azapyrimidines, carboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyl nucleotides, and alkylcarbonylalkylated nucleotides. Modified nucleotides also include those nucleotides that are modified with respect to the sugar moiety, as well as nucleotides having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties may be, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4′-thioribose, and other sugars, heterocycles, or carbocycles. The term nucleotide is also meant to include what are known in the art as universal bases. By way of example, universal bases include but are not limited to 3-nitropyrrole, 5-nitroindole, or nebularine. The term “nucleotide” is also meant to include the N3′ to P5′ phosphoramidate, resulting from the substitution of a ribosyl 3′ oxygen with an amine group. Preferably, the modified base is 5-methylcytidine (5mC).
(124) Further, the term nucleotide also includes those species that have a detectable label, such as for example a radioactive or fluorescent moiety, or mass label attached to the nucleotide.
(125) The term “nucleic acid” and “polynucleotide” are used interchangeably herein to describe a polymer of any length, e.g., greater than about 2 bases, greater than about 10 bases, greater than about 100 bases, greater than about 500 bases, greater than 1000 bases, up to about 10,000 or more bases composed of nucleotides, e.g., deoxyribonucleotides or ribonucleotides, and may be produced enzymatically or synthetically (e.g., PNA as described in U.S. Pat. No. 5,948,902 and the references cited therein) which can hybridize with naturally occurring nucleic acids in a sequence specific manner analogous to that of two naturally occurring nucleic acids, e.g., can participate in Watson-Crick base pairing interactions. Naturally occurring nucleotides include guanine, cytosine, adenine and thymine (G, C, A and T, respectively).
(126) The terms “ribonucleic acid” and “RNA” as used herein mean a polymer composed of ribonucleotides.
(127) As used herein, the terms “mRNA” and “RNA” may be synonyms.
(128) The terms “deoxyribonucleic acid” and “DNA” as used herein mean a polymer composed of deoxyribonucleotides. “Isolated” or “purified” generally refers to isolation of a substance (compound, polynucleotide, protein, polypeptide, polypeptide composition) such that the substance comprises a significant percent (e.g., greater than 1%, greater than 2%, greater than 5%, greater than 10%, greater than 20%, greater than 50%, or more, usually up to about 90%-100%) of the sample in which it resides. In certain embodiments, a substantially purified component comprises at least 50%, 80%-85%, or 90-95% of the sample. Techniques for purifying polynucleotides and polypeptides of interest are well-known in the art and include, for example, ion-exchange chromatography, affinity chromatography and sedimentation according to density. Generally, a substance is purified when it exists in a sample in an amount, relative to other components of the sample, that is not found naturally.
(129) The term “oligonucleotide”, as used herein, denotes a single-stranded multimer of nucleotides from about 2 to 500 nucleotides, e.g., 2 to 200 nucleotides. Oligonucleotides may be synthetic or may be made enzymatically, and, in some embodiments, are 4 to 50 nucleotides in length. Oligonucleotides may contain ribonucleotide monomers (i.e., may be RNA oligonucleotides) or deoxyribonucleotide monomers. Oligonucleotides may be 5 to 20, 11 to 30, 31 to 40, 41 to 50, 51-60, 61 to 70, 71 to 80, 80 to 100, 100 to 150 or 150 to 200, up to 500 nucleotides in length, for example.
(130) The term “duplex” or “double-stranded” as used herein refers to nucleic acids formed by hybridization of two single strands of nucleic acids containing complementary sequences. In most cases, genomic DNA is double-stranded.
(131) The term “complementary” as used herein refers to a nucleotide sequence that base-pairs by non-covalent bonds to a target nucleic acid of interest. In the canonical Watson-Crick base pairing, adenine (A) forms a base pair with thymine (T), as does guanine (G) with cytosine (C) in DNA. In RNA, thymine is replaced by uracil (U). As such, A is complementary to T and G is complementary to C. In RNA, A is complementary to U and vice versa. Typically, “complementary” refers to a nucleotide sequence that is at least partially complementary. The term “complementary” may also encompass duplexes that are fully complementary such that every nucleotide in one strand is complementary to every nucleotide in the other strand in corresponding positions. In certain cases, a nucleotide sequence may be partially complementary to a target, in which not all nucleotide is complementary to every nucleotide in the target nucleic acid in all the corresponding positions, that is having one or more nucleotide mismatches.
(132) As defined herein, “RNA ligase” means an enzyme or composition of enzyme that is capable of catalyzing the joining or ligating of an RNA acceptor oligonucleotide, which has an hydroxyl group on its 3′ end, to an RNA donor, which has a 5′ phosphate group on its 5′ end. The invention is not limited with respect to the RNA ligase, and any RNA ligase from any source can be used in an embodiment of the methods and kits of the present invention. For example, in some embodiments, the RNA ligase is a polypeptide (gp63) encoded by bacteriophage T4 gene 63; this enzyme, which is commonly referred to simply as “T4 RNA ligase,” is more correctly now called “T4 RNA ligase 1” since Ho, C K and Shuman, S (Proc. Natl. Acad. Sci. USA 99: 12709-12714, 2002) described a second RNA ligase (gp24.1) that is encoded by bacteriophage T4 gene 24.1, which is now called “T4 RNA ligase 2.” Unless otherwise stated, when “T4 RNA ligase” is used in the present specification, is meant “T4 RNA ligase 1”. For example, in some other embodiments, the RNA ligase is a polypeptide derived from or encoded by an RNA ligase gene from bacteriophage TS2126, which infects Thermus scotoductus, as disclosed in U.S. Pat. No. 7,303,901 (i.e., bacteriophage TS2126 RNA ligase).
(133) Linear nucleic acid molecules are said to have a “5′-terminus” (5′ end) and a “3′-terminus” (3′ end) because nucleic acid phosphodiester linkages occur at the 5′ carbon and 3′ carbon of the sugar moieties of the substituent mononucleotides. The end of a polynucleotide at which a new linkage would be to a 5′ carbon is its 5′ terminal nucleotide. The end of a polynucleotide at which a new linkage would be to a 3′ carbon is its 3′ terminal nucleotide. A terminal nucleotide, as used herein, is the nucleotide at the end position of the 3′- or 5′-terminus
(134) “Transcription” means the formation or synthesis of an RNA molecule by an RNA polymerase using a DNA molecule as a template. The invention is not limited with respect to the RNA polymerase that is used for transcription. For example, a T7-type RNA polymerase can be used.
(135) “Translation” means the formation of a polypeptide molecule by a ribosome based upon an RNA template.
(136) “Melting temperature” (T.sub.m) is defined as the temperature at which half of the DNA strands are in the random coil or single-stranded (ssRNA) state.
(137) “Annealing temperature” (T.sub.a) is defined as the temperature in which single-stranded nucleic acids associate such that double-stranded molecules are formed, often by heating and cooling.
(138) It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting. As used in this specification and the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “a cell” includes combinations of two or more cells, or entire cultures of cells; reference to “a polynucleotide” includes, as a practical matter, many copies of that polynucleotide. Unless specifically stated or obvious from context, as used herein, the term “or” is understood to be inclusive. Unless defined herein and below in the reminder of the specification, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains.
(139) Unless specifically stated or obvious from context, as used herein, the term “about”, is understood as within a range of normal tolerance in the art, for example within 2 standard deviations of the mean. “About” can be understood as within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, 0.1%, 0.09%, 0.08%, 0.07%, 0.06%, 0.05%, 0.04%, 0.03%, 0.02%, or 0.01% of the stated value. Unless otherwise clear from the context, all numerical values provided herein are modified by the term “about.”
(140) A “subject” in the context of the present invention is preferably a mammal. The mammal can be a human, non-human primate, mouse, rat, dog, cat, horse, or cow, but are not limited to these examples.
(141) As used herein, the term “encode” refers broadly to any process whereby the information in a polymeric macromolecule is used to direct the production of a second molecule that is different from the first. The second molecule may have a chemical structure that is different from the chemical nature of the first molecule.
(142) For example, in some aspects, the term “encode” describes the process of semi-conservative DNA replication, where one strand of a double-stranded DNA molecule is used as a template to encode a newly synthesized complementary sister strand by a DNA-dependent DNA polymerase. In other aspects, a DNA molecule can encode an RNA molecule (e.g., by the process of transcription that uses a DNA-dependent RNA polymerase enzyme). Also, an RNA molecule can encode a polypeptide, as in the process of translation. When used to describe the process of translation, the term “encode” also extends to the triplet codon that encodes an amino acid. In some aspects, an RNA molecule can encode a DNA molecule, e.g., by the process of reverse transcription incorporating an RNA-dependent DNA polymerase. In another aspect, a DNA molecule can encode a polypeptide, where it is understood that “encode” as used in that case incorporates both the processes of transcription and translation.
(143) The invention will be further illustrated in the following non-limiting examples.
EXAMPLES
Example 1
Circularized RNA Synthesis
(144) RNA was synthesized using the HiScribe T7 High Yield RNA Synthesis Kit (NEB, #E2040S) according to manufacturer's instructions. 500-1000 ng of PCR product encoding the desired RNA sequence was used as template in these in vitro transcription (IVT) reactions. Synthesized RNA was then treated with RNA 5′ Pyrophosphohydrolase, or RppH, (NEB, #M0356S) to provide the 5′ monophosphate end necessary for enzymatic circularization. RppH-treated RNA was enzymatically circularized in reactions containing final concentrations of: 10% DMSO, 200 μM ATP, 1× NEB Buffer 4, 40 U RNaseOUT (Life Technologies, #10777-019), and 30 U of T4 RNA Ligase 1 (NEB, #M0204L) for 2 hours at 37° C. Remaining linear RNA in the circularization reactions is removed by HPLC (described in Example 4). After each step, reactions were purified using the GeneJet RNA Purification Kit (Thermo Scientific, #K4082). Circularization of RNA less than 1000 nucleotides was confirmed by running 500 ng of RNA product on a 6% polyacrylamide gel in 7 M Urea-TBE (Life Technologies, #EC6865) for 3 hours at 180 V, 4° C. Circularized product characteristically migrates slower than linear RNA, so a slower migrating band indicated circularized product when run alongside control non-circularized RNA. Additional confirmation was carried out using outward-oriented PCR (OOPCR), where primers are oriented outward from one other with respect to the linear template (as opposed to traditional PCR in which primers are oriented towards each other). cDNA was synthesized (Life Technologies, #4402954) from RNA samples and used as template in the OOPCR reactions. cDNA derived from non-circularized, linear RNA was used a negative control. An amplicon is generated solely from the circularized construct, as the polymerase can extend through the ligated ends.
Example 2
Generating CRC, 5′ and 3′ UTR Constructs
(145) CRC sequences and experimental 5′/3′ UTRs were appended to RNA coding sequence (CDS) by generating PCR templates for IVT that had been amplified with primers of the following design:
(146) TABLE-US-00003 Forward primer: (SEQ ID NO: 79) 5′-(TAATACGACTCACTATAGGG)-(ttatgataac)-(tggctgcacga attgcacaa)-(desired 5′ UTR)-(varied based on RNA CDS)-3′
(147) {5′-(RNA polymerase promoter sequence)-(random nucleotides, e.g., of a tail sequence)-(5′CRC sequence)-(desired 5′ UTR)-(1.sup.st 20 nucleotides of desired RNA CDS)-3′}.
(148) TABLE-US-00004 Reverse primer: (SEQ ID NO: 80) 5′-(agcgacttcg)-(ttgtgcaattcgtgcagcca)-(desired 3′ UTR)-(varied based on RNA CDS)-3′
(149) {5′-(random nucleotides, e.g., of a tail sequence)-(3′CRC sequence)-(reverse complement of desired 3′ UTR)-(reverse complement of last 20 nucleotides of desired RNA CDS)-3′}
(150) PCR templates generated with the above primers were used to generate circularized product in accordance with the procedures described in the below Examples.
Example 3
Measuring Translation Efficiency and RNA Stability
(151) RNA constructs encoding nanoluciferase and complexed with Lipofectamine® 2000 (Life Technologies, #11668) were transfected into Hep3B cells (human hepatocyte cell line) seeded at 10,000 cells/well in a 96-well plate. Protein expression kinetics were measured using the Nano-Glo Luciferase Assay System (Promega, #N1110) using samples taken at 24, 48, and 72 hours post-transfection.
(152) To measure RNA stability, qPCR was carried out using samples derived from cells that had been transfected as described above. cDNA was synthesized at each time point using the Power SYBR® Green Cells-to-Ct Kit (Life Technologies, #4402954) according to the manufacturer's instruction. The housekeeping gene β-actin was used to normalize the results.
Example 4
HPLC Purification of Circular mRNA
(153) The established method for isolating circular mRNA (Beaudry, 1995) is limited in terms of yield and the size of RNA that can be isolated. Most significantly, the mRNA isolated from the denaturing PAGE gel is not suitable for translation. Alternative methods that have been attempted to optimize are not sufficiently effective, either; exonuclease treatment to degrade residual linear mRNA produces fragments of linear byproducts that are recognized by innate immune receptors, while oligo(dT)-mediated column separation of poly(A)-tailed linear mRNA leaves high levels of residual linear mRNA in the sample. An efficient method has been developed for isolating pure circular mRNA by exposing samples to poly(A) polymerase, followed by HPLC. This method has the added benefit of removing impurities present in in vitro transcribed mRNA samples. Prior to HPLC, circular mRNA samples are treated with a polyadenylase that adds a ˜100-200 nucleotide poly(A) tail to linear mRNA only, as circular mRNA does not have a free 3′ end to which the enzyme could make additions. This treatment allows efficient separation of the circular and linear forms of mRNA when the samples are run on an RNAsep HPLC column. In the absence of polyadenylation, the circular and linear forms cannot be separated by HPLC, as they elute at the same time owing to their identical lengths.
Example 5
Confirmation of IRES Activity in Circular mRNA
(154) Characterization studies of pure circular mRNA have provided evidence that the putative IRES “PPT19” has little to no IRES activity, suggesting that previous translation from samples that included the PPT19 sequence was attributable to residual linear mRNA in those samples. The poly(A)+HPLC method has been used to isolate pure circular mRNA that contains: an EMCV IRES, a putative PPT19 IRES, or a non-IRES derived from the 5′ UTR of insulin to evaluate the IRES activity of the 5′ UTR motifs that have been used most commonly. The translation capacity of the pure circular mRNA was tested in H1299 and HepG2 cells and in a cell-free rabbit reticulocyte lysate translation system. In the latter, IRES activity was enriched for by adding excess cap analog, which sequesters the initiation components required for cap-dependent translation to occur. Using these assays, the IRES activity of EMCV in circular mRNA was confirmed, while little to no activity was observed with the PPT19 IRES.
Example 6
Evaluation of Methods for Preparing 5′-monophosphate Ends
(155) Different methods of 5′-monophosphate end preparation were evaluated, and the results are depicted in
(156) In
(157) In
Example 7
HPLC Purification Following RNase R Digestion
(158) It was found that HPLC purification is necessary for removing immuno-stimulatory byproducts following RNase R digestion. In
Example 8
Confirmatory Methods for Verifying Circularization and Purity
(159) Circular RNA was exposed to RNase R to remove residual linear RNA from the sample. In parallel, equal levels of the linear form of a given RNA as well as a commercial mRNA (Cleancap eGFP, Trilink) were digested alongside the circular samples to provide positive controls that are expected to be completely degraded upon exposure to RNase R.
(160) In
(161) To verify ligation, the circular RNAs (and the indicated controls) were transfected into HEK293T cells, total RNA was isolated after 24 hours, and RT-PCR was performed using divergent primers (with respect to the linear construct) that amplify the region containing the 5′-3′ junction. As expected, only the circular samples produce amplicon using divergent primers (
Example 9
Poly(A) Tailing to Promote Digestion of Linear mRNA
(162) It was found that poly(A) tailing allows for complete RNase R digestion of residual linear mRNA in circularization reactions.
(163) To optimize RNase R reaction conditions, linear forms of commercial linear mRNA (Trilink) and the final constructs were exposed to the temperatures and incubation times indicated in
(164) To determine if this resistance was due to the G content in the elastinx3 3′ UTR of the constructs, constructs with varying 3′ UTRs among 3 different forms of RNA were generated (without cap and tail, with cap and tail, and in circular form). Results for RNase R digestion of these constructs is shown in
(165) To confirm that the addition of a poly(A) stretch at the 3′ end of the RNA is sufficient to allow for robust (>80%) RNase R-mediated degradation of linear RNA, linear RNA (+/−poly(A) tail and/or +/−CRC) was incubated to show that the poly(A) allowed RNAs to be sufficiently resensitized to RNase R degradation (
Example 10
A CRC Motif Enhances Circularization Efficiency without Hindering Translation or Inducing an Interferon Response
(166)
(167) To evaluate circularization efficiency, a panel of constructs varying in their size and the presence of CRC were circularized and digested with RNase R. The presence of a CRC at the ends of these constructs greatly enhanced the amount of circular RNA produced (
(168) To evaluate translation efficiency, a Nanoluciferase (Nluc)-encoding RNA construct was transfected into HEK293T cells, and the variations in protein expression (as measured by luciferase activity) were compared (
(169) To evaluate cytokine response, IFNβ induction was measured, and it was confirmed that the addition of this 10-nucleotide CRC motif did not significantly induce a type I interferon response.
Example 11
Effects of CRC Melting Temperature on Translational Capacity of Linear and Circular RNA
(170) Each construct listed in
Example 12
Persistence of Circular MRNA is Observed In Vitro and In Vivo
(171) A diagram of a target mRNA and its corresponding amplicons when using inward- and outward-oriented primers in semi-quantitative RT-PCR is shown in
(172) BALB/c mice were injected intravenously (
Example 13
EMCV-IRES-Mediated Protein Translation in Circular MRNA Persists Longer than that Derived from Canonical Linear MRNA
(173) Protein expression kinetics from circular mRNA versus linear mRNA (both encoding Nluc) was tracked over a 3-day time course in HepG2 cells (
Example 14
Circular MRNA can be Robustly Expressed In Vivo Following Intravenous Injection
(174) 10 μg of Nluc-encoding linear (
Example 15
Erythropoietin (EPO) Construct Design and Confirmation
(175) A codon-optimized mouse erythropoietin mRNA was generated with an EMCV 5′ UTR and a poly(A)x50 in its 3′ UTR (
Example 16
EMCV-Mediated Protein Translation is Inhibited in Immortalized Cell Lines and Primary Cells when Modified Nucleotides are Incorporated into the MRNA but Maintains its Functionality In Vivo
(176) As shown in
(177) The lack of protein expression is not due to disparate levels of RNA present within the cell, as roughly equal levels of RNA are detected within the cell at early time points: 4 hours post-transfection in HepG2 cells (
(178) 2 μg of mEpo mRNA in linear or circular form (+/−nucleotide modifications) was injected into mice intradermally. As a functional readout for mEpo protein production, reticulocyte counts in whole blood were measured at 6, 24, 72, and 144 hours post-injection using a flow-based assay (BD Bioscience, Retic-Counter). As shown in
OTHER EMBODIMENTS
(179) While the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.