Trans-splicing RNA (tsRNA)
11517583 · 2022-12-06
Assignee
Inventors
Cpc classification
A61P25/14
HUMAN NECESSITIES
A61P1/14
HUMAN NECESSITIES
A61K48/00
HUMAN NECESSITIES
C12N15/63
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
A61P43/00
HUMAN NECESSITIES
A61P7/04
HUMAN NECESSITIES
A61P25/28
HUMAN NECESSITIES
A61K31/7105
HUMAN NECESSITIES
C12N15/64
CHEMISTRY; METALLURGY
C12N15/67
CHEMISTRY; METALLURGY
International classification
A61K31/7105
HUMAN NECESSITIES
A61P35/00
HUMAN NECESSITIES
A61K48/00
HUMAN NECESSITIES
A61P25/14
HUMAN NECESSITIES
C12N15/113
CHEMISTRY; METALLURGY
C12N15/64
CHEMISTRY; METALLURGY
Abstract
The invention concerns a trans-splicing RNA (tsRNA) molecule comprising one or multiple unstructured binding domains; a cell or vector comprising said tsRNA; and a method for killing cells or treating a disease using said tsRNA.
Claims
1. A trans-splicing RNA (tsRNA) molecule comprising: a plurality of binding domains specific for parts of genes that associate with or are biomarkers for a disease to be treated; nucleic acid encoding at least one expressible suicide protein or a protein that is a component of a suicide system; and at least one splice signal, wherein the tsRNA includes at least one of: said nucleic acid encoding at least one expressible protein comprises two A/G-rich exonic splice enhancers (ESE) generated by using degenerative alternative codons that do not alter the amino acid sequence; said plurality of binding domains comprise a binding site comprising at least 25 consecutive unstructured nucleotides (nt) having no internal binding and/or self-complementary sequences and within or outside said binding site said binding domain, when of a length of 44 nt or longer, have at least one, or a plurality of, mismatch nucleotide(s) with respect to said genes; or said plurality of binding domains are complementary to different genes that associate with or are biomarkers for a disease to be treated.
2. The trans-splicing RNA molecule according to claim 1 wherein said plurality of binding domains include a binding site that comprises a sequence of nucleotides selected from the list comprising or consisting of: 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300 or more nucleotides.
3. The trans-splicing RNA molecule according to claim 1 wherein said plurality of binding domains comprise a sequence of nucleotides that is at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91, 92, 93, 94, 95, 96, 97, 98, 99% or 100% complementary to said part of a gene that associates with or is a biomarker for a disease to be treated.
4. The trans-splicing RNA molecule according to claim 1 wherein mismatches in said plurality of binding domains are positioned to avoid any stretches of 45 nt or longer that are perfectly complementary to the target, including or excluding said binding site.
5. The trans-splicing RNA molecule according to claim 1 wherein said at least one expressible suicide protein is the herpes simplex virus thymidine kinase (HSVtk).
6. The trans-splicing RNA molecule according to claim 1 wherein said tsRNA also includes a spacer sequence adjacent to said plurality of binding domains.
7. The trans-splicing RNA molecule according to claim 1 wherein said tsRNA, outside each binding domain, comprises at least one cis-binding or self-binding domain.
8. The trans-splicing RNA molecule according to claim 1 wherein said tsRNA comprises, outside each binding domain and 3′ of said molecule, a highly structured sequence of RNA that is folded, or pairs with itself, due to the presence of self-complementary sequences.
9. The trans-splicing RNA molecule according to claim 1 wherein highly structured RNA is adjacent a spacer located between it and a polyA site.
10. The trans-splicing RNA molecule according to claim 1 wherein said disease is cancer or a viral infection or a bacterial infection or an acquired genetic disease caused by mutations triggered by transposable elements, radiation, chemicals, or unknown triggers.
11. The trans-splicing RNA molecule according to claim 10 wherein said cancer is hepatocellular carcinoma (HCC), cervical cancer, vaginal cancer, vulvar cancer, penile cancer, skin cancers, melanoma, malignant melanoma, squamous-cell carcinoma, basal-cell carcinoma, Merkel cell carcinoma, lung cancer, cell bladder cancer, breast cancer, colon or rectal cancer, anal cancer, endometrial cancer, kidney cancer, leukemia, acute myelogenous or myeloid leukemia (AML), acute lymphoblastic leukemia (ALL), chronic lymphocytic leukemia (CML), chronic myelogenous or myeloid leukemia (CML), hairy cell leukemia (HCL), T-cell prolymphocytic leukemia (P-TLL), large granular lymphocytic leukemia, adult T-cell leukemia, lymphoma, myeloma, non-Hodgkin lymphoma, pancreatic cancer, prostate cancer, thyroid cancer, nasopharyngeal cancer, mouth or throat cancer, oropharyngeal cancers, stomach cancer, brain tumours, bone cancer, or stem cell cancers.
12. The trans-splicing RNA molecule according to claim 10 wherein said viral infection is an infection with a retrovirus, human T-cell lymphotropic virus (HTLV), lentiviruses, human immunodeficiency virus types 1 and 2 (HIV-1 and HIV-2), human papillomavirus types 16 and 18 (HPV-16 and HPV-18), hepadnavirus HAV, HBV, HCV, HDV, and/or HEV, a herpesvirus, herpes simplex (HSV), Epstein-Barr virus (EBV), cytomegalovirus (CMV), an adenovirus, an adeno-associated virus, or an influenza virus.
13. The trans-splicing RNA molecule according to claim 10 wherein said bacterial infection is an infection with Bartonella henselae, Francisella tularensis, Listeria monocytogenes, salmonella species, Salmonella typhi, Brucella species, Legionella species, Mycobacteria species, Mycobacterium tuberculosis, Nocardia species, Rhodococcus species, Yersinia species, or Neisseria meningitides.
14. The trans-splicing RNA molecule according to claim 10 wherein said acquired genetic disease is Neurofibromatosis 1 and 2, Mc Cune Albright, Duchenne muscular dystrophy (DMD), Epidermolysis bullosa, Fanconi A and C, Philadelphia chromosome, Hemophilia A and B, cystic fibrosis, Muckle Wells syndrome, lipoprotein lipase deficiency, B-thalassemia, or pyruvate dehydrogenase complex deficiency.
15. A cell containing said tsRNA according to claim 1.
16. A vector containing said tsRNA according to claim 1.
17. A method of targeting a diseased cell comprising: topical application; intranasal application; alveolar application; systemic application; oral application; intravenous application; intramuscular application; subcutaneous application; cutaneous application; intraperitoneal application; or injection into a tumor with tsRNA, or a vector containing the tsRNA, according to claim 1 in vivo and, optionally, exposing said cell to other component(s) of said suicide system effective to kill said cell.
18. A medicament comprising said tsRNA according to claim 1, further comprising at least one further component of said suicide system effective to trigger death of a cell expressing said trans-spliced RNA.
19. A pharmaceutical composition comprising said tsRNA according to claim 1, further comprising at least one further component of said suicide system effective to trigger death of a cell expressing said trans-spliced RNA; and a carrier suitable for human or veterinary use.
20. The pharmaceutical composition according to claim A wherein said one further component of said suicide system is selected from the group comprising or consisting of ganciclovir, cytosine deaminase-5-fluorocytosine, cytochrome P450-ifosfamide, cytochrome P450-cyclophosphamide, and nitroreductase-5-[aziridin-1-yl]-2,4-dinitrobenzamide.
21. The trans-splicing RNA molecule according to claim 1 wherein said nucleic acid encoding at least one expressible protein comprises two A/G-rich exonic splice enhancers (ESE) generated by using degenerative alternative codons that do not alter the amino acid sequence.
22. A trans-splicing RNA according to claim 1, wherein said binding domain comprise a binding site comprising at least 25 consecutive unstructured nucleotides (nt) having no internal binding and/or self-complementary sequences and within or outside said binding site said binding domain, when of a length of 44 nt or longer, have at least one, or a plurality of, mismatch nucleotide(s) with respect to said genes.
23. The trans-splicing RNA molecule according to claim 1 wherein said plurality of said binding domains are complementary to the same parts of a gene that associates with or is a biomarker for a disease to be treated.
24. The trans-splicing RNA molecule according to claim 1 wherein said plurality of binding domains are complementary to different genes that associate with or are biomarkers for a disease to be treated.
25. A trans-splicing RNA (tsRNA) molecule comprising: a plurality of binding domains specific for parts of genes that associate with or are biomarkers for a disease to be treated; nucleic acid encoding at least one expressible protein; and at least one splice signal, wherein said plurality of binding domains comprise a binding site comprising at least 25 consecutive unstructured nucleotides (nt) having no internal binding and/or self-complementary sequences and within or outside said binding site said binding domain, when of a length of 44 nt or longer, have at least one, or a plurality of, mismatch nucleotide(s) with respect to said genes.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The patent or application file contains at least one drawings executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
(2) An embodiment of the present invention will now be described by way of example only with reference to the following wherein:
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
METHODS AND MATERIALS
(16) RNA design: The trans-splicing constructs were designed combining various reported and novel molecular features to improve activity and target specificity. The 3′ER ts constructs consisted of a CMV promoter (pEGFP-N1, Clontech acc no. U55762) followed by a binding domain (BD) of 50 bases complementary to the target AFP intron 5. The BD included two mismatches at positions 18 and 19 to inhibit potential antisense (as) effects that can be triggered by longer dsRNA in the nucleus of the cell. Software ‘foldanalyze’ (HUSAR, DKFZ) was used to select short unstructured BDs within the complete antisense RNA structure space that can be directed against the AFP intron 5. Structures of the selected BDs were confirmed by RNA 2° structure (minimum free energy and centroid) predictions using software tools mfold and RNAfold. Such selected BDs were then fused with the rest of the trans-splicing RNA making sure that the BDs remained unstructured upon fusion and were not involved in base-pairing the trans-splicing or coding domains which was achieved by implementing suitable spacers. The selected 3′ splice signal (3′ ss) was designed to functionally compete with the cellular cis-splice site and was supported by an intronic splice enhancer (ISE) (McCarthy, et al., 1998; Konczak, et al., 2000; Yeo et al., 2004), a branchpoint (BP) (Eul, 2006) and polypyrimidine tract (Ppt) (Nobel, et al., 1998; Taggart, et al, 2012). The HSVtk cds was preceded with a sequence coding for a proteolytic cleavage site P2A (Kim, et al, 2011) to ensure endogenous release of the native HSV-tk from the AFP-HSVtk fusion protein that initially results from the trans-splicing process. The HSVtk gene is devoid of a start codon and can only be translated after trans-splicing using the translational start of the target message. The HSVtk gene was equipped with an A/G-rich exonic splice enhancer (ESE) generated by using degenerative alternative codons that do not after the HSV-tk amino acid sequence (Fairbrother, et al, 2002; Jin et al., 2003) (supplementary FIG. 1a). A beta-globin mini-intron of 133 bases (pCMVTNT™, acc num. AF477200.1) was introduced in the HSVtk gene at a splice site consensus motif (3′ ss CAG/G and 5′ ss MAG. For transcriptional termination the SV40 polyA sequence (pcDNA3.1, Life Technologies) was used.
(17) The 5′ER ts constructs were designed with the same molecular features as p3ER but with different orientation including a translational signal motif along with the CMV promoter. All the structural elements important for translation of eukaryotic mRNA were included: original cap site of AFP (Gibbs, et al. 1987) followed by the consensus Kozak sequence GCCRGCCAUGG (Kozak, 1995, 1999, 2005). Immediately after the translation start signal was the coding domain HSVtk inclusive of the ESE and mini intron followed by a 5′ ss signal (Freund, et al. 2005). The 5′ BD was designed in a similar way with mismatches at positions 24 and 25 to avoid as-effects. Following the BD, a hammerhead ribozyme (HH Rz) (Saksmerprome, et al. 2004) was incorporated for enhanced cleaving of the BD after delivery into the nucleus. The HH RZ is followed by a long spacer to isolate the polyA from the ribozyme followed by the SV40 polyA.
(18) Mutation designs: The 3′/5′ΔHSVtk were designed to produce a full-length inactive HSVtk protein with two point mutations: A to G at position 115 (glycine to glutamic acid), G to A mutation at position 649 (histidine to arginine) (Sasadeusz, et al. 1997). The 3/5′Δss were designed to check the importance of having an active splice signal, mutation in the ss should result in reduced ts. The 3′Δss1 had the conserved BP changed from A to C, 3′Δss2 had 6/8 nucleotides changed including the BP and both the ss has mutated AG to TC acceptor ss. The 5′Δss1 had 7/11 bases changed with consensus donor ss GT intact and 5′Δss2 had 10/11 bases changed including the mutation of donor ss from GT to AC. The 5′ΔHH Rz had the conserved cleavage motif GUC changed to ACA to eliminate or greatly reduce the cleaving efficiency of the ribozyme.
(19) Plasmid construction: The 3′ and 5′ parental exon replacement (ER) constructs named p3ER_ΔBD-opt and p5ER_ΔBD-opt_HH respectively were gene synthesised (GeneArt, Regensburg) and cloned into pVAX1 (AddGene) using SpeI and BbsI to be used as master vectors to sub-clone the remaining of the trans-splicing constructs. The AFP mini-gene consisting of exons 3-6 and introns 3 and 5 (AFP_E3-E6) derived from NCBI (acc num M16110) was gene synthesised (GeneArt, Regensburg) and cloned into pVAX1 using NheI and KpnI. The HSVtk positive control was sub-cloned from the p3EL and p5EL expression plasmids using SacI and BamHI. The complete 1136 base cds of the HSVtk gene (NCBI acc num AF057310) was gene synthesized as part of trans-splicing construct.
(20) A total of 80 constructs were designed and region of change was either gene synthesised or PCR amplified and were sub-cloned into the p3ER_ΔBD-opt and p5ER_ΔBD-opt_HH parental or master vectors. The p3ER_ΔBD-opt_Δss1 and p3ER_ΔBD-opt_Δss2 are 3′ splice site mutations sub-cloned inside p3ER_ΔBD-opt with BbvCI and SacI, NheI and PvuI respectively to replace the wild type ss. Similarly the p5ER_ΔBD-opt_HH_Δss1 was cloned inside p5ER_ΔBD-opt_HH with BssHII and BbsI. The p5ER_ΔBD-opt_HH_Δss2 was synthesised using nested PCR method to generate the desired 5′ ss mutation and cloned into the master vector with BssHII and KpnI. The 3′ and 5′ substitution mutation to generate a weaker HSVtk protein namely p3ER_ΔBD-opt_ΔHSVtk and p5ER_ΔBD-opt_HH_ΔHSVtk was cloned into their parental vectors using PvuI and PstI, PstI and NheI respectively. The 5′HH Rz mutation namely p5ER_ΔBD-opt_ΔHH was generated using nested PCR method and cloned into the parental vector with KpnI and BbvCI. The p3ER_BD-opt with NheI and BbvCI and p5ER_BD-opt_HH with KpnI and BbvCI were sub-cloned to generate the BDs with no mismatch (state-of-the-art BDs) with the target. The p3ER_BD(−) was generated by removing the BD from the p3ER_ΔBD-opt with NheI and BbvCI and replacing a small oligo with same RE overhang. However the p5ER_BD(−)_HH and p5ER_BD(−)_ΔHH were generated by replacing the 5′BD with a random 8-mer to bind with the stem Ill of the HH Rz, sub-cloning into the parental vector with KpnI and BbvCI. The p5ER_ΔBD-opt_HH(−) with no HH was sub-cloned using KpnI and BbvCI. The 3′ structured BDs namely p3ER_ΔBD-struc1, p3ER_ΔBD-struc2 and p3ER_BD-opt-inv were sub-cloned using NheI and BbvCI. The 5′ structured BDs namely p5ER_ΔBD-struc1_HH and p5ER ΔBD-opt-inv_HH were sub-cloned with KpnI and BbsI. More 5′ER constructs to look at the stability of the 3′ end of the RNA like p5ER_ΔBD-opt_hp_HH and p5ER_ΔBD-opt_Y_HH were cloned inside the parental using KpnI and BbsI. Additional BDs to study the specificity of on-target and alternative trans-splicing in the 3′ER namely p3ER_ΔBD-opt_D, p3ER_ΔBD-opt_E, p3ER_ΔBD-opt_F, p3ER_ΔBD-opt_EF, p3ER_ΔBD-opt_DEF and p3ER_BD(−)_D were cloned into the parental using NheI and BbvCI. To further study the effects of these sub-optimal trans-splicing constructs in context to splice mutants, they were cloned into the p3ER_ΔBD-opt_Δss1 and p3ER_ΔBD-opt_Δss2 for 3′ set and p5ER_ΔBD-opt_HH_Δss1 and p5ER_ΔBD-opt HH_Δss2 for 5′ set. The 3′ structured BDs (6 different constructs: p3ER_ΔBD-struc1/2/opt-inv_Δss1/2), 3′ no mismatch BDs (2 constructs: p3ER_BD-opt_Δss1/2) and 3′ no BDs (2 constructs: p3ER_BD(−)_Δss1/2) were cloned using NheI and BbvCI. The 5′ structured BDs (4 different constructs: p5ER_ΔBD-struc1/opt-inv_HH_Δss1/2), 5′ no mismatch BDs (2 constructs: p5ER_BD-opt_HH_Δss1/2), 5′ no BDs_wt HH (2 constructs: p5ER_BD(−)_HH_Δss1/2) and 5′ no BD_mut HH (2 constructs: p5ER_BD(−)_ΔHH_Δss1/2) were cloned using KpnI and BbvCI. To study the effect of sub-optimal trans-splicing constructs in context of no mismatch or state-of-the-art BDs, the structured BDs were made perfect complementary with the target and cloned into p3ER_ΔBD-opt with NheI and BbvCI (total 3 constructs: p3ER_BD-struc1/2/opt-inv), cloned into p5ER_ΔBD-opt_HH with KpnI and BbsI (total 2 constructs: p5ER_BD-struc1/opt-inv_HH).
(21) To improve overall trans-splicing, constructs were designed to target two pre-mRNAs (one against AFP and the other against either HCCA2, CD24 or VEGF) simultaneously in the 3′ER context, p3ER ΔBD-opt_AFP+HCCA2 and p3ER_ΔBD-opt_HCCA2+AFP were sub-cloned into the p3ER parental vector using NheI and BbvCI. The other targets namely p3ER_ΔBD-opt_AFP+CD24 and p3ER_ΔBD-opt_AFP+VEGF were further sub-cloned into the p3ER_ΔBD-opt_AFP+HCCA2 vector with EcoRI and BbvCI by replacing the HCCA2 serving as the second BD. Similarly p3ER_ΔBD-opt_CD24+AFP and p3ER_ΔBD-opt_EGF+AFP were sub-cloned into the p3ER_ΔBD-opt_HCCA2+AFP vector with EcoRI and NheI by replacing the HCCA2 BD serving as the first BD. For flow cytometry analyses, the GFP gene (amplified from pEGFP.C2) was cloned into pGL3-control using HindIII and XbaI, the SV40 promoter-GFP-SV40 polyA-SV40 enhancer cassette from pGL3 plasmid was cloned into a self-generated MCS site in pVAX1-trans-splicing vectors using BgIII and SaII. The GFP cassette in pVAX1-AFP negative control vector was cloned directly using KpnI and BamHI. To generate the trans-splicing constructs targeting HPV16 genes, the BD from the parental vector p3ER ΔBD-opt was digested with Bam HI and XhoI and replaced with BDs E1a and E5 to generate p3ER ΔBD-opt_E1a and p3ER_ΔBD-opt_E5 respectively. Similarly BDs E2 and E6 were cloned into p3ER ΔBD-opt by replacing AFP BD using XhoI and XbaI to generate p3ER_ΔBD-opt_E2 and p3ER_ΔBD-opt_E6. For 5′ER, the parental vector p5ER_ΔBD-opt_HH containing AFP BD was replaced with HPV16 BD E1b with enzymes HindIII and BamHI to generate p5ER_ΔBD-opt_E1b_HH and p5ER_ΔBD-opt_HH(−) vector's AFP BD was replaced with E6 BD to form p5ER_ΔBD-opt_E6_HH(−) with HindIII and BamHI.
(22) Dumbbell (db) construction: Generating dumbbells for trans-splicing from the plasmid vectors was done using the ELAN method of db production. The Enzymatic Ligation Assisted by Nucleases (ELAN) is a three step process which includes digestion of the transcription cassette from the plasmid, ligation of the closing loops on either side followed by exonuclease treatment to eliminate the unclosed db plasmids.
(23) (a) Phosphorylation of Stem-Loop Primers
(24) The stem loops consisting of individual RE site were synthesised by AIT Biotech (Singapore) and was phosphorylated using the following reaction shown in Table 1:
(25) TABLE-US-00001 TABLE 1 Reaction setup for stem-loop phosphorylation using polynucleotide kinase (PNK) COMPONENTS STEM LOOP PRIMERS Stem loop oligo 10 μM 60 pmoles 10X Buffer A 2 μL PNK enzyme 1-2 U 10 mM ATP 2 μL Water nuclease-free Make up volume TOTAL 20 μL
(26) The Stem-loop primers were Stem loop-SpeI and Stem-loop-BamHI.
(27) (b) ELAN Method
(28) In the ELAN loop-ligation method, the gene expression cassette was directly cut out from parental plasmid. 50 times more stem-loops were added in the ligation reaction to ensure that most of the gene expressing cassettes could be capped. By-products such as loop dimers were cleaved by the restriction enzymes and were destroyed during the exonuclease treatment. Detailed setups of the reaction are shown in Table 2.
(29) TABLE-US-00002 TABLE 2 Reaction setup for the generation of trans-splicing dumbbells using the ELAN loop-ligation strategy COMPONENTS AMOUNT CONDITIONS Digestion Parental plasmid 6 pmoles 37° C. incubation for 4 hours SpeI RE 5 U and heat inactivation at 65° C. BamHI RE 5 U for 15 minutes HindIII RE 5 U 10x Fast digest buffer 5 μL Water nuclease-free Make up volume TOTAL 50 μL ELAN reaction Digestion mix 50 μL 22° C. for 4 hours to overnight Loop-1 60 pmoles and heat inactivation at 85° C. Loop-2 60 pmoles for 5 min 10X Fast digest buffer 10 μL 100 mM ATP 1.5 μL SpeI RE 1 U BamHI RE 1 U BgIII RE 1 U HindIII RE 1 U XbaI RE 1 U T4 DNA ligase 3 U Water nuclease-free Make up volume TOTAL 15 μL Exonuclease treatment ELAN mix 148 μL 37° C. incubation for 2 hours T7 DNA polymerase 10 U and heat inactivation at 85° C. for 5 min
(30) Cell Culture: Human hepatocytes (HepG2), human cervical cancer cell lines (Siha, HeLa) and mouse cervical cancer cell line (C3) were maintained at 37° C. in a humidified incubator with 5% CO2 in Dubecco's Modified Eagle's Medium (HyClone, Thermo Scientific), supplemented with 10% Fetal Bovine Serum (HyClone) and 1% penicillin-streptomycin. The cells were passaged every 3-4 days at desired density.
(31) Transfection of plasmid DNAs: HepG2 cells were transfected at ˜70-90% confluency in a 6-well plate for Western blotting, 12-well for FACS analyses and 24-well plate for all other analyses using either Lipofectamine 3000 (for FACS studies only) or Lipofectamine 2000, (Life Technologies) according to manufacturer's protocol. A total of 1 μg DNA was co-transfected or transfected in a 24-well plate format (500 ng:500 ng of ts construct: AFP_E3_E6 minigene) in over-expression studies and 1 μg of ts constructs only in endogenous studies. For 12-well and 6-well formats, the amount of total DNA was scaled up to 2 μg and 4 μg respectively. For FACS experiments, either 500 ng of pEGFP-C2 plasmid was co-transfected along with the ts and AFP mini-gene vectors to perform experiments with some 3′ER and 5′ER plasmids and dumbbells, or the GFP infused ts vectors were used along with/without AFP mini-gene for over-expression and endogenous studies respectively.
(32) Total RNA Isolation: RNA was isolated 24 hours post-transfection using RNeasy plus kit (Qiagen) following the manufacturer's protocol. RNA concentrations were measured using NanoDrop 2000.
(33) cDNA conversion and real-time RT-PCR: 500 ng RNA from all samples was converted into cDNA using the First Strand SuperScript RTIII (Invitrogen) kit with 200 ng of random hexamers and 10 uM of dNTPs. The reaction conditions were 25° C. for 5 min, followed by 50° C. for 2 h and enzyme inactivation at 70° C. for 15 min. 20 ng of cDNA was used as template for real time RT-PCR. TaqMan quantification was performed in ABI 7900HT of the cDNAs by designing specific probe and primer sets for each cis- and trans-splicing detection. One set of probes were designed in the AFP regions; namely AFP probe exon 5 and AFP probe exon 4 to detect 3′ER and 5′ER cis and trans-splicing respectively. Primers to detect 3′ER cis-splicing along with AFP probe were FP afp exon 5 (set1) and RP afp exon 6 and for 3′ER trans-splicing were FP afp exon 5 (set1) and RP HSVtk. Primers to detect 5′ER cis-splicing along with AFP probe were FP afp exon 3 and RP afp exon 4 (set1) and for 5′ER trans-splicing were FP HSVtk (set1) and RP afp exon 4 (set1). Another set of probes were designed in the distal HSVtk region to detect 3′ER, named HSVtk probe for 3′ER and proximal HSVtk region to detect 5′ER, named HSVtk probe for 5′ER trans-splicing alone. The primers to detect 3′ER trans-splicing along with HSVtk probes were FP afp exon 5 (set2) and RP HSVtk. Primers to detect 5′ER trans-splicing along with HSVtk probes were FP HSVtk (set2) and RP afp exon 4 (set2). The number of cycles in over-expression studies and endogenous studies were 40 and 50 respectively. RT-PCR: Reverse transcription PCR was performed on the cDNA samples using Taq DNA polymerase (Fermentas) with 60 cycles of two-step PCR (30+30 cycles or 35+35 cycles) to detect 3′ and 5′ cis and trans-splicing and the bands were visualized on a 1% agarose gel. The primers used to detect 3′ER cis-splicing and mock were FP afp exon 5 (set1) and RP afp exon 6, to detect 3′ER trans-splicing were FP afp exon 5 (set1) and RP HSVtk. To detect 5′ER cis-splicing and mock were primers FP afp exon 3 and RP afp exon 4 (set1), to detect 5′ER trans-splicing were primers FP HSVtk (set1) and RP afp exon 4 (set1). To visualize specific versus alternative on-target trans-splicing, the p3ER_ΔBD and p3ER_BD(−) cDNAs were amplified using primers FP afp exon 5 (set2) and RP HSVtk for specific ts and with primers FP afp exon 3 and RP HSVtk for alternative on-target ts. The p5ER_ΔBD_HH and p5ER_BD(−)_HH cDNAs were amplified with primers FP HSVtk (set2) and RP afp exon 4 (set2) for specific ts and with primers FP HSVtk (set2) and RP afp exon 6 for alternative on-target ts.
(34) Alamar assay: To check the functional activity of trans-splicing, drug Ganciclovir (GCV) (Sigma) was added to the cells at a concentration of 10 μM, 100 μM and no GCV (internal negative control) 24 hours post-transfection followed by addition of AlamarBlue® cell viability reagent (Thermo Scientific) 24 hours post drug for a duration of 6 days with replacement of fresh media and drug every day after each alamar reading. The fluorescence was measured at 230/290 nm after 90 minutes of incubation at 37° C. The positive and negative controls for the assay were designed as mentioned in the manufacturer's protocol.
(35) Single cell gel electrophoresis: Also known as the Comet Assay, it was carried out to check for double-stranded DNA breaks upon 10 μM GCV administration 24 hours post-transfection and cells harvested 24 hours post-drug treatment using alkaline lysis method (Olive, et al., 2006). The comets were stained with Propidium Iodide 10 μg/ml (Life Technologies) and analysed under a fluorescent microscope at 10× and 20× magnifications. Approximately 150-200 comets were scored/sample.
(36) Flow Cytometry: To check for apoptosis, cells were harvested 48 hours post 100 μM GCV treatment and stained using Propidium Iodide and Alexa Fluor 647 Annexin V (Life Technologies) in Annexin-binding buffer according to manufacturer's protocol. The samples were gated based on single live cell populations which were positive for GFP. The final % apoptosis values are indicated as (early and late apoptosis+GCV)−(early and late apoptosis-GCV).
(37) Western Blotting: To detect both AFP and HSVtk proteins, cells were harvested 24 hours post-transfection and a total of 50 μg protein was loaded in a 10% SDS-PAGE, transferred on a PVDF membrane and blocked with 5% milk (w/v). The respective primary antibodies (HSV-tk vL-20 goat polyclonal Santa Cruz cat no. sc-28038, AFP goat polyclonal Santa Cruz cat no. sc-8108 and Beta actin (internal control) rabbit polyclonal Santa Cruz cat no.sc-130656) were incubated in 5% milk overnight at 4° C., followed by 2 hour incubation with secondary antibodies (HSVtk and AFP anti goat Santa Cruz cat no. sc-2020 and Beta actin anti-rabbit Santa Cruz cat no. sc-2357. The blots underwent chemiluminescence using Pierce ECL Western Blotting substrate (Thermo Scientific) and developed in an imager (BioRad).
(38) Prediction of splice sites: The strength and nature of the splice sites were predicted using softwares Alternative Splice Site Predictor (ASSP) (Wang, 2006) http://wangcomputing.com/assp/overview.html and Berkeley Drosophila Genome Project Splice Site Prediction (BDGP SSP) (Reese, et al., 1997) fruitfly.org/seqtools/splice.html using default cut-off values for the splice site predictions. To predict the nature of splice sites in HPV16 genome, ASSP was used to document constitutive or cryptic splice acceptors and donors based on the overall score and confidence generated by the software. The alternative splice sites with confidence >0.89 and score >5.5 and constitutive splice sites with confidence >0.1 and score >7.7 besides the documented splice sites (Johansson, 2013 and Schmitt, et al, 2011) were selected for the trans-splicing analyses.
(39) Results and Discussion
(40) De Novo Designed Trans-Splicing RNA for 5′ and 3′ Exon Replacement (ER) Triggers Targeted Trans-Splicing Towards an Overexpressed or Endogenous Pre-mRNA Target
(41) This invention refers to novel optimized RNA sequences and structures designed to achieve higher trans-splicing activity and specificity. We designed parental trans-splicing RNA (tsRNA) molecules both for 5′ER or 3′ER comprising the following molecular features (
(42) In addition, we furnished the tsRNA for 5′ER with a tertiary structure-stabilised hammerhead ribozyme (HHRz) (Saksmerprome, et al. 2004) positioned downstream of the BD to crop itself together with the SV40 polyA site in order to trigger nuclear RNA retention and to avoid trans-splicing-independent HSVtk expression. Formation of the active ribozyme structure was supported by inserting a spacer between the ribozyme and the polyA site. The tsRNA for 3′ER, on the other hand, was equipped with the P2A proteolytic cleavage site (Kim J H, et al. 2011) positioned immediately downstream of the splice acceptor SA site to trigger proteolytic release of the HSVtk from the chimeric fusion protein which results from the trans-splice reaction. The constructs were designed for maximum activity.
(43) Central 2 nt target mismatches were included into the binding domains (ΔBD) to avoid that target binding generates long double-stranded nuclear RNA which might trigger antisense effects, including A-to-I editing by adenosine deaminases acting on RNA (ADARs), which could impair the trans-splice strategy. That way optimised BDs for 3′ or 5′ER were termed p3ER_ΔBD-opt or p5ER_ΔBD-opt_HH. As controls we designed for 5′ and 3′ER each two splice site mutants (Δss1 and Δss2), the partly inactive HSVtk mutant (A to G mutation at position 115 and a G to A mutation at position 649 of HSVtk gene), and for 5′ER a control harbouring an inactive HHRz cleavage site (
(44) Design of Binding Domain Sequence and Structure Substantially Improves 3′ but not 5′ Exon Replacement
(45) We investigated the role of BD RNA secondary in RNA trans-splicing. Using our previously described software tool foldanalyse (Senger, et al. 1995) we identified the least structured BDs of about 50 nt in length that can be targeted against the AFP pre-mRNA: 3ER_BD-opt or 5ER_BD-opt for 3′ or 5′ER which bind to intron 5 or intron 3, respectively (
(46)
(47) Multiple Binding Domains Enhance Targeted Trans-Splicing and Suppress Alternative On-Target Trans-Splicing Thereby Increasing the Specificity of 3′ER
(48) To suppress alternative on-target trans-splicing and to improve the trans-splicing specificity we designed novel RNAs for 3′ER which harbor multiple target and/or self-binding domains (
S=TSspec/TSalt*100% (1)
(49) All secondary BDs increased the specificity of trans-splicing as reflected by an increase in the specificity factor. While BD-F and more pronounced BD-D suppressed both specific and alternative on-target trans-splicing, BD-E and the combination of BD-E and -F enhanced specific but suppressed alternative on-target trans-splicing. Most successful, however, was the combination of all three additional BDs D, E, and F which enhanced specific trans-splicing about 2-fold and reduced alternative splicing about 5-fold thus exhibiting a 10-fold higher specificity of trans-splicing as compared with the parental construct. In the construct without any target binding domain, internal BD-D suppressed trans-splicing towards the strong splice donor D1 or the moderately strong donor D2 20-fold or 130-fold presuming trans-splicing to any other cellular off-targets was suppressed to a similar extent. Notably, a comparable reduction of trans-splicing was observed in BD(−) constructs harbouring splice sites that were weakened by mutagenesis.
(50) Trans-Splicing Towards Over-Expressed or Endogenous AFP Pre-mRNA Triggers Death in a Human Liver Carcinoma Cell Line
(51) As suicide gene system we chose the combination of the herpes simplex virus thymidine kinase (HSVtk) and the prodrug ganciclovir (GCV). Trans-splicing will trigger HSVtk expression in a target RNA-dependent manner which then catalyses phosphorylation of GCV into its monophosphate, which is subsequently converted into its di- and tri-phosphate derivatives by cellular kinases. The toxic GCV-triphosphate then acts as a deoxyguanosine triphosphate (dGTP) analogue, thus sitting on the DNA chain during replication, causing chain termination and cell death. We investigated death of human liver carcinoma cells HepG2 triggered by 5′ or 3′ER towards the overexpressed or the endogenous AFP pre-mRNA using three different assays. Firstly, the alamar blue cell viability assay. Cells were transfected and 10 or 100 μM GCV was added to the medium 24 hours post-transfection, alamar dye was added 24 hours post drug treatment and fluorescence was measured after an incubation time of 90 min. After alamar reading, the media and drug was replenished and the process repeated for 6 consecutive days. Highest levels of cell death, i.e. up to 80% at 100 μM GCV (
(52) Trans-Splicing RNA Simultaneously Targeting Two Endogenous Liver Cancer Markers Triggered Enhanced Cell Death at 10-Fold Lower Ganciclovir Doses
(53) As many other human diseases, the carcinogenesis of hepatocellular carcinoma (HCC) is a multi-factorial, multistep, complex process and a single biomarker is not accurately indicating the disease and its stages. To increase both HCC specificity and sensitivity of our approach, we investigated bispecific (dual targeting) tsRNAs targeting two HCC biomarkers simultaneously using distinct BDs. Multiple HCC biomarkers have been reported in the literature and we measured the abundance of 12 corresponding pre-mRNAs and mRNAs in 10 different cell lines or cells (Supplementary
(54) HPV-16-Targeting Suicide RNA Specifically Killed HPV-16-Transformed Tissue Culture Cells
(55) The universal design of our tsRNA facilitates replacement of the BD together with the spacer in order to target any pre-mRNA of interest. As a second clinically relevant target we chose the human papillomavirus type 16 (HPV-16). HPVs establish productive infections in keratinocytes of the skin and the mucous membranes causing benign papilloma's, premalignant lesions, and cancer. HPV infection is the most frequent sexually transmitted disease worldwide and the two high risk types HPV-16 and HPV-18 cause about 70% of all cervical cancer cases. Prior to cell transformation, HPV-16 genomes integrate into the host cell genome and there is no way to erase the viral DNA from an infected individual. However, selective destruction of the infected cells by suicide gene therapy may represent an approach to solving this problem. In HPV infections, alternative splicing generates multiple isoforms of viral mRNA. We computationally selected the five most favourable unstructured antisense BDs (opt_E6, _E1a, _E1b, _E2 and _E5) which can be directed against HPV-16 transcripts targeting the early viral genes E6, E1, E2, and E5 (
(56) Minimalistic Dumbbell-Shaped DNA Vectors for Cellular Delivery of Trans-Splicing Molecules
(57) Dumbbell-shaped DNA minimal vectors, or dumbbell vector, have several advantages over traditional plasmid vectors or viral vectors. While many viral vectors are fraught with safety risks, plasmids being much larger can trigger side effects including immunotoxicity and suffer from transgene silencing in primary cells. Dumbbells do not have these disadvantages and compared with plasmids are more efficient in terms of cellular delivery, nuclear diffusion, and gene expression. First-time we generated dumbbell vectors to deliver RNA trans-splicing. That was achieved by cutting the trans-splicing cassette out from the plasmid and subsequently closing the ends by ligation of hairpin loop structures (
LIST OF OLIGONUCLEOTIDES, PROBES AND PRIMERS
(58) TABLE-US-00003 List of oligonucleotides, probes and primers No. Oligonucleotide names Sequence (5′-3′) Modifications For general cloning and sequencing SEQ ID NO: 1 FP-pVAX1 (set 1) GACTCTTCGCGATGTACGGG SEQ ID NO: 2 RP-pVAX1 (set 1) GTGTTGATGGCAGGGGTACG SEQ ID NO: 3 FP-pVAX1 (set 2) TTACCCTGTTTCGGGCCC SEQ ID NO: 4 RP-pVAX1 (set 2) GCTTCCCAACCTTACCAGAGG SEQ ID NO: 5 FP-pVAX1 (set 3) CGGTTTGACTCACGGGGAT SEQ ID NO: 6 RP-pVAX1 (set 3) GGCAAACAACAGATGGCTGG SEQ ID NO: 7 FP-pVAX1 (set 5) CTTTTCGGGGAAATGTGCG SEQ ID NO: 8 FP-pVAX1 (3′ and 5′ par) TGAGAAAGCGCCACGCTTC SEQ ID NO: 9 FP-pSUPER (set 1) GTGAGCGCGCGTAATACGA SEQ ID NO: 10 RP-pSUPER (set 1) AGGCACCCCAGGCTTTACAC SEQ ID NO: 11 FP-pEGFP.C2 (set1) CAAGGTGAACTTCAAGATCCGC SEQ ID NO: 12 RP-pEGFP.C2 (set1) ATTGCATTCATTTTATGTTTCAGGTTC SEQ ID NO: 13 FP 3′BAD GACGCAAATGGGCGGTAG SEQ ID NO: 14 FP-5′par (2) GTGAACCGTCAGATCGCTAGC SEQ ID NO: 15 FP-5′par (3) CACAGGGTGAGATATCGGCC SEQ ID NO: 16 FP-5′par (4) AGGCTCCATACCGACGATCT SEQ ID NO: 17 RP-5′par (2) CGGGCGATTGGTCGTAATCC SEQ ID NO: 18 RP-5′par (3) TGTCTCGACAAGCCCAGTT SEQ ID NO: 19 RP-5′par (4) CAATGGGGCGGAGTTG SEQ ID NO: 20 RP-5′par (5) CCATTGACGTCAATGGGGTGG SEQ ID NO: 21 FP-BssHII (check par change) ACCTGGCGCGCACGT SEQ ID NO: 22 RP-BbsI (check par change) CCAGCATGCCTGCTATTGTCT SEQ ID NO: 23 FP-3′ss mut change (NheI) TCAGATCGCTAGCACCTCTCTAAA SEQ ID NO: 24 RP-3′ss mut change (PvuI) AAAAAACGATCGGCAGACAACCCAGGCCCAG GCC SEQ ID NO: 25 FP-5′ss mut change (BssHII) CCTGGCGCGCACGTTTGCGCGGGAGATGGGG GAGGCGAACTGATACACG SEQ ID NO: 26 FP-5′ss mut change GGAGGCGAACTGATACACGGCATAGCTTTCAT (mismatch) TTTTGTCTTTTTTTAAC SEQ ID NO: 27 RP-5′ss mut change (KpnI) GAAGGAGGGGTACCCCC SEQ ID NO: 28 FP-HSVtk (HindIII) CTAGACATAAGCTTATGGCTTCGTACCCCTGC SEQ ID NO: 29 RP-HSVtk (KpnI) ACTGATCTGGTACCTCAGTTCGCCTCCCCC SEQ ID NO: 30 FP-HHRz change (mutation) CTCATACCACATTTCAGACATAAATACATCCCC ACAGCCTAAGGC SEQ ID NO: 31 FP-HHRz change (middle 2) ATACATATACACACACACACACACTCATACCAC ATTTCAGACATAAATG SEQ ID NO: 32 FP-HHRz change (middle 1) CCTCCTCCTCCTCCTCCTATATACATATACACA CACACACACACTCA SEQ ID NO: 33 FP-HHRz change (KpnI) TGGGGGTACCCCTCCTTCCTCCTCCTCCTCCT CCTCCTCCTATATA SEQ ID NO: 34 RP-HHRz change (BbvCI) TAAACAAGTTGCTGAGGCTATCTC SEQ ID NO: 35 3′NO BD oligo 1 CTAGCGGATCCAAGCTTCC 5′PO.sub.4 SEQ ID NO: 36 3′BO BD oligo 2 TGAGGAAGCTTGGATCCG 5′PO.sub.4 SEQ ID NO: 37 FP-3′block ppy (NheI) CGCAGACGCGTGTTGATG SEQ ID NO: 38 RP-3′block ppy (MluI) TCAGATCGCTAGCCTGGAAAAAATCGGAAAAA AAACCTCTCTAAACCTAAATTAAATTTTCTTCTC For real time RT-PCR to detect splicing SEQ ID NO: 39 HSV-tk uni amplicon CATGCCTTATGCCGTGACCGACG 5′FAM 3′BHQ SEQ ID NO: 40 AFP probe exon 4 CTTGCACACAAAAAGCCCACTCCAGC 5′FAM 3′BHQ SEQ ID NO: 41 AFP probe exon 5 CCTACAATTCTTCTTTGGGCTGCTCGCT 5′FAM 3′BHQ SEQ ID NO: 42 HSV-tk probe (for 3ER) CCCCTGCCATCAACACGCGTC 5′FAM 3′BHQ SEQ ID NO: 43 HSV-tk probe (for 5ER) CGACCTGGCGCGCACGTTT 5′FAM 3′BHQ SEQ ID NO: 44 FP-HSV-mini intron uni amp GACATCCACTTTGCCTTTCTCTC SEQ ID NO: 45 FP-HSV-uni amp AAGCGCCCAGATAACAATGG SEQ ID NO: 46 FP-HSV tk half half uni amp GCCTCGACCAGGGTGAGAT SEQ ID NO: 47 RP-HSV-uni amplicon CTCATATCGGGGGGGAGG SEQ ID NO: 48 RP-afp exon 3 TTATTCACTCACCTGGTTTTCTAAACA SEQ ID NO: 49 FP-afp intron 5 GTAAATTTGTCTGGCACAGATGCA SEQ ID NO: 50 FP afp uni amp exon 4 AAGTGAAGAGGGAAGACATAACTGTTT SEQ ID NO: 51 RP afp exon 4 (set 1) GAACTTGGAAAAGTGGGATCGA SEQ ID NO: 52 RP afp uni amp exon 5 CATAGCGAGCAGCCCAAAG SEQ ID NO: 53 FP-HSV-tk (set 1) GATGGGGGAGGCGAACTG SEQ ID NO: 54 FP-afp exon 3 AGTCTTCAGGGTGTTTAGAAAACCA SEQ ID NO: 55 FP-afp exon 5 (set 1) AAGGCATCCCTTCCTGTATGC SEQ ID NO: 56 RP-afp exon 6 AAGCTGCTTTCTCTTAATTCTTTTGTAAC SEQ ID NO: 57 FP-HSV-tk (set 2) CAGGCTCCATACCGACGATC SEQ ID NO: 58 RP-afp exon 4 (set 2) TTCTCATGGCAAAGTTCTTCCA SEQ ID NO: 59 FP-afp exon 5 (set 2) CATCTTGCTGCAAAGCTGAAAA SEQ ID NO: 60 RP-HSV-tk TTGCTGTGTCCCCGTGATC SEQ ID NO: 61 FP-HSVtk (for 3ER) CGTTTAAACTTAAGCTTATGGCTTCG SEQ ID NO: 62 RP-HSVtk (for 5ER) CAGTTCGCCTCCCCCATC SEQ ID NO: 63 Stem loop uni probe TCGCACTGGATACG 5′FAM 3′BHQ SEQ ID NO: 64 FP-5′HH cleavage ACACACACACACACTCATACCACATT SEQ ID NO: 65 Stem Loop 5′par GTCGTATCCAGTGCAGGGTCCGAGGTATTCGC ACTGGATACGACACATTT SEQ ID NO: 66 Stem Loop 5′HH mut GTCGTATCCAGTGCAGGGTCCGAGGTATTCGC ACTGGATACGACGGTATT SEQ ID NO: 67 RP-stem loop uni CAGTGCAGGGTCCGAGGT SEQ ID NO: 68 HCCA2-FP-exon 2 CCCTCTTCCTCACTTCCCTGTACT SEQ ID NO: 69 HCCA2-RP-exon 3 CATCAGCTCATTTGCTTCCTTTT SEQ ID NO: 70 CD24-FP-exon 2 GACATGGGCAGAGCAATGGT SEQ ID NO: 71 CD24-RP-exon 3 ACTCTGGGAGGAGTTACTTGAAGTTC SEQ ID NO: 72 VEGF-FP-exon 2 AGAAGGAGGAGGGCAGAATCA SEQ ID NO: 73 VEGF-RP-exon 3 AGCTGCGCTGATAGACATCCA For real time RT-PCR to detect HCC markers SEQ ID NO: 74 FP-AFP TTTGGGCTGCTCGCTATGA SEQ ID NO: 75 RP-AFP (exon) TTGTTTGGAAGCATTCAACTGC SEQ ID NO: 76 RP-AFP (intron) GACTACAGTACTGGTTCAGATATCCACG SEQ ID NO: 77 FP-CD24 GCATCCTGCTAGACGCGC SEQ ID NO: 78 RP-CD24 (exon) TGCGTGGGTAGGAGCAGTG SEQ ID NO: 79 RP-CD24 (intron) GCGCCAGGGCCTCAC SEQ ID NO: 80 FP-HCCA2 CTAGGAAAATGCTGTGAAGAGATCC SEQ ID NO: 81 RP-HCCA2 (exon) CCAGAATGGGAGCCGGT SEQ ID NO: 82 RP HCCA2 (intron) TTCTGCCTCCTACAGACTTCGAG SEQ ID NO: 83 FP-HSP70 TACGTGGCCTTCACGGACA SEQ ID NO: 84 RP-HSP70 AACTTGCGGCCAATCAGC SEQ ID NO: 85 FP-AFU TCCTGTGTCTTGGAACTGGAACT SEQ ID NO: 86 RP-AFU (exon) CTTCCGGAGAGCTGTTCCC SEQ ID NO: 87 RP-AFU (intron) CCAGATCCAAAGAGATAACAGAGTAACC SEQ ID NO: 88 FP-TGFB1 CAAGGACCTCGGCTGGAAG SEQ ID NO: 89 RP-TGFB1 (exon) TGCTGTACTGCGTGTCCAGG SEQ ID NO: 90 RP-TGFB1 (intron) GCATCTCGTAGCCCGGTG SEQ ID NO: 91 FP-VEGF AGTACATCTTCAAGCCATCCTGTGT SEQ ID NO: 92 RP-VEGF (exon) GGTGATGTTGGACTCCTCAGTG SEQ ID NO: 93 RP-VEGF (intron) TTGCCCCACTTCCCAAAG SEQ ID NO: 94 FP-GPC3 CGGACGCCACCTGTCAC SEQ ID NO: 95 RP-GPC3 (exon) TGGCACGGGAGTTTCTGG SEQ ID NO: 96 RP-GPC3 (intron) TAGCGCGCTCAGGGTACAG SEQ ID NO: 97 FP-IGFII GGAAGCTGTTGATACCAAAAATAATG SEQ ID NO: 98 RP-IGFII (exon) GATAAGTGCGTGTCTTCAAGTCGT SEQ ID NO: 99 RP-IGFII (intron) TTGCCAATCCAGTAATTTCAGGTAG SEQ ID NO: 100 FP-GGT GGTCCTGGTGCTGGTCATTG SEQ ID NO: 101 RP-GGT (exon) TCCCAATCTTCGAGCACTGC SEQ ID NO: 102 RP-GGT (intron) ATGTCCCATGCCCTGCC SEQ ID NO: 103 FP-GP73 CAGGCTGCCCTGTCAGTGA SEQ ID NO: 104 RP-GP73 (exon) CCGGCAGCTTCCTGCTC SEQ ID NO: 105 RP-GP73 (intron) GTCTGGTGGAAGGGAGTCCA SEQ ID NO: 106 FP-HGP TTGATAAAGCAAGAAAACAATGCC SEQ ID NO: 107 RP-HGP (exon) TTTGTTTTCATAGAGGTCAAATTCATG SEQ ID NO: 108 RP-HGP (intron) TTATGCAATATTTAGGGAGAAGTCAGTTAC Oligos synthesized for generating trans-splicing constructs SEQ ID NO: 109 3ER_ΔBD-opt GCTAGCACCTCTCTAAACCTAAATTAAATTTTC TTCTCTAAACCTCTTTCCTAGTTggactgacgtcactc agtcctcctCCTCAGC SEQ ID NO: 110 3ER_BD-opt GCTAGCACCTCTCTAAACCTAAAAAAAATTTTC TTCTCTAAACCTCTTTCCTAGTTggactgacgtcactc agtcctcctCCTCAGC SEQ ID NO: 111 3ER_ΔBD-struc1 GCTAGCGGACTGCTTGAAGCAAGTAGTTAATG GTGGATATTATTCTACCggactgacgtcactcagtcctcct CCTCAGC SEQ ID NO: 112 3ER_ΔBD-struc2 GCTAGCCTTCTCAGTTACAAAAAATACGATGTT ATCACGGCTTGACACTGAGAGTAggactgacgtcac tcagtcctcctCCTCAGC SEQ ID NO: 113 3ER_ΔBD-opt-inv GCTAGCACCTCTCTAAACCTAAATTAAATTTTC TTCTCTAAACCTCTTTCCTAGTTaactaggaaagag gttagagagaaatttaattaggttgagaggtCCTCAGC SEQ ID NO: 114 3ER_BD-struc1 GCTAGCGGACTGCTGGTAGCAAGTAGTTAATG GTGGATATTATTCTACCggactgacgtcactcagtcctcct CCTCAGC SEQ ID NO: 115 3ER_BD-struc2 GCTAGCCTTCTCAGTTACAAAAAATACGATTGT ATCACGGCTTGACACTGAGAGTAggactgacgtcac tcagtcctcctCCTCAGC SEQ ID NO: 116 3ER_BD-opt-inv GCTAGCACCTCTCTAAACCTAAAAAAAATTTTC TTCTCTAAACCTCTTTCCTAGTTaactaggaaagag gttagagagaaatttatttaggttgagaggtCCTCAGC SEQ ID NO: 117 5ER_ΔBD-opt_HH GGTACCcctccttcctcctcctcctcctcctcctcctATATACAT ATACACACACACACACACTCATACCACATTTCA GACATAAAT SEQ ID NO: 118 5ER_BD-opt_HH GGTACCcctccttcctcctcctcctcctcctcctcctATATACAT ATACACACACACACACAGACATACCACATTTCA GACATAAAT SEQ ID NO: 119 5ER_ΔBD-struc1_HH GGTACCcctccttcctcctcctcctcctcctcctcctGATGTAA CTTTTGGAATTGCAAATACATATGTATATATTAT GTAATATACATACATATATACATA SEQ ID NO: 120 5ER_ΔBD-opt-inv GGTACCctgaaatgtgtagattgtgtgtgtggtgtatatgtaatAT ATACATATACACACACACACACACTCATACCAC ATTTCAGACATAAAT SEQ ID NO: 121 5ER_BD-struc1_HH GGTACCcctccttcctcctcctcctcctcctcctcctGATGTAA CTTTTGGAATTGCAAATACATATGTATATATTAT GGTATATACATACATATATACATA SEQ ID NO: 122 5ER_BD-opt-inv GGTACCctgaaatgtgtagattgtgtgtgtggtgtatatgtaatAT ATACATATACACACACACACACAGACATACCAC ATTTCAGACATAAAT SEQ ID NO: 123 5ER_ΔBD-opt_hp_HH GGTACCcctccttcctcctcctcctcctcctcctcctATATACAT ATACACACACACACACACTCATACCACATTTCA TGAAATGTGGTGACATAAAT SEQ ID NO: 124 5ER_ΔBD-opt_Y_HH GGTACCcctccttcctcctcctcctcctcctcctcctATATACAT ATACACACACACACACACTCATACCACATTTCA GACATAAATGCGTAGATAATAACCAGTGAACTT AGCGTGCATGAAAACATGCACGCTGAATCGTA GAAAAAAATCTACGATTCAAGTTCACTGGAATT TATCTACGCGACATAAAT SEQ ID NO: 125 3ER_ΔBD-opt_HCCA2 AACAGTTCCTTTATACAAATTCACCAGATGATA CAAAACATATTTTTTTA SEQ ID NO: 126 3ER_ΔBD-opt_CD24 GATCTAAAGATCCTCTACACCAACAAAATTATT ATGTTTATTAAACTGAC SEQ ID NO: 127 3ER_ΔBD-opt_VEGF TTTCTTTCTTGATCCTTATATTCCTGTGCCCCTT TTCCTTCCTCCCCACA Underlined = Restriction enzymes Lowercase = spacer/linker sequences