LEAPER TECHNOLOGY BASED METHOD FOR TREATING MPS IH AND COMPOSITION
20230060518 · 2023-03-02
Inventors
- Pengfei YUAN (Beijing, CN)
- Yanxia Zhao (Beijing, CN)
- Nengyin Liu (Beijing, CN)
- Zexuan Yi (Beijing, CN)
- Gangbin Tang (Beijing, CN)
Cpc classification
C12N9/22
CHEMISTRY; METALLURGY
C12N2740/16043
CHEMISTRY; METALLURGY
C12N15/87
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
C12N15/90
CHEMISTRY; METALLURGY
C12N9/78
CHEMISTRY; METALLURGY
A61K31/7105
HUMAN NECESSITIES
C12N9/2402
CHEMISTRY; METALLURGY
International classification
C12N15/10
CHEMISTRY; METALLURGY
C12N15/87
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
C12N15/86
CHEMISTRY; METALLURGY
Abstract
The present application relates to an LEAPER-based method for targeted editing of RNA, which comprises: safely and effectively performing adenosine-to-inosine RNA editing (A-to-I RNA editing) by LEAPER in vivo, thereby the pathogenic mutation sites can be accurately repaired, and all diseases caused by G>A mutations, such as MPS IH, can be treated.
Claims
1. A LEAPER-based method for target editing of a target RNA in a target cell, wherein the target RNA is an RNA comprising a G-to-A mutation in an IDUA gene transcript, and the method comprises: delivering a construct comprising an adenosine deaminase-recruiting RNA (arRNA) for editing the target RNA or a construct for encoding the arRNA to the target cell, wherein the arRNA comprises a complementary RNA sequence that hybridizes with the target RNA, and is capable of recruiting adenosine deaminase acting on RNA (ADAR) to deaminate target adenosine (A) in the target RNA.
2. The method according to claim 1, wherein the arRNA comprises base C, A, U or G that is paired with the target A.
3. The method according to claim 1, wherein the length of the arRNA is about 151-61 nt.
4. The method according to claim 3, wherein a distance between a targeting base in the arRNA and the 3′-terminal is 45-5 nt.
5. The method according to claim 3, wherein the distance between the targeting base and the 5′-terminal in the arRNA is 80-30 nt.
6. The method according to claim 1, wherein the target cell is a human cell.
7. The method according to claim 1, wherein the target RNA is a RNA comprising the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation site.
8. The method according to claim 1, wherein the arRNA comprises any one of the following sequences: SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 17, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 34, SEQ ID NO: 44 and SEQ ID NO: 52.
9. (canceled)
10. The method according to claim 1, wherein the arRNA is chemically modified.
11. The method according to claim 10, wherein the chemical modification comprises 2-O′-methylation (2′-OMe) or thiophosphate modification.
12. The method according to claim 11, wherein the chemical modification is one or more selected from the group consisting of: 2′-OMe modification on each of the first three nucleotides and the last three nucleotides in the arRNA sequence; thiophosphate bond linkage between nucleotides of the first three nucleotides and the last three nucleotides; 2′-OMe modification on all bases U in the arRNA sequence; 2′-OMe modification on the base A which is the 3′ base nearest to the targeting base; 2′-OMe modification on the base C which is the 5′ base nearest to the targeting base; thiophosphate bond linkage between the targeting base with its 3′ nearest base and 5′ nearest base respectively; 2′-OMe modification on each of the first five nucleotides and the last five nucleotides; and thiophosphate bond linkage between nucleotides of the first five nucleotides and the last five nucleotides.
13. The method according to claim 1, wherein the construct for encoding the arRNA is a linear nucleic acid chain, virus vector or plasmid.
14. The method according to claim 13, wherein the virus vector is an adeno-associated virus (AAV) vector or lentivirus expression vector.
15. The method according to claim 1, wherein the delivery method is electrotransfection, lipofection, or lipid nanoparticle (LNP) delivery or infection.
16. (canceled)
17. The method according to claim 1, wherein the delivery concentration of the arRNA is equal to or greater than 2.5 nM, 5 nM, 10 nM, 15 nM or 20 nM.
18. An arRNA for LEAPER-based target editing of a target RNA in a target cell or a coding sequence of the arRNA, wherein the arRNA comprises or consists of any one of the following sequences: SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 17, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 34, SEQ ID NO: 44, and SEQ ID NO: 52.
19. A plasmid, virus vector, liposome, lipid nanoparticle, composition or biological product, comprising the arRNA or the coding sequence of the arRNA according to claim 18.
20. (canceled)
21. A method for treating MPS IH in an individual, comprising correcting a G-to-A mutation, associated with MPS IH, in a target cell of the individual by the method according to claim 1.
22. The method according to claim 21, wherein the mutation is NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation.
23. The method according to claim 21, wherein the arRNA is administrated at a frequency of ≥every 10 days.
Description
BRIEF DESCRIPTION OF THE ACCOMPANYING DRAWINGS
[0054]
[0055]
[0056]
[0057]
[0058]
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
DETAILED DESCRIPTION OF THE INVENTION
Definition
[0068] RNA editing is a natural process occurring in eukaryotic cells. RNA editing involves the conversion of base A (adenosine) to base I (inosine) in RNA that occurs before protein translation and after DNA transcription, during translation, inosine is identified as G, and A-to-I RNA editing diversifies the transcriptome. The total number of RNAs will be increased several folds by accurately converting the RNA molecules. Such editing is catalyzed by adenosine deaminase acting on RNA (ADAR), so it is called site-directed RNA editing. The editing may occur in coding regions comprising intron and exon sequences, or may occur in non-coding regions, and the editing in coding regions can redefine protein-coding sequences.
[0069] As used herein, “LEAPER” is a technique using engineered RNA to recruit endogenous ADAR used for RNA editing, which has been disclosed in WO2020074001A1. As used herein, the engineered RNA refers to adenosine deaminase-recruiting RNA (arRNA), which can recruit ADAR or some compounds comprising an ADAR domain to deaminate target adenosine in target RNA.
[0070] As used herein, the term “adenosine deaminase acting on RNA (ADAR)” refers to a class of adenosine deaminase widely expressed in tissues of eukaryotes (including mammals such as humans), which can catalyze the conversion of adenosine (A) to inosine (I) in an RNA molecule. However, in the synthesis process of proteins in eukaryotes, I is usually translated into G.
[0071] As used herein, the “complementarity” of nucleic acids refers to the ability of a nucleic acid to form hydrogen bonds with another nucleic acid by the conventional Watson-Crick base pairing. The percentage of complementarity refers to the percentage of residues in a nucleic acid molecule that can form hydrogen bonds with residues in another nucleic acid molecule (i.e. Watson-Crick base pairing). For example, among 10 residues of a nucleic acid molecule, about 5, 6, 7, 8, 9 or 10 residues can form hydrogen bonds with residues in another nucleic acid molecule, which means the percentage of complementarity of the two nucleic acid molecules is about 50%, 60%, 70%, 80%, 90% or 100%. “Complete complementation” refers to that all continuous residues in a nucleotide sequence can form hydrogen bonds with the same number of continuous residues in another nucleotide sequence. As used herein, “substantially complementary” refers to that in about 40, 50, 60, 70, 80, 100, 150, 200, 250 or more nucleotide regions, the percentage of complementarity is at least about 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99%, or refers to that two nucleic acids hybridize under strict conditions. For a single base or single nucleotide, the pairing of A with T or U, and pairing of C with G or I in Watson-Crick base pairing principle can be called complementation or matching, and vice versa. The pairing of the bases otherwise should be called non-complementation or non-matching.
[0072] “Hybridization” refers to a process in which one or more polynucleotides react to form a compound, and the compound is stabilized by hydrogen bonds between the bases of the nucleotide residues. The hydrogen bonds can also be formed by Watson Crick base pairing, Hoogstein pairing or any other sequence-specific methods. A sequence that can hybridize with a given sequence is called a “complementary sequence” of the given sequence.
[0073] As used herein, the term “electrotransfection” refers to electroporation transfection, by which cells are exposed to an electric field for microseconds to milliseconds, apertures or openings are temporarily formed on the cell membrane, and macromolecules, such as DNA, are delivered to the cells.
[0074] As used here, the term “lipofection (Lipo)” refers to a transfection technique using a liposome as a delivery vector in vivo or in vitro. Liposomes include neutral liposomes and cationic liposomes. For a neutral liposome, a macromolecule, such as nucleic acid, is wrapped with the lipid membrane, and then delivered to the cell membrane via the lipid membrane, and for a cationic liposome that is positively charged, a macromolecule to be delivered is not embedded in the cationic liposome in advance, the macromolecule that is negatively charged automatically binds to the positively charged liposome to form a macromolecule-cationic liposome compound, which will be adsorbed to the surface of the cell membrane that is negatively charged, and delivered to a cell by endocytosis.
[0075] As used herein, the term “lipid nanoparticle (LNP) delivery” refers to transmembrane delivery of a macromolecule, such as nucleic acid and protein, via a lipid nanoparticle. Lipid nanoparticles refer to particles synthesized by mixing two phases, which comprise an ethanol phase containing ionizable lipids, auxiliary phospholipids, cholesterol, and PEG lipids, and an acidic aqueous phase containing macromolecules such as nucleic acids and proteins. For example, RNA wrapped with LNP can be delivered to the cytoplasm by endocytosis.
[0076] Herein, the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation refers to a G-to-A mutation at the 1205 site on the No. 000203.4 transcript of the IDUA gene, which will convert a coding sequence of tryptophan (Trp) on the 402 site of a peptide chain translated by the transcript to a termination codon (Ter), whereby the finally translated peptide chain lacks all amino acids after the 402 site and loses the IDUA enzymatic activity. Such a mutation will lead to a deficiency of active α-L-iduronidase, which affects the degradation of glycosaminoglycans in the lysosomes of cells, and eventually causes teratogenicity and even death of patients. However, the technical solutions of the present application can reverse the mutation during transcription to recover the IDUA enzymatic activity.
[0077] As used herein, the term “target RNA” refers to target RNA to be edited, which comprises adenosine (A) to be edited. The target RNA may be mature mRNA, or may be precursor mRNA. In the present application, the target RNA is more preferably precursor mRNA. Herein, a cell comprising the “target RNA” is called a target cell. The adenosine to be edited is called a “target base”, “target adenosine” or “target A”. In the present application, “target base”, “target adenosine” and “target A” can be used interchangeably. A base, adjacent to target adenosine, at the 5′-terminal of target RNA is called a “5′ adjacent base”; a base, adjacent to the target adenosine, at the 3′-terminal of the target RNA is called a “3′ adjacent base”; and a base triplet composed of the target base, the 3′ adjacent base, and the 5′ adjacent base is called a “target base triplet” herein. When arRNA hybridizes with the target RNA, the base, corresponding to the target base, on the arRNA is called “targeting base”; the base, adjacent to the targeting base, at the 5′-terminal of the arRNA is called the “5′ nearest base”; a base, adjacent to the targeting base, at the 3′-terminal of the arRNA is called the “3′ nearest base”, and a base triplet composed of the targeting base, the 3′ nearest base, and the 5′ nearest base is called a “targeting base triplet” herein.
[0078] As uses herein, the term “construct” refers to a nucleic acid vector comprising a certain nucleotide sequence, which may be a linear nucleic acid molecule, plasmid or virus vector. The nucleic acid molecule may be a single-stranded or double-stranded molecule. The nucleotide sequence may be a DNA sequence or RNA sequence. In some embodiments, the nucleotide sequence can work without being subjected to transcription, translation or expression. In some embodiments, the nucleotide sequence is a DNA sequence, and works in the form of RNA molecules that are generated by transcribing the nucleotide sequence. In some embodiments, the nucleotide sequence is an RNA sequence, and works in the form of polypeptides or proteins that are generated by translating the nucleotide sequence. In some embodiments, the nucleotide sequence is a DNA sequence, and work in the form of proteins generated by transcribing and translating the nucleotide sequence. The construct can be prepared into the form of virus, lipid nanoparticle or exosome, and then delivered to a target cell, or can be delivered to a target cell by electrotransformation, microjection or chemical conversion.
[0079] As used herein, the term “delivery” refers to introduction of biomacromolecules, such as nucleic acids and proteins, into the cell membrane by certain ways. The “delivery” methods include electrotransfection, lipofection, lipid nanoparticle delivery, virus delivery, and exosome delivery.
[0080] The term “modification” used in the present application refers to that one or more properties or functions of a nucleic acid or protein are changed by a chemical or biological method. For example, the composition or structure of the nucleic acid or protein is changed by gene engineering.
[0081] Unless otherwise define, all technical terms and scientific terms used herein have the same meanings as commonly understood by those of ordinary skill in the art to which the present application belongs.
[0082] RNA Editing Method
[0083] The present application provides an LEAPER-based method for targeted editing of target RNA comprising a G-to-A mutation in the IDUA gene in the target cell, which comprises: a construct containing adenosine deaminase-recruiting RNA (arRNA) used for editing target RNA or a construct for encoding the arRNA is delivered to the target cell, wherein the arRNA comprises a complementary RNA sequence that hybridizes with the target RNA, and can recruit adenosine deaminase acting on RNA (ADAR) to deaminize target adenosine (A) in the target RNA. In some embodiments, the target RNA is precursor mRNA. In some embodiments, the target RNA is mature mRNA. In some embodiments, the target RNA is transcribed RNA comprising the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation site in the IDUA gene.
[0084] In some embodiments, the arRNA comprises base C, A, U or G that is paired with the target A. The bases paired with the target A are sorted by priority as C>A>U>G. That is, under the conditions of the same length of arRNA, the same distance between targeting bases and the 5′-terminal, the same distance between the targeting bases and the 3′-terminal, and the completely same arRNA sequence except the targeting bases, the targeting bases are sorted by priority as C>A>U>G. If the targeting base is C, the arRNA can be expressed as X nt-c-Y nt, where, X refers to that a distance between the targeting base and the 5′-terminal is X nt, Y refers to that a distance between the targeting base and the 3′-terminal is Y nt, and X and Y can be any natural number.
[0085] In some embodiments, the target cell is a eukaryotic cell. In some embodiments, the target cell is a mammalian cell. In some embodiments, the target cell is hepatocyte or fibroblast. In some embodiments, the target cell is a human or mouse cell.
[0086] In some embodiments, the length of the arRNA is about 151-61 nt, 131-66 nt, 121-66 nt, 111-66 nt, 91-66 nt or 81-66 nt. In some embodiments, a distance between a targeting base in the arRNA and the 3′-terminal is 45-5 nt, 40-5 nt, 35-10 nt, 25-15 nt or 24-11 nt. In some embodiments, a distance between the targeting base in the arRNA and the 5′-terminal is 80-30 nt, 70-35 nt, 60-40 nt, 55-35 nt or 55-45 nt. The distance between the targeting base and the 3′-terminal refers to the number of bases from the 3′ nearest base of the targeting base to the last base at the 3′-terminal; and the distance between the targeting base and the 5′-terminal refers to the number of bases from the 5′ nearest base of the targeting base to the last base at the 5′-terminal.
[0087] In some embodiments, if the target cell is a human cell, and the target RNA is transcribed RNA comprising the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation site in the IDUA gene, the full length of the arRNA is equal to or greater than 66 nt, such as about 121-66 nt, 111-66 nt, 101-66 nt, 91-66 nt, and 81-66 nt, that is, the full length of the arRNA is any natural number selected from the above length ranges, such as 67 nt, 68 nt, 69 nt, 70 nt, 71 nt, 72 nt, 73 nt, 74 nt, 75 nt, 76 nt, 77 nt, 78 nt, 79 nt, 80 nt, 81 nt, 82 nt, 83 nt, 84 nt, 85 nt, 86 nt, 87 nt, 88 nt, 89 nt, 90 nt, 91 nt, 95 nt, 100 nt, 110 nt, 115 nt, and 120 nt. In some embodiments, a distance between a targeting base in the arRNA and the 3′-terminal is 45-5 nt, 40-5 nt, 35-10 nt, 25-15 nt or 24-11 nt, that is, the distance between the targeting base in the arRNA and the 3′-terminal is any natural number selected from the above distance ranges, such as 12 nt, 13 nt, 14 nt, 16 nt, 17 nt, 18 nt, 19 nt, 20 nt, 21 nt, 22 nt, and 23 nt. In some embodiments, a distance between the targeting base in the arRNA and the 5′-terminal is 80-30 nt, 70-35 nt, 60-40 nt, 55-35 nt or 55-45 nt, that is, the distance between the targeting base in the arRNA and the 5′-terminal is any natural number selected from the above distance ranges, such as 46 nt, 47 nt, 48 nt, 49 nt, 50 nt, 51 nt, 52 nt, 53 nt, and 54 nt. In some embodiments, the arRNA comprises any one of the following sequences: SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 17, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 30, SEQ ID NO: 31, and SEQ ID NO: 34.
[0088] In some embodiments, the target cell is a mouse cell (e.g. W392X mouse cell), and the target RNA is transcribed RNA comprising a mutation, corresponding to the human W402X mutation, in the IDUA gene. In some embodiments, the target cell is a W392X mouse cell. In some embodiments, the length of the arRNA is about 121-53 nt, 111-61 nt, 101-61 nt, 91-61 nt, 81-61 nt, 111-66 nt or 105-66 nt, that is, the full length of the arRNA is any natural number selected from the above length ranges, such as 67 nt, 68 nt, 69 nt, 70 nt, 71 nt, 72 nt, 73 nt, 74 nt, 75 nt, 76 nt, 77 nt, 78 nt, 79 nt, 80 nt, 81 nt, 82 nt, 83 nt, 84 nt, 85 nt, 86 nt, 87 nt, 88 nt, 89 nt, 90 nt, 91 nt, 95 nt, 100 nt, 110 nt, 115 nt, and 120 nt. In some embodiments, a distance between a targeting base in the arRNA and the 3′-terminal is 55-10 nt or 50-10 nt, that is, the distance between the targeting base in the arRNA and the 3′-terminal is any natural number selected from the above length ranges, such as 11 nt, 12 nt, 13 nt, 14 nt, 16 nt, 17 nt, 18 nt, 19 nt, 20 nt, 21 nt, 22 nt, 23 nt, 24 nt, 25 nt, 26 nt, 27 nt, 28 nt, 29 nt, 30 nt, 31 nt, 32 nt, 33 nt, 34 nt, 35 nt, 35 nt, 37 nt, 38 nt, 39 nt, 40 nt, 41 nt, 42 nt, 43 nt, 44 nt, 45 nt, 46 nt, 47 nt, 48 nt, 49 nt, and 50 nt. In some embodiments, a distance between the targeting base in the arRNA and the 5′-terminal is 80-30 nt, 70-35 nt, 60-40 nt, 55-35 nt or 55-45 nt, that is, the distance between the targeting base in the arRNA and the 5′-terminal is any natural number selected from the above distance ranges, such as 33 nt, 36 nt, 47 nt, 46 nt, 47 nt, 48 nt, 49 nt, 50 nt, 51 nt, 52 nt, 53 nt, 54 nt, 60 nt, 65 nt, and 75 nt. In some embodiments, the arRNA comprises any one of the following sequences: SEQ ID NO: 44 and SEQ ID NO: 52.
[0089] In some embodiments, the arRNA is chemically modified. In some embodiments, the chemical modification comprises 2-O′-methylation and/or thiophosphate modification. In some embodiments, the chemical modification is selected from any one or more of:
[0090] the first three nucleotides and the last three nucleotides in the sequence are respectively subjected to 2′-OMe modification;
[0091] the first three nucleotides and the last three nucleotides are respectively linked via thiophosphate bonds
[0092] all bases U in the sequence are subjected to 2′-OMe modification.
[0093] the 3′ nearest base of the targeting base is A subjected to 2′-OMe modification,
[0094] the 5′ nearest base of the targeting base is C subjected to 2′-OMe modification,
[0095] the targeting base is linked to the 3′ nearest base and the 5′ nearest base of the targeting base respectively via thiophosphate bonds,
[0096] the first five nucleotides and the last five nucleotides are respectively subjected to 2′-OMe modification, and
[0097] the first five nucleotides and the last five nucleotides are respectively linked via thiophosphate bonds.
[0098] As described herein, the construct for encoding the arRNA is a construct comprising a coding sequence of the arRNA. In some embodiments, the arRNA is generated by transcribing the construct for encoding the arRNA after the construct for encoding the arRNA is delivered to a target cell. In some embodiments, the construct for encoding the arRNA is a linear nucleic acid chain, virus vector or plasmid. In some embodiments, the virus vector is an adeno-associated virus (AAV) vector or lentivirus vector. In some embodiments, after being delivered to the target cell, the construct for encoding the arRNA inserts the coding sequence of the arRNA into the genome of the target cell by homologous recombination or non-homologous recombination so as to continuously generate the arRNA by transcription. In some embodiments, after being delivered to the target cell, the construct for encoding the arRNA enables the coding sequence of the arRNA to exist as a part of a free nucleic acid in the target cell, so that the coding sequence of the arRNA can generate the arRNA by transcription in a certain time period.
[0099] In some embodiment, the delivery method is electrotransfection, lipofection, or lipid nanoparticle (LNP) delivery or infection. In some embodiments, if the target cell is hepatocyte, the delivery method is LNP delivery. In some embodiments, if the target cell is fibroblast, the delivery method is lipofection. In some embodiments, the delivery concentration of the arRNA is equal to or greater than 2.5-5 nM, and preferably equal to or greater than 10-20 nM, such as equal to or greater than 15 nM. In the present application, the delivery concentration refers to the arRNA content in one volume unit of a delivery system where one volume unit of arRNA and a target cell are located after the arRNA construct is delivered to the target cell. The delivery system comprises the arRNA or its construct, the target cell, and a fluid matrix around the arRNA and the target cell. In some embodiments, the fluid matrix may be a cell culture medium, PBS or other solutions that have the same osmotic pressure as that of the cytoplasm and can maintain stable survival conditions of cells. In some embodiments, the delivery system further comprises a reagent for promoting delivery.
[0100] arRNA
[0101] The present application also provides arRNA, which can be used for LEAPER-based targeted editing of target RNA, such as transcribed RNA comprising the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation site, in a target cell, in order to deaminate target A in the target RNA to inosine (I). In the subsequent translation process of the target cell, I will be identified as G, so that the G>A mutation can be recovered to G, and the target RNA can be translated into a correct protein after being edited by the arRNA. In some embodiments, the target RNA is precursor mRNA. In some embodiments, the target RNA is mature mRNA.
[0102] In some embodiments, the arRNA has the base (targeting base) C, A, U or G that is paired with the target A, and the bases are sorted by editing efficiency as C>A>U>G. In some embodiments, in addition to the targeting base, other bases in the arRNA can be paired with the target RNA in a complementary manner. In some embodiments, except the targeting base, one or more bases in the arRNA are mispaired with the target RNA.
[0103] In some embodiments, the target cell is a eukaryotic cell. In some embodiments, the target cell is a mammalian cell. In some embodiments, the target cell is hepatocyte or fibroblast. In some embodiments, the target cell is a human or mouse cell (e.g. W392X mouse cell).
[0104] In some embodiments, the length of the arRNA is about 151-61 nt, 131-66 nt, 121-66 nt, 111-66 nt, 91-66 nt or 81-66 nt. In some embodiments, a distance between a targeting base in the arRNA and the 3′-terminal is 45-5 nt, 40-5 nt, 35-10 nt, 25-15 nt or 24-11 nt. In some embodiments, a distance between the targeting base in the arRNA and the 5′-terminal is 80-30 nt, 70-35 nt, 60-40 nt, 55-35 nt or 55-45 nt. The distance between the targeting base and the 3′-terminal refers to the number of bases from the 3′ nearest base of the targeting base to the last base at the 3′-terminal; and the distance between the targeting base and the 5′-terminal refers to the number of bases from the 5′ nearest base of the targeting base to the last base at the 5′-terminal.
[0105] In some embodiments, if the target cell is a human cell, and the target RNA is transcribed RNA comprising the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation site in the IDUA gene, the full length of the arRNA is equal to or greater than 66 nt, such as about 121-66 nt, 111-66 nt, 101-66 nt, 91-66 nt, and 81-66 nt, that is, the full length of the arRNA is any natural number selected from the above length ranges, such as 67 nt, 68 nt, 69 nt, 70 nt, 71 nt, 72 nt, 73 nt, 74 nt, 75 nt, 76 nt, 77 nt, 78 nt, 79 nt, 80 nt, 81 nt, 82 nt, 83 nt, 84 nt, 85 nt, 86 nt, 87 nt, 88 nt, 89 nt, 90 nt, 91 nt, 95 nt, 100 nt, 110 nt, 115 nt, and 120 nt. In some embodiments, a distance between a targeting base in the arRNA and the 3′-terminal is 45-5 nt, 40-5 nt, 35-10 nt, 25-15 nt or 24-11 nt, that is, the distance between the targeting base in the arRNA and the 3′-terminal is any natural number selected from the above distance ranges, such as 12 nt, 13 nt, 14 nt, 16 nt, 17 nt, 18 nt, 19 nt, 20 nt, 21 nt, 22 nt, and 23 nt. In some embodiments, a distance between the targeting base in the arRNA and the 5′-terminal is 80-30 nt, 70-35 nt, 60-40 nt, 55-35 nt or 55-45 nt, that is, the distance between the targeting base in the arRNA and the 5′-terminal is any natural number selected from the above distance ranges, such as 46 nt, 47 nt, 48 nt, 49 nt, 50 nt, 51 nt, 52 nt, 53 nt, and 54 nt. In some embodiments, the arRNA comprises any one of the following sequences: SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 17, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 30, SEQ ID NO: 31, and SEQ ID NO: 34.
[0106] In some embodiments, if the target cell is a mouse cell (e.g. W392X mouse cell), and the target RNA is transcribed RNA comprising a mutation, corresponding to the human W402X mutation, in the IDUA gene, the length of the arRNA is about 121-53 nt, 111-61 nt, 101-61 nt, 91-61 nt, 81-61 nt, 111-66 nt or 105-66 nt, that is, the full length of the arRNA is any natural number selected from the above length ranges, such as 67 nt, 68 nt, 69 nt, 70 nt, 71 nt, 72 nt, 73 nt, 74 nt, 75 nt, 76 nt, 77 nt, 78 nt, 79 nt, 80 nt, 81 nt, 82 nt, 83 nt, 84 nt, 85 nt, 86 nt, 87 nt, 88 nt, 89 nt, 90 nt, 91 nt, 95 nt, 100 nt, 110 nt, 115 nt, and 120 nt. In some embodiments, a distance between a targeting base in the arRNA and the 3′-terminal is 55-10 nt or 50-10 nt, that is, the distance between the targeting base in the arRNA and the 3′-terminal is any natural number selected from the above distance ranges, such as 11 nt, 12 nt, 13 nt, 14 nt, 16 nt, 17 nt, 18 nt, 19 nt, 20 nt, 21 nt, 22 nt, 23 nt, 24 nt, 25 nt, 26 nt, 27 nt, 28 nt, 29 nt, 30 nt, 31 nt, 32 nt, 33 nt, 34 nt, 35 nt, 35 nt, 37 nt, 38 nt, 39 nt, 40 nt, 41 nt, 42 nt, 43 nt, 44 nt, 45 nt, 46 nt, 47 nt, 48 nt, 49 nt, and 50 nt. In some embodiments, a distance between the targeting base in the arRNA and the 5′-terminal is 80-30 nt, 70-35 nt, 60-40 nt, 55-35 nt or 55-45 nt, that is, the distance between the targeting base in the arRNA and the 5′-terminal is any natural number selected from the above distance ranges, such as 33 nt, 36 nt, 47 nt, 46 nt, 47 nt, 48 nt, 49 nt, 50 nt, 51 nt, 52 nt, 53 nt, 54 nt, 60 nt, 65 nt, and 75 nt. In some embodiments, the arRNA comprises any one of the following sequences: SEQ ID NO: 44 and SEQ ID NO: 52.
[0107] In some embodiments, the arRNA is generated by transcribing and expressing the construct for encoding the arRNA. In some embodiments, the arRNA is obtained by transcribing and expressing the construct for encoding the arRNA in vitro and purifying. In some embodiments, the arRNA is obtained by directly expressing the construct for encoding the arRNA in vivo and used for editing. In some embodiments, the construct is a virus vector, plasmid or linear nucleic acid. In some embodiments, the virus is AAV or lentivirus.
[0108] In some embodiments, the arRNA is chemically synthesized. In some embodiments, the arRNA is chemically modified. In some embodiments, the chemical modification comprises 2-O′-methylation and/or thiophosphate modification. In some embodiments, the chemical modification is selected from any one or more of:
[0109] the first three nucleotides and the last three nucleotides in the sequence are respectively subjected to 2′-OMe modification;
[0110] the first three nucleotides and the last three nucleotides are respectively linked via thiophosphate bonds;
[0111] all bases U in the sequence are subjected to 2′-OMe modification;
[0112] the 3′ nearest base of the targeting base is A subjected to 2′-OMe modification;
[0113] the 5′ nearest base of the targeting base is C subjected to 2′-OMe modification;
[0114] the targeting base is linked to the 3′ nearest base and the 5′ nearest base of the targeting base respectively via thiophosphate bonds,
[0115] the first five nucleotides and the last five nucleotides are respectively subjected to 2′-OMe modification; and
[0116] the first five nucleotides and the last five nucleotides are respectively linked via thiophosphate bonds.
[0117] In some embodiments, the chemical modification is selected from any one or more of:
[0118] CM1, which refers to that the first three nucleotides and the last three nucleotides in the sequence are respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides are respectively linked via thiophosphate bonds, and meanwhile, all bases U in the sequence are subjected to 2′-OMe modification;
[0119] CM2, which refers to that the first three nucleotides and the last three nucleotides in the sequence are respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides are respectively linked via thiophosphate bonds, and meanwhile, the 3′ nearest base of the targeting base is A subjected to 2′-OMe modification;
[0120] CM3, which refers to that the first three nucleotides and the last three nucleotides in the sequence are respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides are respectively linked via thiophosphate bonds, and meanwhile, the 5′ nearest base of the targeting base is C subjected to 2′-OMe modification;
[0121] CM4, which refers to that the first three nucleotides and the last three nucleotides in the sequence are respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides are respectively linked via thiophosphate bonds, and meanwhile, the targeting base is linked to the 3′ nearest base and the 5′ nearest base of the targeting base respectively via thiophosphate bonds; and
[0122] CM6, which refers to that the first five nucleotides and the last five nucleotides in the sequence are respectively subjected to 2′-OMe modification, and the first five nucleotides and the last five nucleotides are respectively linked via thiophosphate bonds.
[0123] Construct
[0124] The present application also provides a construct for encoding the above arRNA. In some embodiments, the construct is a virus vector, plasmid or linear nucleic acid. In some embodiments, the virus vector is an AAV vector or lentivirus expression vector.
[0125] The present application further provides a virus, lipid nanoparticle, liposome, exosome or cell, which comprises the construct.
[0126] Preparation and Biological Product
[0127] The present application also provides a composition, preparation or biological product, which comprises any arRNA or construct in the foregoing description, and can be used for editing of transcribed target RNA comprising the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation site in a target cell, in order to recover normal functions of the IDUA gene. In some embodiments, the arRNA or construct is wrapped within a liposome. In some embodiments, the arRNA or construct is prepared into lipid nanoparticles. In some embodiments, the arRNA or construct is introduced into the body of a subject by virus (e.g. adeno-associated virus or lentivirus) delivery.
[0128] In some embodiments, the preparation comprising the arRNA is a therapeutic agent, which can be infused into the body of a patient to treat a disease. In some embodiments, the infusion method is local injection, local perfusion or intravenous infusion, or local perfusion or local injection. In some embodiments, the therapeutic agent is in a form applicable to local injection into the liver, such as arterial perfusion into the liver. In some embodiments, the therapeutic agent is in a form applicable to intramuscular injection. In some embodiments, the therapeutic agent is in a form applicable to intravenous injection.
[0129] Kit
[0130] The present application also provides a kit used for editing of target RNA in a target cell, which comprises the above arRNA, construct for encoding the arRNA or preparation. The kit can be used for editing of transcribed target RNA comprising the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation site in the target cell.
[0131] In some embodiments, the kit comprises the above arRNA or construct for encoding the arRNA, and a dyeing auxiliary, which are packaged in different containers. In some embodiments, the dyeing auxiliary is a lipid solution. For example, the dyeing auxiliary is Lipo (lipofectmine RNAiMAX, catalog No. 13778150) purchased from Invitrogen or a reagent comprising the same functional ingredients.
[0132] In some embodiments, the kit further comprises an instruction for introducing various ingredients and the content in the kit to users, and/or the use method of the kit.
[0133] Treatment Method
[0134] The present application further provides a method for treating mucopolysaccharidosis IH in an individual, which comprises: correcting a G-to-A mutation in the IDUA gene, such as the NM_000203.4(IDUA)-c.1205G-A (p.Trp402Ter) mutation, in the cell of an individual by the above method. In some embodiments, the diseases include Hurler syndrome. In some embodiments, if the individual is a human, the arRNA is administrated at a frequency of ≥every 21 days, ≥every 17 days, ≥every 14 days or ≥every 10 days. In some embodiments, if the individual is a mouse, the arRNA is administrated at a frequency of ≥every 8 days. In some embodiments, the treatment method uses a construct for encoding the arRNA, and the construct can integrate a coding sequence of the arRNA into the target cell, and the arRNA is administrated once.
[0135] The LEAPER-based method for targeted editing of RNA provided in the present application has the following advantages.
[0136] 1. The method does not rely on the expression of exogenous proteins, so it avoids difficulty in loading and in vivo delivery via virus vectors due to the large molecular weight of proteins, avoids an off-target effect caused by the overexpression of exogenous proteins, avoids an immune response and injury of an organism caused by the expression of exogenous proteins, and avoids the failure of gene editing caused by neutralization of exogenous enzymes used for editing or effector proteins with antibodies pre-existing in the organism.
[0137] 2. The enzyme-guided site-specific RNA editing method provided in the present application is different from DNA editing, RNA editing is reversible and controllable. Diseases can be treated and functions of proteins and RNA can be studied by recoding amino acid codons. Potential side effects of RNA editing are reversible, so RNA editing is safer.
[0138] 3. Compared with the prior art, the method can chemically synthesize the arRNA, and deliver the arRNA to a patient by electrotransfection or lipofection, or via a vector such as an adeno-associated virus or lentivirus, the delivery methods are flexible and changeable, and the editing efficiency is higher.
[0139] The preferred embodiments of the present application have been described above, but the present application is not limited thereto. Many simple modifications including combinations of various technical features in any other suitable manners can be made within the scope of the technical idea of the present application, and these simple modifications and combinations shall be also regarded as the content disclosed in the present application and fall within the scope of protection of the present application.
[0140] The technical solutions of the present application will be further described in detail below with reference to specific examples, but the present application is not limited to the following examples. Unless otherwise defined, reagents involved in the following description are commercially available products. For brevity, some experiments are described without specifying parameters, steps and instruments used, and it should be understood that these experiments are well known and reproducible by those skilled in the art. Cells (GM06214) used herein are purchased from Coriell (USA), and are fibroblasts from patients with Hurler syndrome. arRNA used for editing is synthesized by Synthego (USA) or Biosyntech (Suzhou), the Sanger sequencing is performed by RuiBioTech (Beijing), the next generation sequencing is performed by Novogene or the sequencing platform of China National Rice Research Institute.
Examples
Example 1: Detection of the Mutation Genotype of GM06214 Cells
[0141] GM06214 cells (fibroblasts from patients with Hurler syndrome) were placed into a fibroblast culture solution comprising 15% serum (ScienCell, FM, catalog No. 2301), a 1% fibroblast growth supplement (ScienCell, FGS, catalog No. 2301) was added, and the cells were cultured in an incubator with 5% CO.sub.2 at 37° C. for 2-3 d. After growing to 90% confluence, the cells were digested with 0.25% pancreatic enzyme, and then the digestion was terminated with a fibroblast culture solution comprising 15% serum. DNA was extracted by using a TianGene® (TIANGEN Biotech (Beijing) Co., Ltd.) DNA extraction kit (catalog No. DP304-03) according to the instruction.
[0142] Primers for the upstream and downstream sequences of a mutation site in the IDUA gene were designed on NCBI-Primer blast (https://www.ncbi.nlm.nih.gov/tools/primer-blast/). SEQ ID NO: 1 CGCTTCCAGGTCAACAACAC (forward primer hIDUA-F1); SEQ ID NO: 2 CTCGCGTAGATCAGCACCG (reverse primer hIDUA-R1). PCR reaction is performed, and then the PCR product was subjected to Sanger sequencing. It was determined that the mutation genotype of the cells was a pathogenic G-to-A mutation at the 15704 site in the IDUA genome, as shown in
Example 2: Screening of GM06214 Cell Electrotransfection Conditions
[0143] After growing to about 90% confluence, GM06214 cells were digested, and after the digestion was terminated, the cells were counted. During electrotransfection, 6 million cells were resuspended with 400 μL of pre-mixed electrotransfection solution (Lonza, catalog No. V4XP-3024), 20 μg of GFP plasmids (Lonza, catalog No. V4XP-3024) was added to and uniformly mixed with the cells, 20 μL of electrotransfection system was taken as a sample and detected on an Lonza nuclear electrotransfection instrument under eight conditions including seven electrotransfection conditions (see
Example 3: IDUA Enzymatic Activity and Editing Efficiency in GM06214 Cells after Electrotransfection with arRNA
[0144] The following arRNA sequences were designed and synthesized for the upstream and downstream sequences of mutation sites on pre-mRNA (precursor mRNA) and mature-mRNA (mature mRNA) after transcription of the IDUA gene: SEQ ID NO: 3 GACGCCCACCGUGUGG UUGCUGUCCAGGACGGUCCCGGCCUGCGACACUUCGGCCCAGAGCUGCUCCUCAUCC AGCAGCGCCAGCAGCCCCAUGGCCGUGAGCACCGGCUU (Pre-55nt-c-55nt); SEQ ID NO: 4 GACGCCCACCGUGUGGUUGCUGUCCAGGACGGUCCCGGCCUGCGACACUUCGGCCC AGAGCUGCUCCUCAUCUGCGGGGCGGGGGGGGGCCGUCGCCGCGUGGGGUCGUUG (m-55nt-c-55nt); and SEQ ID NO: 5 UACCGCUACAGCCACGCUGAUUUCAGCUAUACCUGC CCGGUAUAAAGGGACGUUCACACCGCGAUGUUCUCUGCUGGGGAAUUGCGCGAUAU UCAGGAUUAAAAGAAGUGC (Random-111nt). A base, corresponding to the mutation site, in the synthesized arRNA was converted from T to C so as to form A-C mispairing. The length of the synthesized arRNA was preferably 111 nt. Cells were subjected to electrotransfection under the optimal electrotransfection condition obtained in Example 2, and 48 h after electrotransfection, the cells were collected and subjected to enzymatic activity assay and editing efficiency detection.
[0145] Detection of the Editing Efficiency:
[0146] The designed and synthesized arRNA was dissolved to the required concentration in RNase-free water (TransGen, catalog No. GI201-01) and stored at −80° C. After growing to about 90% confluence, GM06214 cells were digested, and after digestion was terminated, the cells were counted. 1 million cells were added with 200 pmol of arRNA until the volume of the mixture was 100 uL, the cells were subjected to electrotransfection under the CA-137 condition. 48 h after the electrotransfection, the cells were counted and subjected to cell viability assay. The cells were transferred into an RNA enzyme-free centrifuge tube and centrifuged, a supernate was removed, and RNA was extracted by using a QIAGEN RNA extraction kit (QIAGEN, catalog No. 74134). According to an operation manual, 5×10.sup.5 cells were added to and uniformly mixed with 0.35 mL of Buffer RLT Plus by being pipetted evenly (for cryopreserved cells, RNA is directly extracted, and it is recommended to wash once with PBS). The lysed cell solution was added to a gDNA Eliminator spin column and centrifuged at ≥8000 g for 30 s, the column was removed, and a fluid was collected. 70% ethanol with the same volume as that of the fluid was added to and uniformly mixed with the fluid by being pipetted, and the next step was performed. The liquid was added to an RNeasyMinElute spin column and centrifuged at ≥8000 g for 15 s, and liquid waste was removed. 700 μL of Buffer RW1 was added to the RNeasyMinElute spin column and centrifuged at ≥8000 g for 15 s, and liquid waste was removed. 500 μL of Buffer RPE was added to the RNeasyMinElute spin column and centrifuged at ≥8000 g for 15 s, and liquid waste was removed. 500 μL of 80% ethanol was added to the RNeasyMinElute spin column and centrifuged at ≥8000 g for 2 min, and liquid waste was removed. The RNeasyMinElute spin column was placed into a new 2 mL collection column and centrifuged without being covered at the highest relative centrifugal force for 5 min to be dried. The RNeasyMinElute spin column was placed into a new 1.5 mL collection column, 14 μL of RNase-free water was dropwise added to the center of the column membrane, and the column was centrifuged at the highest speed for 1 min to dilute RNA.
[0147] The concentration of the extracted RNA was detected by using Nanodrop (Thermo, catalog No. Nanodrop2000), and 1 μg of RNA was subjected to reverse transcription (Thermo, reverse transcriptase catalog No. 28025013). A reverse transcription system was prepared according to Table 1, incubated at 65° C. for 5 min, and immediately cooled in an ice bath. The reverse transcription system was incubated again at 37° C. for 50 min. The reverse transcriptase was inactivated at 70° C. for 15 min. PCR was performed under the conditions shown in Table 3. After the PCR was completed, 2 μL of PCR product was subjected to agarose gel electrophoresis, and the correctness of the concentration of the PCR product and the size of the band was preliminarily determined according to electrophoresis results. After being purified, the library was established with the PCR products and these products were subjected to the next generation sequencing.
TABLE-US-00001 TABLE 1 Preparation of a reverse transcription system-1 Volume (μL) Total RNA (1 μg) X Oligo dT 1 10 nM dNTP 1 RNase-Free Water 10-X Total volume 12
[0148] The reaction system run at 65° C. for 5 min, and then was immediately transferred onto ices.
TABLE-US-00002 TABLE 2 Preparation of a reverse transcription system-2 Volume (μL) The system in Table 1 12 μL 5X First-Strand Buffer 4 0.1M DTT 2 RNaseOUT ™ Recombinant Ribonuclease Inhibitor 1 M-MLV 1 Total volume 20
TABLE-US-00003 TABLE 3 PCR conditions Steps Time Cycles 98° C. 2 min 1 cycle 98° C. 15 s 28-35 cycles 63° C. 30 s 72° C. 15 s 72° C. 2 min 1 cycle
[0149] Enzymatic Activity Assay
[0150] GM06214 cells were digested, centrifuged, resuspended with 28 μL of 1×PBS comprising 0.1% Triton X-100, and lysed on ices for 30 min. 25 μL of lysed cell solution was added to 25 μL of substrate (Cayman, 2A-19543-500, dissolved in a 0.4 M sodium formate solution, comprising 0.2% Triton X-100, pH=3.5) comprising 190 μm of 4-methylumbelliferyl-α-L-iduronidase. The cells were incubated in the dark at 37° C. for 90 min. 200 μL of 0.5M NaOH/Glycine solution (Beijing Chemical Technology, NAOH, catalog No. AR500G; Solarbio, Glycine, catalog No. G8200) (pH=10.3) was added to inactivate the catalysis reaction. The cells were centrifuged at 4° C. for 2 min. A supernate was transferred into a 96-well plate and subjected to fluorescence measurement at the excitation wavelengths of 365 nm and 450 nm by using an Infinite M200 instrument (TECAN).
[0151] In the present application, all result data of the enzymatic activity assay are expressed as folds of the enzyme activity in GM01323 cells. The GM01323 cells are fibroblasts from patients with Scheie syndrome. Scheie syndrome is the mildest subtype of mucopolysaccharidosis and is much milder than Hurler syndrome. Patients with Scheie syndrome usually have better prognosis and a normal lifespan, and can survive to adulthood. The IDUA enzyme activity in fibroblasts of patients with Scheie syndrome is 0.3% of that in wild-type fibroblasts of healthy people.
[0152] As shown in
Example 4: Detection of the Editing Efficiency at an IDUA Target Site after Electrotransfection of an IDUA-Reporter Cell Line with arRNA
[0153] As shown in
TABLE-US-00004 TABLE 4 111 nt-random SEQ ID NO: 5 uaccgcuacagccacgcugauuucagcuauaccug cccgguauaaagggacguucacaccgcgauguucu cugcuggggaauugcgcgauauucaggauuaaaag aagugc 91 nt-random SEQ ID NO: 6 uaauccugaauaucgcgcaauuccccagcagagaa caucgcggugugaacgucccuuuauaccgggcagg uauagcugaaaucagcguggc 71 nt-random SEQ ID NO: 7 uuucagcuauaccugcccgguauaaagggacguuc acaccgcgauguucucugcuggggaauugcgcgau a 51 nt-random SEQ ID NO: 8 uuccccagcagagaacaucgcggugugaacguccc uuuauaccgggcaggu 55 nt-c-55 nt SEQ ID NO: 4 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugcggggcgggggggggccgucgccgcgugggg ucguug 45 nt-c-45 nt SEQ ID NO: 9 gugugguugcuguccaggacggucccggccugcga cacuucggcccagagcugcuccucaucugcggggc gggggggggccgucgccgcgu 35 nt-c-35 nt SEQ ID NO: 10 uguccaggacggucccggccugcgacacuucggcc cagagcugcuccucaucugcggggcgggggggggc c 25 nt-c-25 nt SEQ ID NO: 11 ggucccggccugcgacacuucggcccagagcugcu ccucaucugcggggcg
Example 5: IDUA Enzymatic Activity Assay and Detection of the RNA Editing Efficiency in GM06214 Cells at Different Time Points after Electrotransfection with arRNAs of Different Lengths
[0154] Electrotransfection of GM06214 cells with arRNAs of different lengths (see Table 4) was performed under the electrotransfection condition of Example 2, the enzymatic activity in the cells was assayed on day 2, day 4, day 6, day 8, day 10, day 12, and day 14 respectively after electrotransfection by the method of Example 3, and the RNA editing efficiency in the cells was detected on day 2 and day 4. As shown in
Example 6: Detection of the Editing Efficiency of a Targeting Base at Different Sites on arRNA
[0155] arRNA with the length of 111 nt was taken as an example, the editing efficiency of a targeting base at different site on arRNA was detected by cutting off from both sides of a mutation site at the same time and cutting out from the 5′-terminal or the 3′-terminal.
[0156] In the present example, the arRNA was introduced into cells by lipofectmine RNAiMAX. First, the arRNA sequence was truncated from both terminals at the same time, and then one terminal was fixed and the other terminal was truncated to obtain 14 arRNAs and 4 random sequences of the same length, as shown in Table 5. 48 h after transfection, the IDUA enzymatic activity was assayed and the RNA editing efficiency was detected. As shown in
TABLE-US-00005 TABLE 5 111 nt-random SEQ ID NO: 5 uaccgcuacagccacgcugauuucagcuauaccugc ccgguauaaagggacguucacaccgcgauguucucu gcuggggaauugcgcgauauucaggauuaaaagaag ugc 91 nt-random SEQ ID NO: 6 uaauccugaauaucgcgcaauuccccagcagagaac aucgcggugugaacgucccuuuauaccgggcaggua uagcugaaaucagcguggc 71 nt-random SEQ ID NO: 7 uuucagcuauaccugcccgguauaaagggacguuca caccgcgauguucucugcuggggaauugcgcgaua 51 nt-random SEQ ID NO: 8 uuccccagcagagaacaucgcggugugaacgucccu uuauaccgggcaggu 55 nt-c-55 nt SEQ ID NO: 4 gacgcccaccgugugguugcuguccaggacgguccc ggccugcgacacuucggcccagagcugcuccucauc ugcggggcgggggggggccgucgccgcguggggucg uug 45 nt-c-45 nt SEQ ID NO: 9 gugugguugcuguccaggacggucccggccugcgac acuucggcccagagcugcuccucaucugcggggcgg gggggggccgucgccgcgu 35 nt-c-35 nt SEQ ID NO: 10 uguccaggacggucccggccugcgacacuucggccc agagcugcuccucaucugcggggcgggggggggcc 25 nt-c-25 nt SEQ ID NO: 11 ggucccggccugcgacacuucggcccagagcugcuc cucaucugcggggcg 55 nt-c-45 nt SEQ ID NO: 12 gacgcccaccgugugguugcuguccaggacgguccc ggccugcgacacuucggcccagagcugcuccucauc ugcggggcgggggggggccgucgccgcgu 55 nt-c-35 nt SEQ ID NO: 13 gacgcccaccgugugguugcuguccaggacgguccc ggccugcgacacuucggcccagagcugcuccucauc ugcggggcgggggggggcc 55 nt-c-25 nt SEQ ID NO: 14 gacgcccaccgugugguugcuguccaggacgguccc ggccugcgacacuucggcccagagcugcuccucauc ugcggggcg 55 nt-c-15 nt SEQ ID NO: 15 gacgcccaccgugugguugcuguccaggacgguccc ggccugcgacacuucggcccagagcugcuccucau 55 nt-c-5 nt SEQ ID NO: 16 gacgcccaccgugugguugcuguccaggacgguccc ggccugcgacacuucggcccagagc 45 nt-c-55 nt SEQ ID NO: 17 gugugguugcuguccaggacggucccggccugcgac acuucggcccagagcugcuccucaucugcggggcgg gggggggccgucgccgcguggggucguug 35 nt-c-55 nt SEQ ID NO: 18 uguccaggacggucccggccugcgacacuucggccc agagcugcuccucaucugcggggcgggggggggccg ucgccgcguggggucguug 25 nt-c-55 nt SEQ ID NO: 19 ggucccggccugcgacacuucggcccagagcugcuc cucaucugcggggcgggggggggccgucgccgcgug gggucguug 15 nt-c-55 nt SEQ ID NO: 20 ugcgacacuucggcccagagcugcuccucaucugcg gggcgggggggggccgucgccgcguggggucguug 5 nt-c-55 nt SEQ ID NO: 21 cggcccagagcugcuccucaucugcggggcgggggg gggccgucgccgcguggggucguug
Example 7: Influence of a Distance Between a Targeting Base and the 3′-Terminal on the Editing Efficiency
[0157] By Example 6, it was found that the IDUA enzymatic activity and the editing efficiency caused by the sequences 81 nt: 55nt-c-25nt and 71 nt: 55nt-c-15nt were higher. In order to screen out the shortest and optimal distance between a targeting base and the 3′-terminal, the distance between the targeting base and the 3′-terminal was gradually truncated from 25 nt (81 nt: 55nt-c-25nt) to 10 nt (66 nt: 55nt-c-10nt), as shown in Table 6. As shown in
[0158] Moreover, the screening of the optimal distance between a targeting base and the 3′-terminal was performed on arRNA targeting a mouse IDUA mutation site (corresponding to the human IDUA-W402X mutation in a human), the distance between the targeting base in the arRNA and the 3′-terminal was truncated by every 5 bases from 55 nt, as shown in Table 7. As shown in
TABLE-US-00006 TABLE 6 55 nt-c-25 nt SEQ ID NO: 14 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugcggggcg 55 nt-c-24 nt SEQ ID NO: 22 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugcggggc 55 nt-c-23 nt SEQ ID NO: 23 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugcgggg 55 nt-c-22 nt SEQ ID NO: 24 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugcggg 55 nt-c-21 nt SEQ ID NO: 25 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugcgg 55 nt-c-20 nt SEQ ID NO: 26 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugcg 55 nt-c-19 nt SEQ ID NO: 27 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucugc 55 nt-c-18 nt SEQ ID NO: 28 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucug 55 nt-c- 17nt SEQ ID NO: 29 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca ucu 55 nt-c-16 nt SEQ ID NO: 30 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca uc 55 nt-c-15 nt SEQ ID NO: 15 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca u 55 nt-c-14 nt SEQ ID NO: 31 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuca 55 nt-c-13 nt SEQ ID NO: 32 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccuc 55 nt-c-12 nt SEQ ID NO: 33 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuccu 55 nt-c-11 nt SEQ ID NO: 34 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcucc 55 nt-c-10 nt SEQ ID NO: 35 gacgcccaccgugugguugcuguccaggacggucc cggccugcgacacuucggcccagagcugcuc
TABLE-US-00007 TABLE 7 Mouse 55 nt-c-55 nt (SEQ ID NO: 62) arRNA GacacccacUgUaUgaUUgcUgUccaaca cagccccagccUUUgagaccUcUgcccag agUUgUUcUccaUcUaUaagccaagcaga gggcUgaggcUgUUggcUcUcUca 55 nt-c-50 nt (SEQ ID NO: 44) GacacccacUgUaUgaUUgcUgUccaaca cagccccagccUUUgagaccUcUgcccag agUUgUUcUccaUcUaUaagccaagcaga gggcUgaggcUgUUggcUc 55 nt-c-45 nt (SEQ ID NO: 45) GacacccacUgUaUgaUUgcUgUccaaca cagccccagccUUUgagaccUcUgcccag agUUgUUcUccaUcUaUaagccaagcaga gggcUgaggcUgUU 55 nt-c-40 nt (SEQ ID NO: 46) GacacccacUgUaUgaUUgcUgUccaaca ccagccccagccUUUgagaccUUgcccag agUUgUUcUccaUcUaUaagccaagcaga gggcUgagg 55 nt-c-35 nt (SEQ ID NO: 47) gacacccacUgUaUgaUUgcUgUccaaca cagccccagccUUUgagaccUcUgcccag agUUgUUcUccaUcUaUaagccaagcaga gggc 55 nt-c-30 nt (SEQ ID NO: 48) gacacccacUgUaUgaUUgcUgUccaaca cagccccagccUUUgagaccUcUgcccag agUUgUUcUccaUcUaUaagccaagcag 55 nt-c-25 nt (SEQ ID NO: 49) gacacccacUgUaUgaUUgcUgUccaaca cagccccagccUUUgagaccUcUgcccag agUUgUUcUccaUcUaUaagcca 55 nt-c-20 nt (SEQ ID NO: 50) gacacccacUgUaUgaUUgcUgUccaaca cagccccagccUUUgagaccUcUgcccag agUUgUUcUccaUcUaUa 55 nt-c-15 nt (SEQ ID NO: 51) gacacccacUgUaUgaUUgcUgUccaacacagccccagccUUUgagaccUc UgcccagagUUgUUcUccaU 55 nt-c-10 nt (SEQ ID NO: 52) gacacccacUgUaUgaUUgcUgUccaacacagccccagccUUUgagaccUc UgcccagagUUgUUc
Example 8: Influence of the Length of the 5′-Terminal on the Editing Efficiency
[0159] 2 arRNAs of different lengths, i.e. 76 nt: 55nt-c-20nt and 71 nt: 55nt-c-15nt, were selected, the length of the 3′-terminal is kept unchanged, the length of the 5′-terminal was gradually truncated, as shown in Table 8. As shown in
TABLE-US-00008 TABLE 8 55 nt-c-20 nt SEQ ID NO: 26 gacgcccaccgugugguugcuguccaggacggu cccggccugcgacacuucggcccagagcugcuc cucaucugcg 50 nt-c-20 nt SEQ ID NO: 36 ccaccgugugguugcuguccaggacggucccgg ccugcgacacuucggcccagagcugcuccucau cugcg 45 nt-c-20 nt SEQ ID NO: 37 gugugguugcuguccaggacggucccggccugc gacacuucggcccagagcugcuccucaucugcg 40 nt-c-20 nt SEQ ID NO: 38 guugcuguccaggacggucccggccugcgacac uucggcccagagcugcuccucaucugcg 35 nt-c-20 nt SEQ ID NO: 39 uguccaggacggucccggccugcgacacuucgg cccagagcugcuccucaucugcg 55 nt-c-15 nt SEQ ID NO: 15 gacgcccaccgugugguugcuguccaggacggu cccggccugcgacacuucggcccagagcugcuc cucau 50 nt-c-15 nt SEQ ID NO: 40 ccaccgugugguugcuguccaggacggucccgg ccugcgacacuucggcccagagcugcuccucau 45 nt-c-15 nt SEQ ID NO: 41 gugugguugcuguccaggacggucccggccugc gacacuucggcccagagcugcuccucau 40 nt-c-15 nt SEQ ID NO: 42 guugcuguccaggacggucccggccugcgacac uucggcccagagcugcuccucau 35 nt-c-15 nt SEQ ID NO: 43 uguccaggacggucccggccugcgacacuucgg cccagagcugcuccucau 51 nt-c-14 nt SEQ ID NO: 56 cccaccgugugguugcuguccaggacggucccg gccugcgacacuucggcccagagcugcuccuca 50 nt-c-14 nt SEQ ID NO: 57 ccaccgugugguugcuguccaggacggucccgg ccugcgacacuucggcccagagcugcuccuca 49 nt-c-14 nt SEQ ID NO: 58 caccgugugguugcuguccaggacggucccggc cugcgacacuucggcccagagcugcuccuca 48 nt-c-14 nt SEQ ID NO: 59 accgugugguugcuguccaggacggucccggcc ugcgacacuucggcccagagcugcuccuca 47 nt-c-14 nt SEQ ID NO: 60 ccgugugguugcuguccaggacggucccggccu gcgacacuucggcccagagcugcuccuca 46 nt-c-14 nt SEQ ID NO: 61 cgugugguugcuguccaggacggucccggccug cgacacuucggcccagagcugcuccuca
Example 9: Influence of Chemical Modifications to RNA on the Editing Efficiency
[0160] Different chemical modifications to RNA during RNA synthesis can increase the stability of the RNA and reduce the possibility of off-target. Common chemical modifications to RNA include 2′-OMe and thiophosphate modification. In the present example, 2 arRNAs of different lengths, i.e. 71 nt and 76 nt, were selected, and different combinations of the two chemical modification methods were performed, as shown in Table 8. Specific modification methods were as follows.
[0161] CM1: the first three nucleotides and the last three nucleotides in the sequence were respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides were respectively linked via thiophosphate bonds, and meanwhile, all bases U in the sequences were subjected to 2′-OMe modification.
[0162] CM2: the first three nucleotides and the last three nucleotides in the sequence were respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides were respectively linked via thiophosphate bonds, and meanwhile, the 3′ nearest base of a targeting base was A subjected to 2′-OMe modification.
[0163] CM3: the first three nucleotides and the last three nucleotides in the sequence were respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides were respectively linked via thiophosphate bonds, and meanwhile, the 5′ nearest base of the targeting base was C subjected to 2′-OMe modification.
[0164] CM4: the first three nucleotides and the last three nucleotides in the sequence were respectively subjected to 2′-OMe modification, the first three nucleotides and the last three nucleotides were respectively linked via thiophosphate bonds, and meanwhile, the targeting base was linked to the 3′ nearest base and the 5′ nearest base of the targeting base respectively via thiophosphate bonds.
[0165] CM5: except the targeting base, five bases, adjacent to the targeting base, at the 5′-terminal, and five bases, adjacent to the targeting base, at the 3′-terminal, all other nucleotides were subjected to 2′-OMe modification, and meanwhile, the first three nucleotides and the last three nucleotides were respectively linked via thiophosphate bonds.
[0166] CM6, the first five nucleotides and the last five nucleotides in the sequence are respectively subjected to 2′-OMe modification, and the first five nucleotides and the last five nucleotides are respectively linked via thiophosphate bonds.
[0167] IDUA in cells was edited by transfecting GM06214 cells with different arRNAs, the cells were collected 48 h after transfection and subjected to IDUA enzymatic activity assay. As shown in
TABLE-US-00009 TABLE 9 Modification Name Length method Sequence HIV2-76-CM1 55 nt-c-20 nt CM1 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro- Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo- Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Umo-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro- Gro-Gro-Cro-Cro-Cro-Aro-Gro-Aro-Gro-Cro- Umo-Gro-Cro-Umo-Cro-Cro-Umo-Cro-Aro-Umo- UCro-mo*Gm*Cm*Gm (SEQ ID NO: 26) HIV2-76-CM2 55 nt-c-20 nt CM2 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro -Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Um o-Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Umo-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo-Gr o-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro-Gro- Gro-Cro-Cro-Cro-Amo-Gro-Aro-Gro-Cro-Umo-Gr o-Cro-Umo-Cro-Cro-Umo-Cro-Aro-Umo-Cro- Um*Gm*Cm*Gm (SEQ ID NO: 26) HIV2-76-CM3 55 nt-c-20 nt CM3 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro- Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo- Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Umo-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro- Gro-Gro-Cro-Cmo-Cro-Aro-Gro-Aro-Gro-Cro- Umo-Gro-Cro-Umo-Cro-Cro-Umo-Cro-Aro-Umo- Cro-Um*Gm*Cm*Gm (SEQ ID NO: 26) HIV2-76-CM4 55 nt-c-20 nt CM4 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro- Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo- Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Umo-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro- Gro-Gro-Cro-Cr*Cr*Ar-Gro-Aro-Gro-Cro-Umo- Gro-Cro-Umo-Cro-Cro-Umo-Cro-Aro-Umo-Cro- Um*Gm*Cm*Gm (SEQ ID NO 26) HIV2-76-CM5 55 nt-c-20 nt CM5 Gm*Am*Cm*Gmo-Cmo-Cmo-Cmo-Amo-Cmo-Cmo- Gmo-Umo-Gmo-Umo-Gmo-Gmo-Umo-Umo- Gmo-Cmo-Umo-Gmo-Umo-Cmo-Cmo-Amo-Gmo- Gmo-Amo-Cmo-Gmo-Gmo-Umo-Cmo-Cmo-Cmo- Gmo-Gmo-Cmo-Cmo-Umo-Gmo-Cmo-Gmo-Amo- Cmo-Amo-Cmo-Umo-Umo-Cro-Gro-Gro-Cro- Cro-Cro-Aro-Gro-Aro-Gro-Cro-Umo-Gmo- Cmo-Umo-Cmo-Cmo-Umo-Cmo-Amo-Umo-Cmo- Um*Gm*Cm*Gm (SEQ ID NO 26) HIV2-76-CM6 55 nt-c-20 nt CM6 Gm*Am*Cm*Gm*Cm*Cro-Cro-Aro-Cro-Cro-Gro- Uro-Gro-Uro-Gro-Gro-Uro-Uro-Gro-Cro-Uro- Gro-Uro-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Uro-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Uro- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Uro-Uro-Cro- Gro-Gro-Cro-Cro-Cro-Aro-Gro-Aro-Gro-Cro- Uro-Gro-Cro-Uro-Cro-Cro-Uro-Cro-Aro- Ur*Cm*Um*Gm*Cm*Gm (SEQ ID NO 26) HIV2-71-CM1 55 nt-c-15 nt CM1 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro- Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo- Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Umo-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro- Gro-o-Cro-Umo-Cro-Cro-Um*Cm*Am*Um (SEQ ID NO: 15) HIV2-71-CM2 55 nt-c-15 nt CM2 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cr0-Cro-Gro- Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo- Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Urno-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro- Gro-Gro-Cro-Cro-Cro-Amo-Gro-Aro-Gro-Cro- Umo-Gro-Cro-Umo-Cro-Cro-Um*Cm*Am*Um HIV2-71-CM3 55 nt-c-15 nt CM3 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro- Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo- Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Urno-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro- Gro-Gro-Cro-Cmo-Cro-Aro-Gro-Aro-Gro-Cro- Umo-Gro-Cro-Umo-Cro-Cro-Um*Cm*Am*Um (SEQ ID NO: 15) HIV2-71-CM4 55 nt-c-15 nt CM4 Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro- Umo-Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo- Gro-Umo-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Urno-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Umo- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Umo-Umo-Cro- Gro-Gro-Cro-Cr*Cr*Ar-Gro-Aro-Gro-Cro-Umo- Gro-Cro-Umo-Cro-Cro-Um*Cm*Am*Um (SEQ ID NO: 15) HIV2-71-CM5 55 nt-c-15 nt CM5 Gm*Am*Cm*Gmo-Cmo-Cmo-Cmo-Amo-Cmo-Cmo- Gmo-Umo-Gmo-Umo-Gmo-Gmo-Umo-Umo-Gmo- Cmo-Umo-Gmo-Umo-Cmo-Cmo-Amo-Gmo-Gmo- Amo-Cmo-Gmo-Gmo-Umo-Cmo-Cmo-Cmo-Gmo- Gmo-Cmo-Cmo-Umo-Gmo-Cmo-Gmo-Amo-Cmo- Amo-Cmo-Umo-Umo-Cro-Gro-Gro-Cro-Cro- Cro-Aro-Gro-Aro-Gro-Cro-Umo-Gmo-Cmo- Umo-Cmo-Cmo-Um*Cm*Am*Um (SEQ ID NO: 15) HIV2-71-CM6 55 nt-c-15 nt CM6 Gm*Am*Cm*Gm*Cm*Cro-Cro-Aro-Cro-Cro-Gro- Uro-Gro-Uro-Gro-Gro-Uro-Uro-Gro-Cro-Uro- Gro-Uro-Cro-Cro-Aro-Gro-Gro-Aro-Cro-Gro- Gro-Uro-Cro-Cro-Cro-Gro-Gro-Cro-Cro-Uro- Gro-Cro-Gro-Aro-Cro-Aro-Cro-Uro-Uro-Cro- Gro-Gro-Cro-Cro-Cro-Aro-Gro-Aro-Gro-Cro- Uro-Gro-Cro-Uro-Cro*Cm*Um*Cm*Am*Um (SEQ ID NO: 15) Note: ro refers to that nucleotides are not modified and ester bonds between the nucleotides are not modified; r* refers to that nucleotides are not modified and the nucleotides are linked via thiophosphate bonds; mo refers to that nucleotides are subjected to 2′-OMe modification and ester bonds between the nucleotides are not modified; and m* refers to that nucleotides are subjected to 2′-OMe modification and the nucleotides are linked via thiophosphate bonds.
Example 10: Influence of Chemical Modifications on the Editing Efficiency
[0168] In the present example, three preferred arRNAs targeting a human IDUA mutation site and one preferred arRNA targeting a mouse IDUA mutation site were used and chemically modified by the method CM1. Concentration gradient experiments were performed on GM06214 cells and MSPI MEF (MSPI mouse embryo fibroblasts (MEF)) cells separated from a fetal mouse (IDUA W392X mouse, B6.129S-Iduatm.sup.1.1Kmke/J) with a homozygous IDUA mutation (Wang D, Shukla C, Liu X, et al. Characterization of an MPS I-H knock-in mouse that carries a nonsense mutation analogous to the human IDUA-W402X mutation [published correction appears in Mol Genet Metab. 2010 Apr;99(4):439]. Mol Genet Metab. 2010; 99(1):62-71. doi:10.1016/j.ymgme.2009.08.002).
[0169] Based on the experimental results of Example 7, three arRNAs targeting the human IDUA, i.e. the sequences 55nt-c-16nt, 55nt-c-14nt, and 55nt-c-11nt, and one arRNA targeting mouse IDUA, i.e. the sequence 55nt-c-10nt, were selected. In addition, a random arRNA sequence RM-67CM1 was selected as a control. Based on the experimental results of Example 9, the above arRNAs targeting IDUA were synthesized by the chemical modification method CM1 (all bases U were subjected to 2′-OMe modification), as shown in Table 9. Comparative experiments on transfection of the human GM06214 cells and the MSPI mouse MEF with arRNAs at different concentrations were performed. Nine concentrations of arRNA were set: 160 nM, 80 nM, 40 nM, 20 nM, 10 nM, 5 nM, 2.5 nM, 1.25 nM, and 0.625 nM. The cells were coated onto a 6-well plate, transfected 24 h after coating, and digested 48 h after transfection, half of the cells were taken for IDUA enzymatic activity assay, and half of the cells were taken for RNA extraction and editing efficiency detection. As shown in
TABLE-US-00010 TABLE 10 Name and Modification length method Sequence 55 nt-c-ll nt CM1 ′-Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro-Umo- CM1 Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo-Gro-Umo-Cro- Cro-Aro-Gro-Gro-Aro-Cro-Gro-Gro-Umo-Cro-Cro-Cro- Gro-Gro-Cro-Cro-Umo-Gro-Cro-Gro-Aro-Cro-Aro-Cro- Umo-Umo-Cro-Gro-Gro-Cro-Cro-Cro-Aro-Gro-Aro-Gro- Cro-Umo-Gro-Cr*Um*Cm*Cm (SEQ ID NO: 34) 55 nt-c-16 nt CM1 ′-Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro-Umo- CM1 Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo-Gro-Umo-Cro- Cro-Aro-Gro-Gro-Aro-Cro-Gro-Gro-Umo-Cro-Cro-Cro- Gro-Gro-Cro-Cro-Umo-Gro-Cro-Gro-Aro-Cro-Aro-Cro- Umo-Umo-Cro-Gro-Gro-Cro-Cro-Cro-Aro-Gro-Aro-Gro- Cro-Umo-Gro-Cro-Umo-Cro-Cr*Um*Cm*Am (SEQ ID NO: 30) 55 nt-c-14 nt CM1 ′-Gm*Am*Cm*Gro-Cro-Cro-Cro-Aro-Cro-Cro-Gro-Umo- CM1 Gro-Umo-Gro-Gro-Umo-Umo-Gro-Cro-Umo-Gro-Umo-Cro- Cro-Aro-Gro-Gro-Aro-Cro-Gro-Gro-Umo-Cro-Cro-Cro- Gro-Gro-Cro-Cro-Umo-Gro-Cro-Gro-Aro-Cro-Aro-Cro- Umo-Umo-Cro-Gro-Gro-Cro-Cro-Cro-Aro-Gro-Aro-Gro- Cro-Umo-Gro-Cro-Umo-Cro-Cro-Umo-Cr*Am*Um*Cm (SEQ ID NO: 31) Random RNA- CM1 ′-Um*Am*Cm*Cro-Gro-Cro-Umo-Aro-Cro-Aro-Gro-Cro- 67 nt CM1 Cro-Aro-Cro-Gro-Cro-Umo-Gro-Aro-Umo-Umo-Umo-Cro- Aro-Gro-Cro-Umo-Aro-Umo-Aro-Cro-Cro-Umo-Gro-Cro- Cro-Cro-Gro-Gro-Umo-Aro-Umo-Aro-Aro-Aro-Gro-Gro- Gro-Aro-Cro-Gro-Umo-Umo-Cro-Aro-Cro-Aro-Cro-Cro- Gro-Cro-Gr*Am*Um*Gm (SEQ ID NO: 53) Mouse CM1 Gm*Am*Cm*Aro-Cro-Cro-Cro-Aro-Cro-Umo-Gro-Umo- 55 nt-c-10 nt Aro-Umo-Gro-Aro-Umo-Umo-Gro-Cro-Umo-Gro-Umo-Cro- CM1 Cro-Aro-Aro-Cro-Aro-Cro-Aro-Gro-Cro-Cro-Cro-Cro- Aro-Gro-Cro-Cro-Umo-Umo-Umo-Gro-Aro-Gro-Aro-Cro- Cro-Umo-Cro-Umo-Gro-Cro-Cro-Cro-Aro-Gro-Aro-Gro- Umo-Umo-Gr*Um*Um*Cmo (SEQ ID NO: 52) Note: ro refers to that nucleotides are not modified and ester bonds between the nucleotides are not modified; r* refers to that nucleotides are not modified and the nucleotides are linked via thiophosphate bonds; mo refers to that nucleotides are subjected to 2′-OMe modification and ester bonds between the nucleotides are not modified; and m* refers to that nucleotides are subjected to 2′-OMe modification and the nucleotides are linked via thiophosphate bonds.
Example 11: Sustained Proteinase Activity of IDUA after being Edited by arRNA
[0170] In the present example, three preferred arRNAs targeting a human IDUA mutation site and one preferred arRNA targeting a mouse IDUA mutation site were used and chemically modified by the method CM1. The IDUA enzymatic activity in GM06214 cells and MSPI MEF (mouse embryo fibroblast) cells was significantly improved, which lasted for about 3 weeks.
[0171] In Example 10, 48 h after transfection of the human and mouse cells with the preferred arRNAs targeting IDUA at different concentrations, the IDUA enzymatic activities were compared.
[0172] In the present example, arRNA at the concentration of 20 nM was used, the IDUA enzymatic activity and the editing efficiency at different time points were compared. After transfection of GM06214 cells with arRNA, the IDUA enzymatic activity was assayed for continuous 14 d. As shown in
[0173] By comparison of the data of the human and mouse cells, it is found that the peak of the editing efficiency of the arRNA in the mouse cells appears 24 h after transfection, while the peak of the editing efficiency of the arRNA in the human cells appears 48 h after transfection. After editing, the IDUA enzyme activity in the human cells lasts for more than 21 days, while the IDUA enzyme activity in the mouse cells lasts for more than 8 days.
Example 12: Influence of arRNA Delivery Methods on the Editing Efficiency
[0174] In the present example, wild-type sites on the PPIB gene in human and mouse liver cells obtained by primary cell culture were edited by LEAPER, and arRNAs were delivered by different methods to screen out the optimal delivery method.
[0175] “PPIB” refers to a wild-type site in the human NM_000942 (PPIB Genomic chr15 (−): 64163082) UTR region or a wild-type site in the mouse NM_011149 (PPIB Genomic chr9 (+): 66066490) UTR region. PPIB may be mature mRNA or precursor mRNA. The UTR region of PPIB comprises one TAG. In the present example, A in the TAG serving as a target was edited to detect the editing efficiency of arRNA of the present application in liver cells.
[0176] arRNA (55nt-c-15nt) targeting the PPIB UTR region was designed and synthesized, as shown in Table 11. A part of the synthesized arRNA was dissolved, subpackaged, stored at −80° C., and used for transfection of cells by Lipo (lipofectmine RNAiMAX), and the rest part of the arRNA was prepared into LNP. LNP were prepared with reference to Witzigmann D, Kulkarni J A, Leung J, et al. Lipid nanoparticle technology for therapeutic gene regulation in the liver[J]. Advanced Drug Delivery Reviews, 2020.; Kauffman K J, Dorkin J R, Yang J H, et al. Optimization of lipid nanoparticle formulations for mRNA delivery in vivo with fractional factorial and definitive screening designs[J]. Nano letters, 2015, 15(11): 7300-7306.; and Reis J, Kanagaraj S, Fonseca A, et al. In vitro studies of multiwalled carbon nanotube/ultrahigh molecular weight polyethylene nanocomposites with osteoblast-like MG63 cells[J]. Brazilian Journal of Medical and Biological Research, 2010, 43(5): 476-482.
[0177] Primary human liver cells were purchased from LONZA (catalog No. HUCPI), and recovered and cultured according to an operation manual (recovery medium catalog No.: MCHT50, planking medium catalog No.: MP100). After the cells adhered to walls, the medium was replaced with a hepatocyte maintenance medium 5C (referring to Xiang C, Du Y, Meng G, et al. Long-term functional maintenance of primary human hepatocytes in vitro[J]. Science, 2019, 364(6438): 399-402).
[0178] Primary mouse liver cells were separated from a C57BJ mouse (referring to Charni-Natan M, Goldstein I. Protocol for Primary Mouse Hepatocyte Isolation[J]. STAR protocols, 2020, 1(2): 100086.), and after the separated liver cells adhered to walls, the medium was replaced with a maintenance medium 5C.
[0179] 24 h after recovery of the human liver cells, the arRNA was delivered, and concentrations of arRNA delivered by LNP and Lipo were both 20 nM. 24 h after separation culture of the mouse liver cells, the arRNA was delivered, and concentrations of arRNA delivered by LNP and Lipo were both 20 nM. RNA of the human liver cells was extracted 24 h after delivery of arRNA, RNA of the mouse liver cells was extracted 48 h after delivery of arRNA, and the editing efficiency was detected by the next generation sequencing. As shown in
[0180] Therefore, by comparison of the editing efficiency at the PPIB site in the primary liver cells, it is determined that the peak of editing in the mouse liver cells appears 24 h after delivery of arRNA, and the peak of editing in the human liver cells appears 48 h after delivery of arRNA, and these data coincide with the data of the human GM06214 cells and the mouse MSPI MEF cells. For delivery of arRNA to human liver cells, LNP is better than or equal to Lipo. For delivery of arRNA to mouse liver cells, Lipo is much better than LNP.
TABLE-US-00011 TABLE 11 Mouse CACCCCAUCAGAUGGAAGCACUAGGGCC 55 nt-c-15 nt AGGGUGGCACAGAACCUUGUGACUGGCC ACCUUCGUCUGUGUG (SEQ ID NO: 54) Human GGAGGCGAAAGCAGCCCGGACAGCUGAG 55 nt-c-15 nt GCCGGAAGAGGGUGGGGCCGCGGUGGCC AGGGAGCCGGCGCCG (SEQ ID NO: 55)
Example 13: The Editing Efficiency Caused by the LNP Delivery
[0181] The present example involves the study on the editing efficiency of arRNA in IDUA after delivery of arRNA to human and mouse liver cells obtained by primary cell culture via LNP.
[0182] In the present example, arRNA targeting a wild-type site in the human IDUA CDS region was designed and synthesized, as shown in Table 12. The arRNAs (20 nM) targeting human and mouse IDUA were respectively delivered to primary human liver cells and primary mouse liver cells by LNP, and 24 h and 48 h after delivery, the editing efficiency was detected, As shown in
TABLE-US-00012 TABLE 12 Human IDUA GGAGGCGAAAGCAGCCCGGACAGCUG 55 nt-c-15 nt AGGCCGGAAGAGGGUGGGGCCGCGGU GGCCAGGGAGCCGGCGCCG (SEQ ID NO: 55)
Example 14: Treatment Effect of arRNA on IDUA in the MSPI Mouse Model
[0183] In the present example, an MPSI mouse model (IDUA W392X mouse, B6.129S-Iduatm.sup.1.1Kmke/J) (Wang D, Shukla C, Liu X, et al. Characterization of an MPS I-H knock-in mouse that carries a nonsense mutation analogous to the human IDUA-W402X mutation [published correction appears in Mol Genet Metab. 2010 April;99(4): 439]. Mol Genet Metab. 2010; 99(1): 62-71. Doi:10.1016/j.ymgme.2009.08.002) was used. The mutation corresponds to the human IDUA-W402X mutation, which can terminate the synthesis of proteins prematurely and is prevalent in patients with Hurler mucopolysaccharidosis (MPS I-H). Such a lysosomal storage disorder was caused by a deficiency of α-L-iduronidase. The α-L-iduronidase activity was not detected in brain and liver tissues of 5-, 10-, and 30-week-old homozygous mice. Although these homozygous mice with the mutations were viable and fertile, their average lifespan was 69 weeks. The homozygotes exhibited a progressive increase in urinary excretion of glycosaminoglycans (GAGs) and progressive accumulation of GAGs in tissues. The steady IDUA mRNA level was reduced by 30-50%. Histological analysis showed progressive accumulation of lysosomal storage inclusions in the cytoplasm of Purkinje cells and medullary neurons, and increased infiltration of foamy macrophages with age. Radiographs showed visible thickening of the zygomatic arch and femur in the 15-week-old mouse, and significant thickening in the 35-week-old mouse. At 35 weeks of age, the mineral density of the femoral bone increased, and the percentage of body fat decreased. A selected arRNA targeting a mouse IDUA mutation, i.e. 55nt-c-10nt CM1 (SEQ ID NO: 52), was prepared into LNP. arRNAs at different concentrations were injected via the tail vein, and the different concentrations were 0.1 mg/kg, 0.5 mg/kg, 2 mg/kg, and 10 mg/kg, respectively. 24 h after administration, mouse liver cells were taken for detection of the editing efficiency in IDUA. As shown in