GENE EDITING SYSTEM FOR CORRECTING SPLICING DEFECTS
20210388332 · 2021-12-16
Inventors
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
A61K31/7088
HUMAN NECESSITIES
C12N2750/14143
CHEMISTRY; METALLURGY
A61K38/465
HUMAN NECESSITIES
C12N2800/80
CHEMISTRY; METALLURGY
C12N15/11
CHEMISTRY; METALLURGY
C12N2750/14043
CHEMISTRY; METALLURGY
C12N15/63
CHEMISTRY; METALLURGY
C12N15/1138
CHEMISTRY; METALLURGY
A61P21/00
HUMAN NECESSITIES
International classification
C12N9/22
CHEMISTRY; METALLURGY
A61K31/7088
HUMAN NECESSITIES
A61P21/00
HUMAN NECESSITIES
C12N15/11
CHEMISTRY; METALLURGY
Abstract
The present disclosure provides genome editing systems, compositions and methods. The genome editing system comprises at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner, wherein the genome editing system is configured to form a first and a second blunt-ended double strand break in an intron of a gene, wherein the intron comprises a splice donor site mutation that alters splice site recognition, thereby excising a segment of the intron and simultaneously joining DNA ends flanking the excised segment of the intron to constitute a functional donor splice site.
Claims
1. A genome editing system, comprising at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner, wherein the genome editing system is configured to form a first and a second blunt-ended double strand break in an intron of a gene, wherein the intron comprises a splice donor site mutation that alters splice site recognition, thereby excising a segment of the intron and simultaneously joining DNA ends flanking the excised segment of the intron to constitute a functional donor splice site.
2. The system of claim 1, wherein the excising of the segment of the intron and simultaneously joining the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site results in restored splice site recognition in the intron.
3. The system of claim 1, wherein the joining of the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site occurs through a homology directed repair-independent mechanism.
4. The system of claim 1, wherein the joining of the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site occurs through non-homologous end joining.
5. The system of claim 1, wherein the excising occurs at a mutation-eliminating cut site of the intron and at a splice site-generating cut site of the intron of the gene.
6. The system of claim 1, wherein the at least one nuclease that generates blunt-ended DNA breaks is: a Cas9 polypeptide that recognizes a Protospacer Adjacent Motif (PAM) and wherein the system further comprises a mutation-eliminating single guide RNA (sgRNA) and a splice site-generating sgRNA.
7. (canceled)
8. (canceled)
9. (canceled)
10. (canceled)
11. (canceled)
12. (canceled)
13. (canceled)
14. (canceled)
15. (canceled)
16. (canceled)
17. The system of claim 1, wherein the mutation occurs at the +1 position of the splice donor site.
18. (canceled)
19. (canceled)
20. (canceled)
21. (canceled)
22. (canceled)
23. The system of claim 1, wherein the system further comprises a delivery vehicle for delivery of the at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner.
24. The system of claim 1, wherein polynucleotides coding for the at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner is in a vector.
25. The system of claim 24, wherein the nuclease is Cas9 and the vector further comprises a mutation-eliminating single sgRNA or a splice site-generating sgRNA.
26. The system of claim 23, wherein the delivery vehicle is an adeno-associated virus (AAV) vector.
27. (canceled)
28. The system of claim 26, wherein the delivery vehicle is an AAV9 vector.
29. (canceled)
30. (canceled)
31. The system of claim 1, wherein the mutation is a pathogenic mutation.
32. The system of claim 1, wherein the gene is a gene listed in Table 1.
33. The system of claim 1, wherein the gene is a Lama2 gene.
34. The system of claim 33, wherein the mutation is a c.417+1 g.fwdarw.a mutation.
35. The system of claim 34, wherein the mutation is in a subject having merosin deficient congenital muscular dystrophy (MDC1A).
36. The system of claim 1, for use in the treatment of merosin deficient congenital muscular dystrophy (MDC1A) in a subject.
37. A composition comprising at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner, wherein the composition is configured to form a first and a second blunt-ended double strand break in an intron of a gene, wherein the intron comprises a splice donor site mutation that alters splice site recognition, thereby excising a segment of the intron and simultaneously joining DNA ends flanking the excised segment of the intron to constitute a functional donor splice site.
38. (canceled)
39. (canceled)
40. (canceled)
41. (canceled)
42. (canceled)
43. (canceled)
44. (canceled)
45. (canceled)
46. (canceled)
47. (canceled)
48. (canceled)
49. (canceled)
50. (canceled)
51. (canceled)
52. (canceled)
53. (canceled)
54. (canceled)
55. (canceled)
56. (canceled)
57. (canceled)
58. (canceled)
59. (canceled)
60. (canceled)
61. (canceled)
62. (canceled)
63. (canceled)
64. (canceled)
65. (canceled)
66. (canceled)
67. (canceled)
68. (canceled)
69. (canceled)
70. (canceled)
71. (canceled)
72. (canceled)
73. A method of constituting a functional donor splice site in a gene, the gene comprising a mutation in a donor splice site that alters splice site recognition, the method comprising: delivering a genome editing system to the gene, the genome editing system comprising at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner, wherein the genome editing system is configured to form a first and a second blunt-ended double strand break in an intron of a gene, wherein the intron comprises a splice donor site mutation that alters splice site recognition, thereby excising a segment of the intron and simultaneously joining DNA ends flanking the excised segment of the intron to constitute a functional donor splice site.
74. (canceled)
75. (canceled)
76. (canceled)
77. (canceled)
78. (canceled)
79. (canceled)
80. (canceled)
81. (canceled)
82. (canceled)
83. (canceled)
84. (canceled)
85. (canceled)
86. (canceled)
87. (canceled)
88. (canceled)
89. (canceled)
90. (canceled)
91. (canceled)
92. (canceled)
93. (canceled)
94. (canceled)
95. (canceled)
96. (canceled)
97. (canceled)
98. (canceled)
99. (canceled)
100. (canceled)
101. (canceled)
102. (canceled)
103. (canceled)
104. (canceled)
105. (canceled)
106. (canceled)
107. (canceled)
108. (canceled)
109. (canceled)
110. (canceled)
111. (canceled)
Description
DESCRIPTION OF THE DRAWINGS
[0060] The patent or application file contains at least one drawing in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
[0061] These and other features of the disclosure will become more apparent in the following detailed description in which reference is made to the appended drawings wherein:
[0062]
[0063]
[0064]
[0065]
[0066]
[0067] Graphical representation of % fibrosis (
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
DETAILED DESCRIPTION OF THE DISCLOSURE
[0074] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.
Definitions
[0075] As used herein a “genome editing system” means a system that is capable of editing (e.g., modifying or altering) a target gene or DNA sequence. In an embodiment, the target gene or DNA sequence is a gene or DNA sequence that contains a splice site donor mutation (i.e., a mutation in the 5′ splice site (5′ss)). In an embodiment, the splice site donor mutation results in the gene or DNA sequence having a splicing defect. In an embodiment, the genome editing system comprises at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. The genome-editing system may be implemented in a cell or in an in vitro system. In certain embodiments, the genome-editing system is used in a medicament, e.g., a medicament for correcting a splicing defect, or a medicament for treating a disease caused by a splicing defect.
[0076] As used herein, a “mutation-eliminating cut” is defined as a cut in DNA generated in a sequence-specific manner by a nuclease. The cut separates a mutation in a donor splice site of a gene from the upstream exons and introns of that gene. In one embodiment, the cut is a blunt ended double strand break. A “mutation-eliminating cut site” is defined as a location in DNA at which a cut is generated in a sequence-specific manner by a nuclease to separate a mutation in a donor splice site of a gene from the upstream exons and introns of that gene. For example, a mutation-eliminating cut may occur at a Cas9 cleavage site occurring in the presence of one of the following Protospacer Adjacent Motif (PAM) sequences: NGG, NAG, NGA, NGCG, NNGRRT, NNNRRT, NNAGAA, NNNNRYAC or YG, when in the presence of a mutation-eliminating sgRNA. As used herein, a “mutation-eliminating sgRNA” is an sgRNA that in concert with a Cas9, creates a blunt ended double strand break at a mutation-eliminating cut site. The mutation-eliminating cut site may be, for example, upstream (i.e., when the DNA is in the 5′ to 3′ direction) of the PAM, for example, at the −3 position.
[0077] In other embodiments, the mutation-eliminating cut site may be located as determined by binding by a DNA recognizing protein domain to the DNA and cleavage by a nuclease to which the DNA recognizing protein domain is fused. In another embodiment, the DNA recognizing protein domain and the nuclease are joined by a linker which determines the position of a mutation-eliminating cut site. For example, with a longer linker, the mutation-eliminating cut site is positioned further away from the DNA binding domain.
[0078] As used herein, a “splice site-generating cut” is defined as a cut in an intron of a gene that is generated in a sequence-specific manner by a nuclease. The cut, in concert with a mutation-eliminating cut, results in excision of a segment of an intron, the segment containing a mutation in the donor splice site of the gene. In one embodiment, the cut is a blunt ended double strand break. Upon re-ligation with the DNA ends flanking the excised segment of the intron, a functional donor splice site is constituted. This splice site-generating cut and subsequent ligation results in proper splicing of a gene such that all of the exons are expressed in RNA and translated protein. A “splice site-generating cut site” is defined as a location in the intron of a gene at which a cut is generated in a sequence-specific manner by a nuclease and, when used in concert with a mutation-eliminating cut, results in excision of a segment of an intron, the segment containing a mutation in the donor splice site of the gene. For example, a splice site-generating cut may occur at a Cas9 cleavage site occurring in the presence of one of the following Protospacer Adjacent Motif (PAM) sequences: NGG, NAG, NGA, NGCG, NNGRRT, NNNRRT, NNAGAA, NNNNRYAC or YG, when in the presence of a splice site-generating sgRNA. As used herein, a “splice site-generating sgRNA” is an sgRNA that in concert with a Cas9, creates a blunt ended double strand break at a splice site-generating cut site. The mutation-eliminating cut site may be, for example, upstream (i.e., when the DNA is in the 5′ to 3′ direction) of the PAM, for example, at the −3 position.
[0079] In other embodiments, the splice site-generating cut may occur at a location as determined by binding by a DNA recognizing protein domain to the DNA and cleavage by a nuclease to which the DNA recognizing protein domain is fused. In another embodiment, the DNA recognizing protein domain and the nuclease are joined by a linker which determines the position of a splice site-generating cut site. For example, with a longer linker, the splice site-generating cut site will be positioned further away from the DNA binding domain.
[0080] As used herein, “homology directed repair-independent,” or “HDR-independent” refers to a process of modifying (e.g., repairing) DNA that does not involve homology directed repair (HDR). The homology directed repair-independent process can be, for example, a process that uses non-homologous end-joining to modify the DNA.
[0081] As used herein, “non-homologous end-joining” or “NHEJ” refers to ligation mediated modification of DNA (e.g., repair) that does not require a homologous template in the process. HNEJ includes, for example, canonical NHEJ (cNHEJ), alternative NHEJ (altNHEJ), microhomology-mediated end joining (MMEJ), and synthesis-dependent microhomology-mediated end joining (SD-MMEJ).
[0082] As used herein “repair” refers to modification of an intron such that proper splicing of the RNA (e.g., pre-mRNA) occurs (i.e., such that splicing occurs as it would in a naturally-occurring RNA, for example, an RNA transcribed from a gene that has no splicing mutations), and such that a polypeptide translated from such an RNA contains all the exons of a naturally-occurring polypeptide. In an embodiment, the mutation causing a splicing defect is a mutation in the donor splice site. As used herein, the repair does not have to be a repair (e.g., correction) of the mutated splice donor site. Instead, for example, the repair may involve excision of an intron fragment containing the mutated donor splice site and ligation of the flanking ends of the DNA to create or constitute a new donor splice site in that gene, resulting in proper splicing of the gene.
[0083] As used herein, the first nucleotide of the intron is defined as the “+1” position. The second nucleotide of the intron is defined as the “+2” position. The third nucleotide of the intron is defined as the “+3” position. The fourth nucleotide of the intron is defined as the “+4” position. The fifth nucleotide of the intron is defined as the “+5” position. The sixth nucleotide of the intron is defined as the “+6” position.
[0084] As used herein, a “Cas9 molecule” or “Cas9 polypeptide” refers to a molecule or polypeptide, respectively, that can interact with a guide RNA (gRNA) molecule to localize to a site comprising a target domain (e.g., a DNA cut site) and a PAM sequence. Cas9 molecules and Cas9 polypeptides include naturally occurring Cas9 molecules and Cas9 polypeptides and engineered, altered, or modified Cas9 molecules or Cas9 polypeptides that differ, e.g., by at least one amino acid residue, from a reference sequence (e.g., a naturally-occurring Cas9 sequence).
[0085] As used herein, a “subject” means a higher eukaryote. The term includes, but is not limited to, mammals (e.g., humans, other primates, pigs, rodents (e.g., mice and rats or hamsters), rabbits, guinea pigs, cows, horses, cats, dogs, sheep, and goats). In certain embodiments, the subject is a human. The human may be a child or an adult.
[0086] As used herein, “treat”, “treating” and “treatment”, means the treatment of a disease in a mammal, for example, a human, and includes inhibiting the disease (e.g., decreasing its rate of progression); regressing the disease; relieving or decreasing the severity of one or more symptoms of the disease; and/or curing the disease.
[0087] As used herein, “prevent,” “preventing,” and “prevention” means the prevention of a disease in a mammal, and includes inhibiting initiation of the disease; decreasing a predisposition toward the disease; and/or delaying the onset of at least one symptom of the disease.
[0088] As used herein, “about” or “approximately” means within an acceptable error range for a particular value as determined by a person skilled in the relevant art. Such an acceptable error range may depend, in part, on how the value is measured or assessed. “About” can mean, for example, within 3 or more than 3 standard deviations. “About” can mean within a percentage range of a given value. For example, the range can be ±1%, ±5%, ±10%, ±20%, ±30%, ±40% or ±50% of a given value. “About” can mean with an order of magnitude of a given value, for example, within 2-fold, 3-fold, 4-fold or 5-fold of a value.
[0089] Systems and Compositions
[0090] As described herein, the inventors have provided genome editing systems, compositions and methods for constituting a functional donor splice site in a gene. These genome editing systems, compositions and methods for constituting a functional donor splice site in a gene may be used, for example, to correct a splicing defect in a gene or DNA sequence. In an embodiment, the splicing defect is a pathogenic mutation. The mutation may be, for example, in the donor splice site or an intron that results in improper splicing of the RNA molecule transcribed from the gene or DNA sequence.
[0091] The disclosed genome editing systems, compositions and methods for constituting a functional donor splice site of a gene harnesses the HDR-independent repair pathway (e.g., via NHEJ) to joining of the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site. In doing so, correction of pathogenic splicing defects occurs.
[0092] The disclosed genome editing systems comprise at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. The genome editing system is configured to form a first and a second blunt-ended double strand break in an intron of a gene containing a mutation in the splice donor site, resulting in excision of a segment of the intron containing the mutation. The system simultaneously joins the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site. This results in the creation of a new donor splice site, which allows an RNA (e.g., a pre-mRNA) transcribed from the edited DNA to properly splice (e.g., to splice such that the RNA and the polypeptide translated from the RNA contains all of the exons of a naturally occurring polypeptide, for example, a polypeptide translated from a polynucleotide containing no splicing mutations). In an embodiment, the genome editing system comprises at least two nucleases that generate blunt-ended DNA breaks in a sequence-specific manner.
[0093] In an embodiment, the intron comprises a splice donor site mutation that alters splice site recognition, causing a splicing defect. Excising the segment of the intron containing the splice site donor mutation and simultaneously joining the DNA ends flanking the excised segment of the intron results in constitution of a functional donor splice site, thereby restoring splice site recognition in the intron.
[0094] In an embodiment, the gene editing system uses a homology directed repair-independent mechanism to join the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site. In a preferred embodiment, the homology directed repair-independent mechanism utilizes non-homologous end joining. This mechanism can be used on all cell types where correction of a splicing defect is desired. This mechanism is particularly useful in post-mitotic (i.e., terminally differentiated) cells, where correction of a splicing defect is desired. Examples of post-mitotic cells, include but are not limited to, myocytes, cardiomyocytes, neurons, adipocytes, and skin cells.
[0095] In an embodiment, the excising of the intron occurs at a mutation-eliminating cut site of the intron and at a splice site-generating cut site of the intron. It will be understood that the genome editing system may comprise a nuclease that generates a mutation-eliminating cut site and a splice generating cut site. In an embodiment, the nuclease that generates the mutation-eliminating cut site is the same as the nuclease that generates the splice generating cut site. In another embodiment, the nuclease that generates the mutation-eliminating cut site is different from the nuclease that generates the splice site-generating cut site.
[0096] The gene editing system comprises at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. There are a variety of systems that can be used to generate blunt-ended DNA breaks in a sequence-specific manner. For example, the system may be a Cas9 polypeptide or a variant of a Cas9 polypeptide that recognizes a Protospacer Adjacent Motif (PAM) (Cebrian-Serrano and Davies, “CRISPR-Cas orthologues and variants: optimizing the repertoire, specificity and delivery of genome engineering tools”, Mamm Genome, DOI 10.1007/s00335-017-9697-4; and Karvelis et al., “Harnessing the natural diversity and in vitro evolution of Cas9 to expand the genome editing toolbox”, Current Opinion in Microbiology, 2017, 37:88-94). In a system using Cas9, the genome editing system further comprises a mutation-eliminating single guide RNA (sgRNA) and a splice site-generating sgRNA. In this system, the mutation-eliminating sgRNA and a splice site-generating sgRNA each comprise a CRISPR RNA (crRNA) portion as determined appropriate by the person skilled in the art. For example, each of the crRNAs may be about 17 to 27 nucleotides in length.
[0097] Methods for designing and making sgRNAs are known to those skilled in the art, and include internet based methods (see also, e.g., Haeussler M. et al., “Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR”, Genome Biol. 2016 Jul. 5; 17(1):148. doi: 10.1186/s13059-016-1012-2). For example, sgRNAs can be designed by finding a PAM motif and considering the preceding 17-27 nucleotides (depending on the Cas9 species to be used) as a target sequence for crRNA. Since it is known that the cleavage will occur at position PAM-3 one can predict where Cas9 is going to cut. Additional scoring for off-target effects can be done by multiple algorithms known to those skilled in the art.
[0098] The Cas9 nuclease may be from any species, including but not limited to, S. pyogenes, S. aureus, S. thermophilus, C. jejuni, and F. novicida. In one embodiment, the Cas9 is from S. aureus. Cas9 nucleases generate blunt ended DNA breaks in a PAM-dependent manner, three nucleotides upstream (i.e., when the nucleotides are read in a 5′ to 3′ direction) of a PAM sequence (i.e., PAM-3). In another embodiment, the mutation-eliminating cut site and/or the splice generating cut site is at a Cas9 cleavage site occurring in the presence of one of the following PAM nucleotide sequences: NGG (S. pyogenes, original), NAG (S. pyogenes, original), NGA (S. pyogenes EQR variant), NGCG (S. pyogenes VRER variant), NNGRRT (S. aureus), NNNRRT (S. aureus KKH variant), NNAGAA (S. thermophilus), NNNNRYAC (C. jejuni), YG (F. novicida).
[0099] In an embodiment, the genome editing system may comprise a fusion polypeptide comprising a catalytically inactive (i.e., nuclease deficient) Cas9 (dCas9) polypeptide that recognizes a PAM and a nuclease that generates blunt-ended DNA breaks (e.g., such as the PAM sequences described above). dCas9 is commercially available (e.g., Novateinbio). In an embodiment, the catalytically inactive dCas9 is fused to nuclease generating blunt ended DNA breaks. In such an embodiment, the genome editing system comprises a left and right dCas9 that specify binding to DNA in a PAM-dependent manner, followed by a nuclease-dependent cleavage. To excise a segment of an intron containing a splice donor site mutation, two sets of fusion proteins can be used in the system, one that creates blunt-ended double strand breaks at a mutation-eliminating cut site and one that creates blunt-ended double strand breaks at splice site-generating cut site. The excision can them be repaired, for example, by non-homologous end joining.
[0100] Specific DNA cleavage (between any desired nucleotides) can be accomplished using other genome-editing systems. Zinc finger nucleases (ZFN) and TALENs are examples of such systems. In general, these systems work as a fusion polypeptide of DNA recognizing protein domains to a nuclease. Precise DNA targeting is accomplished by engineering the zinc finger portion or the TALE portion fused to a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner, and DNA cleavage occurs through the nuclease portion of the fusion polypeptide. Gene editing systems comprising ZFNs and TALENs work as pairs (left and right), so in embodiments in which ZFNs and TALENs are utilized, the genome editing system includes pairs of ZFN or TALENs and the nucleases contained on the ZFNs or TALENs dimerize for the DNA cleavage to occur. In one embodiment, the ZFNs or TALE domains combine with a Mlyl nuclease, which upon dimerization generates blunt ended DNA breaks. To excise a segment of an intron containing a splice donor site mutation, two sets of fusion proteins can be used in the system, one that creates blunt-ended double strand breaks at a mutation-eliminating cut site and one that creates blunt-ended double strand breaks at splice site-generating cut site. The excision can them be repaired, for example, by non-homologous end joining.
[0101] In various embodiments, a linker is positioned between the DNA recognizing protein domain (e.g., dCas9, TALE or zinc finger) and a nuclease that generates blunt-ended DNA breaks. The linker may be, for example, a polypeptide linker. The size of the linker may vary based on the distance between the location of the DNA site the DNA recognizing protein domain recognizes and the location at which the nuclease cuts the DNA in a sequence-specific manner.
[0102] In other embodiments, the nuclease that cuts the DNA in a sequence-specific manner itself can directly target the DNA sequence of interest (e.g., a mutation-eliminating cut site or a splice site-generating cut site). In other embodiments, the nuclease that cuts the DNA in a sequence-specific manner can be fused to a DNA binding protein or to an oligonucleotide that recognises a DNA sequence in proximity of an intended cut site.
[0103] The disclosed genome editing systems may be used to correct various splicing defects. In an embodiment, the splicing site defect is in a splice donor site. The mutation in the splice donor site may occur at the +1 position of the splice donor site. In other various embodiments, the mutation in the splice donor site occurs at the +2, +3, +4, +5, or +6 position of the splice donor site. For example, the splice site donor mutation can be a mutation in a gene provided in Table 1.
TABLE-US-00001 TABLE 1 Donor Splice Mutations Name, mutation, position in the splice donor site Gene(s) Condition(s) CYP21A2, IVS7DS, G-C, +1 CYP21A2 21-hydroxylase deficiency SERAC1, IVS13DS, G-C, +1 SERAC1 3-methylglutaconic aciduria with deafness, encephalopathy, and Leigh- like syndrome HMBS, IVS6DS, G-C, +1 HMBS Acute intermittent porphyria ABCD1, IVS6DS, G-A, +1 ABCD1 Adrenoleukodystrophy JAG1, IVS23DS, G-C, +1 JAG1 Alagille syndrome 1 HR, IVS12DS, G-A, +1 HR Alopecia universalis congenita MHC2TA, IVS13DS, G-A, +1 CIITA Bare lymphocyte syndrome type 2, complementation group A COL6A1, IVS3DS, G-A, +1 COL6A1 Bethlem myopathy 1 FYCO1, IVS9DS, G-T, +1 FYCO1 Cataract, autosomal recessive congenital 2 SBF2, IVS32DS, G-C, +1 SBF2 Charcot-Marie-Tooth disease, type 4B2 TBX22, IVS4DS, G-A, +1 TBX22 Cleft palate with ankyloglossia PRKCSH, IVS4DS, G-C, +1 PRKCSH Congenital cystic disease of liver CPS1, IVS29DS, G-C, +1 CPS1 Congenital hyperammonemia, type I ELANE, IVS4DS, G-A, +1 ELANE Cyclical neutropenia DIAPH1, IVS17DS, G-T, +1 DIAPH1 Deafness, autosomal dominant 1 MYO15A, IVS4DS, G-T, +1 MYO15A Deafness, autosomal recessive 3 TECTA, IVS9DS, G-A, +1 TECTA Deafness, neurosensory autosomal recessive 21 DSPP, IVS3DS, G-A, +1 DSPP Dentinogenesis imperfecta- Shield's type II KRT5, IVS1DS, G-A, +1 KRT5 Epidermolysis bullosa herpetiformis, Dowling- Meara PTH1R, IVS8DS, G-A, +1 PTH1R Failure of tooth eruption, primary AAAS, IVS11DS, G-A, +1 AAAS Glucocorticoid deficiency with achalasia CETP, IVS14DS, G-A, +1 CETP Hyperalphalipoproteinemia TRPM6, IVS16DS, G-A, +1 TRPM6 Hypomagnesemia 1, intestinal DSG4, IVS3, G-T, +1 DSG4 Hypotrichosis 6 GIF, IVS1DS, G-A, +1 GIF Intrinsic factor deficiency ATP2A2, IVS6DS, G-A, +1 ATP2A2 Keratosis follicularis GHR, IVS9DS, G-A, +1 GHR Laron syndrome with elevated serum GH-binding protein GHR, IVS4DS, G-A, +1 GHR Laron-type isolated somatotropin defect MYH7, IVS8DS, G-A, +1 MYH7 Left ventricular noncompaction 5 SPRED1, IVS5DS, G-A, +1 SPRED1 Legius syndrome ITGB2, IVSDS, G-A, +1 ITGB2 Leukocyte adhesion deficiency LPA, IVS K4 TYPE 8 LPA Lipoprotein(a) deficiency, DS, G-A, +1 congenital KCNH2, IVS3DS, G-C, +1 KCNH2 Long QT syndrome 2 XK, IVS2DS, G-A, +1 XK McLeod neuroacanthocytosis syndrome CDT1, IVS2DS, G-C, +1 CDT1 Meier-Gorlin syndrome 4 ARSA, IVS4DS, G-A, +1 ARSA Metachromatic leukodystrophy, severe COL1A1, IVS26DS, G-A, +1 COL1A1 Osteogenesis imperfecta type I RSPO1, IVS5DS, G-A, +1 RSPO1 Palmoplantar hyperkeratosis and true hermaphroditism SDHB, IVS1DS, G-T, +1 SDHB Paraganglioma and gastric stromal sarcoma PIGA, IVS5DS, G-A, +1, PIGA Paroxysmal nocturnal SOMATIC hemoglobinuria 1 AMHR2, IVS2DS, G-T, +1 AMHR2 Persistent mullerian duct syndrome, type II CARD14, IVS3DS, G-A, +1 CARD14 Pityriasis rubra pilaris Polycystic kidney disease, PKD1, IVSDS, G-C,+1 PKD1 adult type HMBS, IVS1DS, G-T, +1 HMBS Porphyria, acute intermittent, nonerythroid variant CNGB1, IVS32DS, G-A, +1 CNGB1 Retinitis pigmentosa 45 PDE6G, IVS3DS, G-T, +1 PDE6G Retinitis pigmentosa 57 CEP152, IVS4DS, G-C, +1 CEP152 Seckel syndrome 5 ADA, IVS2DS, G-A, +1 ADA Severe combined immunodeficiency due to ADA deficiency PTPRC, IVS13DS, G-A, +1 PTPRC Severe combined immunodeficiency, autosomal recessive, T cell- negative, B cell-positive, NK cell-positive NEU1, IVSEDS, G-C, +1 NEU1 Sialidosis, type II NSD1, IVS20DS, G-A, +1 NSD1 Sotos syndrome 1 DNAJB2, IVS5DS, G-A, +1 DNAJB2 Spinal muscular atrophy, distal, autosomal recessive, 5 FAM58A, IVS4DS, G-A, +1 FAM58A STAR syndrome SFTPC, IVS4DS, G-T, +1 SFTPC Surfactant metabolism dysfunction, pulmonary, 2 TBX3, IVS2DS, G-C, +1 TBX3 Ulnar-mammary syndrome NODAL, IVS2DS, G-A, +1 NODAL Visceral heterotaxy 5, autosomal MITF, IVS1DS, G-A, +1 MITF Waardenburg syndrome type 2A BTK, IVS16DS, G-T, +1 BTK X-linked agammaglobulinemia STS, IVS8DS, G-T, +1 STS X-linked ichthyosis with steryl-sulfatase deficiency DLG3, IVS8DS, G-A, +1 DLG3 X-Linked mental retardation 90 IL2RG, IVS3DS, G-A, +1 IL2RG X-linked severe combined immunodeficiency ABCC2, IVS18DS, T-C, +2 ABCC2 Dubin-Johnson syndrome C6, IVS15DS, T-C, +2 C6 C6 deficiency, subtotal COL2A1, IVS51DS, T-C, +2 COL2A1 STICKLER SYNDROME, TYPE I, NONSYNDROMIC OCULAR FERMT1, IVS3DS, T-C, +2 FERMT1 Kindler's syndrome FRMD7, IVS3DS, T-G, +2 FRMD7 Infantile nystagmus, X- linked GABRG2, IVS6DS, T-G, +2 GABRG2 Epilepsy, childhood absence 2|Familial febrile seizures 8 KRIT1, IVS2DS, T-C, +2 KRIT1 Cerebral cavernous malformations 1 MARVELD2, IVS4DS, MARVELD2 Deafness, autosomal T-C, +2 recessive 49 NDUFS6, IVS2DS, T-A, +2 NDUFS6 Mitochondrial complex I deficiency OTOA, IVS12DS, T-C, +2 OTOA Deafness, autosomal recessive 22 PLP1, IVS3DS, T-C, +2 PLP1 Pelizaeus-Merzbacher disease PRDM5, IVS1DS, T-C, +2 PRDM5 Brittle cornea syndrome 2 RPL5, IVS2DS, T-G, +2 RPL5 Aase syndrome SMARCAD1, IVS1DS, T-C, +2 SMARCAD1 Adermatoglyphia TRPM1, IVS16DS, T-C, +2 TRPM1 Congenital stationary night blindness, type 1C TULP1, IVS7DS, T-C, +2 TULP1 Retinitis pigmentosa 14 WAS, IVS6DS, T-G, +2 WAS Wiskott-Aldrich syndrome SMARCAD1, IVS1DS, A-T, +3 SMARCAD1 Basan syndrome COL1A2, IVS26DS, A-G, +3 COL1A2 Osteogenesis imperfecta with normal sclerae, dominant form ATP7A, IVSXDS, A-T, +3 ATP7A Menkes disease, mild F8, IVS6DS, A-G, +3 F8 Hereditary factor VIII deficiency disease RPGR, IVS10DS, A-G, +3 RPGR Retinitis pigmentosa 15 COLQ, IVS16DS, A-G, +3 COLQ Endplate acetylcholinesterase deficiency BLNK, IVS1DS, A-T, +3 BLNK Agammaglobulinemia 4, autosomal recessive NDUFV1, IVS8DS, A-C, +4 NDUFV1 Mitochondrial complex I deficiency ENG, IVS3DS, A-G, +4 ENG Osler hemorrhagic telangiectasia syndrome HSPG2, IVS64DS, A-G, +4 HSPG2 Schwartz Jampel syndrome type 1 SMARCAD1, IVS1DS, G-C, +5 SMARCAD1 Adermatoglyphia OCLN, IVS5DS, G-A, +5 OCLN Band-like calcification with simplified gyration and polymicrogyria RFX5, IVS2DS, G-A, +5 RFX5 Bare lymphocyte syndrome, type II, complementation group c COL1A2, IVS11DS, G-A, +5 COL1A2 Ehlers-Danlos syndrome, autosomal recessive, cardiac vascular form FANCB, IVS7DS, G-A, +5 FANCB Fanconi anemia, complementation group B F8, IVS12DS, G-A, +5 F8 Hereditary factor VIII deficiency disease IKBKG, IVS6DS, G-A, +5 IKBKG Hypohidrotic ectodermal dysplasia with immune deficiency TRPM6, IVS8DS, G-C, +5 TRPM6 Hypomagnesemia 1, intestinal HPRT, IVS7DS, G-A, +5 HPRT1 Lesch-Nyhan syndrome COL9A3, IVS3DS, G-A, +5 COL9A3 Multiple epiphyseal dysplasia 3 COL1A1, IVS14DS, G-A, +5 COL1A1 Osteogenesis imperfecta, recessive perinatal lethal PROS1, IVS10DS, G-A, +5 PROS1 Protein S deficiency NPHP3, IVS13DS, G-A, +5 NPHP3 Renal-hepatic-pancreatic dysplasia ADAMTS13, IVS13DS, G-A, +5 ADAMTS13 Upshaw-Schulman syndrome DLG3, IVS6DS, G-A, +5 DLG3 X-Linked mental retardation 90 ATP7A, IVS6DS, T-A, +6 ATP7A Cutis laxa, X-linked SMN1, IVS7DS, T-G, +6 SMN1 Kugelberg-Welander disease, spinal Muscular Atrophy Type III
[0104] In the disclosed genome editing system, the splice site-generating cut site is positioned before a branch splice point for an adjacent exon. Methods for determining branch splice point for an adjacent exon are known to those skilled in the art (see, e.g., Mercer, Genome Research, 2015 PMID: 25561518; Reed, PNAS, 1988 PMID: 3060403). For example, the splice site-generating cut site may be positioned at least, but not limited to, 25, 50, 100, 150, 200, 300, 400, 500, 600, 700, 800, 900 or 1000 base pairs from the mutation-eliminating cut site, provided the splice site-generating cut site is positioned before the branch splice point for an adjacent exon.
[0105] In one embodiment, the genome editing system comprises a mutation-eliminating single guide RNA (sgRNA); a splice site-generating sgRNA and a Cas9 molecule that recognizes a PAM having the nucleotide sequence NNGRRT, wherein the mutation-eliminating cut site of the intron is positioned 3 nucleotides in the 5′ direction of the end of the PAM.
[0106] In one embodiment, mutation-eliminating cut sites and splice site-generating cut sites are determined by predicting splice sites in a nucleotide sequence. Methods for predicting splices sites are generally known in the art and include the use of tools such as Human Splicing Finder (HSF) (http://www.umd.be/HSF3/), the MaxEntScan donor model and/or the DeepScan donor model. Based on the sequence of the nucleotides at in in proximity to the mutation-eliminating cut sites and splice site-generating cut sites the appropriate nuclease can be selected and the cuts sites can be generated. Excision of the identified intron segment (and the mutation) and simultaneous joining together of the DNA ends flanking the excised intron segment result in constitution of a functional donor splice site, such that the gene can now be properly spliced and a functional protein can be generated upon translation.
[0107] In various embodiments, the genome editing system further comprises a delivery vehicle for delivery of the other components of the system (e.g., for delivery of the at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner, and one or more of the sgRNAs, if used in the system). In an embodiment, polynucleotides coding for the at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner are in a vector. In another embodiment, when the nuclease is Cas9, the vector further comprises a mutation-eliminating single sgRNA or a splice site-generating sgRNA. The delivery vehicle may be an adeno-associated virus (AAV) vector, for example, an AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 or AAV9 vector. In a preferred embodiment, the delivery vehicle is an AAV9 vector.
[0108] In certain embodiments, a mutation-eliminating sgRNA is present on a first nucleic acid molecule, e.g. a first vector (e.g., a first viral vector, such as a first AAV vector); and a splice generating sgRNA is present on a second nucleic acid molecule e.g., a second vector (e.g., a second vector, for example, a second AAV vector).
[0109] In certain embodiments, a mutation-eliminating sgRNA and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner are present on one nucleic acid molecule, e.g., one vector (e.g., one viral vector, such as one AAV vector). In certain embodiments, a mutation-eliminating sgRNA and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner are present on different vectors. For example, a mutation-eliminating sgRNA is present on a first nucleic acid molecule, e.g. a first vector, (e.g., a first viral vector, for example, a first AAV vector); and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner are present on a second nucleic acid molecule, e.g., a second vector (e.g., a second vector, such as a second AAV vector).
[0110] In certain embodiments, a splice generating sgRNA and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner are present on one nucleic acid molecule, e.g., one vector, (e.g., one viral vector, for example, one AAV vector). In certain embodiments, a splice generating sgRNA and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner are present on different vectors. For example, a splice generating sgRNA is present on a first nucleic acid molecule, e.g. a first vector (e.g., a first viral vector, such as, a first AAV vector); and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner are present on a second nucleic acid molecule, e.g., a second vector (e.g., a second vector, for example, a second AAV vector).
[0111] The vectors may comprise a promoter operably linked to a mutation-eliminating sgRNA, a splice generating sgRNA or polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. Where a vector contains more than one nucleotide sequence (e.g., a mutation-eliminating sgRNA and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner or a splice generating sgRNA and polynucleotides encoding a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner) the vector may further comprise a second promoter operably linked to the second nucleotide sequence. When two promoters are employed in a vector, the promoter may be the same or they may be different. Promoters for use in expression vectors are known to those skilled in the art.
[0112] In addition to AAVs, the vector may comprise plasmid DNA, encoding CRISPR/Cas9 components (Cas9+sgRNA) as a delivery vehicle. Alternatively, the delivery vehicle can comprise RNA, where polynucleotides encoding Cas9 and sgRNAs are comprised of RNA molecules. Alternatively, the delivery vehicle can be in an RNP (ribonucleoprotein) form, where Cas9 is delivered in a complex with sgRNA (RNA form).
[0113] In other embodiments, the delivery vehicle is a non-viral delivery vehicle, for example, a liposome, such as a cationic liposome, or polymeric nanoparticles. Methods for delivering the disclosed gene editing system to a cell are known in the art.
[0114] In an embodiment, the genome editing system comprises a S. aureus Cas9 molecule that recognizes a PAM, a mutation-eliminating sgRNA and a splice site-generating sgRNA. In an embodiment, the mutation-eliminating sgRNA comprises the nucleotide sequence aaaacatatatataatacatatGGT (SEQ ID NO: 1). The genome editing system targets a pathogenic mutation in the Lama2 gene, wherein the mutation is a c.417+1 g.fwdarw.a mutation. In certain embodiments, the mutation is in a subject having merosin deficient congenital muscular dystrophy (MDC1A).
[0115] Compositions comprising one or more components of the gene editing system are also disclosed. The composition may comprise, for example, at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. The composition is configured to form a first and a second blunt-ended double strand break in an intron of a gene, wherein the intron comprises a splice donor site mutation that alters splice site recognition, thereby excising a segment of the intron and simultaneously joining DNA ends flanking the excised segment of the intron to constitute a functional donor splice site.
[0116] The nuclease may be in a fusion polypeptide comprising a DNA recognizing protein domain and the nuclease. For example, the fusion polypeptide may comprise a dCas9 molecule that is fused to a nuclease that generates blunt-ended DNA breaks in a sequence-specific manner as described herein. In an embodiment, zinc finger polypeptides are the DNA recognizing protein domain. Similarly, the fusion polypeptide may comprise a TALE polypeptide as the DNA recognizing protein domain. When the nuclease portions of the fusion polypeptides dimerize, the pair of fusion polypeptides generates blunt-ended DNA breaks.
[0117] The composition may further comprise a linker between the catalytically inactive Cas9 and the nuclease or between a DNA recognizing protein domain and the nuclease, as described herein.
[0118] Where the nuclease is a Cas9 molecule, the composition further comprises a mutation-eliminating sgRNA and a splice site-generating sgRNA. Such compositions may further comprise a delivery vehicle such as a vector (including a viral vector) as described herein or liposomes or nanoparticles. The compositions may be used in the genome editing systems disclosed herein. Each of the mutation-eliminating sgRNA and the splice site-generating sgRNA may comprise a CRISPR RNA (crRNA) portion that is 17 to 27 nucleotides in length.
[0119] Methods of Constituting a Functional Donor Splice Site in a Gene
[0120] Methods of constituting a functional donor splice site in a gene, where the gene comprises a mutation in a donor splice site that alters splice site recognition are also provided. The method comprises delivering a genome editing system to the gene. The genome editing system comprises at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. The genome editing system is configured to form a first and a second blunt-ended double strand break in an intron of a gene, wherein the intron comprises a splice donor site mutation that alters splice site recognition. This results in excision of a segment of the intron and simultaneously joining of DNA ends flanking the excised segment of the intron to constitute a functional donor splice site. This results in restored splice site recognition in the intron. The joining of the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site occurs through a homology directed repair-independent mechanism, for example, through non-homologous end joining.
[0121] Other features of the genome editing system for using in a method of constituting a functional donor splice site in a gene are as described herein.
[0122] In an embodiment, the gene with the donor splice mutations is a gene listed in Table 1.
[0123] Methods of detecting constitution of a functional donor splice site in a gene, where the gene comprises a mutation in a donor splice site that alters splice site recognition are known to those skilled in the art. For example, RT-PCR can be performed on the cDNA flanking the exon affected by the splicing defect and gel electrophoresis can be used to determine if a functional donor splice site has been constituted. Western blot analysis may be performed to determine of a full-length protein has been generated, indicating successful constitution of a function donor splice site. Methods for performing these techniques are known in the art.
[0124] Cells
[0125] Isolated cells comprising a gene whose mutation at a splice donor site has been repaired according to the method for constituting a functional donor splice site in a gene, where the gene comprises a mutation in a donor splice site that alters splice site recognition are also provided.
[0126] The cell may be a post-mitotic cell, for example, a myocyte, cardiomyocyte, neuron, adipocyte, or skin cell.
[0127] Components of a genome editing system as described herein are can be delivered by a vector, for example, an AAV vector as described herein. Alternatively, the components can be delivered using non-viral systems, for example, liposomes or nanoparticles. Delivery into cells can occur using techniques known in the art, including, but not limited to microinjection, electroporation, lipid-mediated transfection, RNP-mediated delivery, RNA-mediated delivery, or a combination thereof.
[0128] Methods of treating a disease or condition caused by a mutation in a splice donor site in a gene of a subject
[0129] Methods for treating a disease or condition caused by a mutation in a splice donor site in a gene of a subject in a subject in need of such treatment are also provided. The method comprises delivering a genome editing system to the intron. The genome editing system comprises at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. The genome editing system is configured to form a first and a second blunt-ended double strand break in an intron of a gene, wherein the intron comprises a splice donor site mutation that alters splice site recognition. This results in excision of a segment of the intron and simultaneously joining of DNA ends flanking the excised segment of the intron to constitute a functional donor splice site. This results in restored splice site recognition in the intron. The joining of the DNA ends flanking the excised segment of the intron to constitute a functional donor splice site occurs through a homology directed repair-independent mechanism, for example, through non-homologous end joining.
[0130] Other features of the genome editing system for using in a method of treating a disease or condition caused by a mutation in a splice donor site in an intron in a subject are as described herein.
[0131] In an embodiment, the subject has a disease or condition caused by a donor splice mutation in a gene as listed in Table 1.
[0132] In certain embodiments, the treatment is initiated prior to onset of the disease. In certain embodiments, the treatment is initiated after onset of the disease. In certain embodiments, the treatment is initiated prior to onset of a symptom of the disease. Alternatively, the treatment is initiated at onset of one or more symptoms of the disease. In certain embodiments, the treatment is initiated after onset one or more symptoms of the disease.
[0133] Components of a genome editing system as described herein are can be delivered by a vector, for example, an AAV vector as described herein. Alternatively, the components can be delivered using non-viral systems, for example, liposomes or nanoparticles. Delivery into cells can occur using techniques known in the art, including, but not limited to microinjection, electroporation, lipid-mediated transfection, peptide-mediated delivery, or a combination thereof. The dose(s) of components of the system to use for treating a disease or condition caused by a mutation in a splice donor site of a gene will depend on the mode of delivery and is within the skill of the skilled person to determine.
[0134] Components of the genome editing system can be delivered systemically.
[0135] Systemic modes of administration include oral and parenteral routes, including but not limited to intravenous, intra-arterial, intramuscular, intradermal, subcutaneous, intranasal, and intraperitoneal routes.
[0136] Components of the genome editing system can be delivered locally. Local modes of administration include injection directly into one or more specific tissues. In one embodiment, local delivery occurs by direct injection in a muscle on the subject.
[0137] Kits
[0138] The present disclosure contemplates kits for carrying out the methods disclosed herein. Such kits typically comprise two or more components required for constituting a functional donor splice site in a gene, where the gene comprises a mutation in a donor splice site that alters splice site recognition. Components of the kit include, but are not limited to, one or more of compounds, reagents, containers, equipment and instructions for using the kit. Accordingly, the methods described herein may be performed by utilizing pre-packaged kits provided herein. In one embodiment, the kit comprises at least one nuclease that generates blunt-ended DNA breaks in a sequence-specific manner. In some embodiments, the instructions comprise one or more protocols for constituting a functional donor splice site in a gene, where the gene comprises a mutation in a donor splice site that alters splice site recognition. In some embodiments, the kit comprises one or more controls.
[0139] The following non-limiting examples illustrative of the disclosure are provided.
Example 1: Materials and Methods
[0140] Cloning and Virus Production
[0141] Each sgRNA shown in Table 2, was cloned into pX601 plasmid (Addgene 61591) containing codon optimized S. aureus Cas9 (3).
TABLE-US-00002 TABLE 2 sgRNA Sequences Used in the Study Name Sequences (5′.fwdarw.3′) PAM ID number sgRNA 1 Top aacatatataatacatatGGT (SEQ ID NO: 2) ATGGAT RDC 495 Bottom ACCatatgtattatatatgtt (SEQ ID NO: 3) RDC 496 sgRNA 2 Top taggaagtagaaacttaggtc (SEQ ID NO: 4) aagaat RDC 501 Bottom gacctaagtttctacttccta (SEQ ID NO: 5) RDC 502 sgRNA 3 Top agagggcagagtcttgacatgtg (SEQ ID NO: 6) tggagt RDC 503 Bottom cacatgtcaagactctgccctct (SEQ ID NO: 7) RDC 504 sgRNA 4 Top ccctggccacagaatgctgtc (SEQ ID NO: 8) tggga RDC 1458 Bottom gacagcattctgtggccaggg (SEQ ID NO: 9) RDC 1459 sgRNA 5 Top atgaaagatgtataaggagtt (SEQ ID NO: 10) aagaat RDC 1460 Bottom aactecttatacatctttcat (SEQ ID NO: 11) RDC 1461 sgRNA 6 Top agtcttgacatgtgtggagtg (SEQ ID NO: 12) aagaa RDC 1462 Bottom cactccacacatgtcaagact (SEQ ID NO: 13) RDC 1463 sgRNA 7 Top gtgaagaaactggaaacagtt (SEQ ID NO: 14) cagaa RDC 1464 Bottom aactgtttccagtttcttcac (SEQ ID NO: 15) RDC 1465 sgRNA 8 Top ggctgacttagagatatggtt (SEQ ID NO: 16) tagaa RDC 1466 Bottom aaccatatctctaagtcagcc (SEQ ID NO: 17) RDC 1467 sgRNA 9 Top taatagagtaaatactgtgtt (SEQ ID NO: 18) tggggt RDC 1468 Bottom aacacagtatttactctatta (SEQ ID NO: 19) RDC 1469 sgRNA 10 Top ggaagatgcttatgttttgtt (SEQ ID NO: 20) gaggg RDC 1470 Bottom aacaaaacataagcatcttcc (SEQ ID NO: 21) RDC 1471 sgRNA 11 Top cacactgaaaagtaacaccaa (SEQ ID NO: 22) gtgaat RDC 1472 Bottom ttggtgttacttttcagtgtg (SEQ ID NO: 23) RDC 1473
[0142] As control, a GFP plasmid was constructed by replacing the SaCas9 cassette in pX601 with eGFP codon from pEGFP-C1 plasmid (Clontech Takara) using Age1 and EcoR1 directional cloning. 2100 bp stuffer sequence was incorporated into Xba1 site of the vector to maintain AAV genome packaging capacity. The pX601-GFP, -sgRNA1 and -sgRNA2 plasmids were produced as Adeno-associated virus serotype 9 (AAV9), concentrated and quantified by quantitative PCR (Vigene Biosciences). Injection-ready virus solution containing 7.5×10.sup.11 viral genome/vector was prepared and brought to 30 μl with 1×PBS (Gibco).
[0143] Cell Culture and Guide Screening
[0144] Primary myoblasts were isolated from EDL muscle of C57BL/6J wild-type and dy.sup.2J/dy.sup.2J mice (18) and maintained in DMEM supplemented with 1% Chicken Embryo Extract (Gemini Biosciences), 10% Horse Serum, 1% Penicillin/Streptomycin, 1% L-glutamine (All from Gibco unless indicated otherwise) at 37° C. with 5% CO.sub.2. Transfection was performed using Amaxa Basic Nucleofector Kit VPI-1002 (Lonza) in Nucleofector device (Lonza) under program number U-023. Cells were grown for 72 hours and genomic DNA was isolated with Blood and Tissue DNA Isolation Kit based on manufacturer's protocol (Qiagen). The activity of guides combination to remove the intervening region was assessed by PCR using primers in Lama2 exon 2 (RDC 551) and intron 2 (RDC 552) flanking the genomic deletion. The sequences used are shown in Table 3.
TABLE-US-00003 TABLE 3 Frequency of Different Genomic DNA Products after Cas9-mediated Deletion Sequences Percentage AATCCATACCGTCAAGAATG (SEQ ID NO: 24) 56.29% AATCCATAC--TCAAGAATG (SEQ ID NO: 25) 13.92% AATCCATACC-TCAAGAATG (SEQ ID NO: 26) 11.41% AATCCATACg--CAAGAATG (SEQ ID NO: 27) 0.29% AATCCATACtGTCAAGAATG (SEQ ID NO: 28) 0.23%
[0145] Animal Studies
[0146] The dy.sup.2J/dy.sup.2J and C57BLJ6J mice were purchased from the Jackson Laboratory and maintained in the Toronto Centre for Phenogenomics (TCP). All animal experiments were performed according to Animal Use Protocol number 16-0234H. No statistical methods were used to predetermine sample size. Intramuscular injection was performed on 3-weeks old dy.sup.2J/dy.sup.2J mice under isofluorane anesthesia. Treatments were randomized among littermates. No blinding was done in the intramuscular treatment. For systemic administration experiments involving neonatal pups, breeding cages were monitored daily for birth and the entire litter of P2 newborns were injected with either GFP- or guide-containing vectors. Mice were genotyped (19) and weaned at the age of 3 weeks old, where randomization occurred and all investigators were subsequently blinded for the different treatment groups throughout the study until the time of necropsy.
[0147] Functional Tests
[0148] Open field activity test and assessment of in vivo muscle force were performed on 10 weeks-old mice (for both intraperitoneal and temporal vein cohorts) at the Lunenfeld-Tanenbaum Research Institute's Centre for Modeling Human Disease Mouse Phenotyping Facility. For the open field test, mice were placed in the frontal center of a transparent Plexiglas open field (41.25 cm×41.25 cm×31.25 cm) illuminated by 200 lx. Trained operator, who is unaware of the nature of the projects and treatments, performed the experiments. The VersaMax Animal Activity Monitoring System recorded vertical activities and total distance travelled for 20 minutes per animal.
[0149] In vivo muscle contraction test was performed using 1300A: 3-in-1 Whole Animal System and analyzed using Dynamic muscle control/analysis (DMC/DMA) High throughput software suite (Aurora Scientific). The mice were anaesthetised with intraperitoneal injection of ketamine/xylazine cocktail at 100 mg/kg and 10 mg/kg of body weight, respectively. Contractile output was measured via percutaneous electrodes that stimulate specific nerves innervating the plantar flexors. Specific tetanic force (200 ms of 0.5-ms pulses at 125 Hz) was recorded and corrected to body weight.
[0150] RNA Isolation, RT-PCR, Quantitative PCR (qPCR)
[0151] RNA from cultured cells were isolated using Trizol (Invitrogen) according to manufacturer's instruction. For mouse tissues, muscles were sectioned at 10 μm thin, collected in 1.4 mm Zirconium Beads pre-filled tubes (OPS Diagnostics) and homogenized using a MagNA Lyser (Roche Diagnostics) using Trizol (Invitrogen). N6-primed cDNA was synthesized from 1000 ng of RNA using SuperScript III (Invitrogen). PCR amplification to detect exon inclusion was performed using Multiplex PCR mix (Qiagen) and primers in exons 1 and 4 (RDC 449 and 450), with sequences shown in Table 2.
[0152] qPCR was performed using Fast SYBR Green Master Mix on Step One Plus Real Time PCR (Applied Biosystems). Control PUC-Lama2 plasmid was generated by directional cloning in our laboratory. Briefly, exon 1-7 Lama2 cDNA was amplified from wild type muscles using primers RDC 1411 and RDC 1479, and cloned into PUC19 plasmid at BamH1 restriction sites. The expression of products with exon 2 inclusion was calculated using primers in exon 2/exon 3 junctions, and normalized to expression of products amplified with exon 4. Copy number of all samples was calculated relative to PUC-Lama2 plasmid control.
[0153] Protein Isolation and Western Blot
[0154] Sectioned muscles collected in the Zirconium beads tubes were homogenized in 500 μl of RIPA homogenizing buffer (50 mM Tris HCl pH 7.4, 150 nM NaCl, 1 mM EDTA supplemented with protease inhibitor cocktails (Roche)) and lysed with a MagNA Lyser. Subsequently, 500 μl of RIPA double detergents buffer (2% deoxycholate, 2% NP40, 2% Triton X-100 in RIPA homogenizing buffer) was added to the lysates, which were then incubated for 45 minutes at 4° C. with gentle agitation and centrifuged for 10 minutes at 13,000 rpm. The protein concentration was measured using BCA Assay (Thermo Scientific). Protein was separated on 3-8% Tris-Acetate gel and transferred using the Novex system (Invitrogen). The primary antibodies used were rat monoclonal anti-Lama2 (Abcam 11576 4H8-2, 1:500), rabbit polyclonal anti-tubulin (Abcam 4074, 1:5000), and mouse monoclonal anti-HA (Abcam 18181, 1:2500).
[0155] Immunofluorescence, H&E and Picrosirius Red Staining
[0156] Antibodies used for immunofluorescence stainings were rat monoclonal anti-Lama2 (Abcam 11576 4H8-2, 1:500), mouse monoclonal anti-Neurofilament H (NF-H, Biologend 801601 SMI 31, 1:1000), FITC-conjugated rat anti-mouse Cdllb clone M1/70 (BD Pharmingen 553310, 1:50) and imaged using either Zeiss Axiovert 200M or Nikon TE-2000 epifluorescence microscopes, or Olympus 1X81 quorum spinning disk confocal microscope. Perkin Elmer Volocity software was used for image acquisition.
[0157] Hematoxylin and eosin (H&E) staining was performed using standard protocol and Picrosirius Red staining was performed at Pathology Laboratory, the Hospital for Sick Children, Toronto. 300 fibers obtained from 4 different field of views per animal were analyzed using ImageJ 1.51h software for minimum Feret's diameter and cross-sectional area. Fibrosis quantification was performed from Sirius Red-stained (for intraperitoneal cohort) and H&E-stained (for temporal vein cohort) images by image-segmentation using ImageJ 1.51h software.
[0158] Off-Target Analysis by T7 Endonuclease Assay
[0159] Top 10 predicted off-target sites were computed using Benchling CRISPR Design tool (20). Genomic DNA isolated from treated- and control mice were PCR-amplified using primers targeting loci corresponding to each hit, as listed in Table 2.
[0160] Purified PCR products were then digested using T7 Endonuclease I (NEB) based on the manufacturer's instruction.
[0161] Deep Sequencing
[0162] Genomic DNA extracted from tibialis anterior (TA) muscles from the temporal vein cohorts (Qiagen Blood and Tissue kit) were amplified using oligos RDC 1474 and RDC 1461 as shown in Tables 2 and 4. A second round of PCR was performed to add 5′ and 3′ end barcodes, and sequenced with 125 bp paired-end reads on Illumina HiSeq 2500 at the Toronto Center for Applied Genomics (TCAG). Bioinformatic analyses were done similarly as previously described (21).
TABLE-US-00004 TABLE 4 Primers Used for PCR, RT-PCR, and qPCR ID Reference Name Sequences number Used for to figure(s) Lama2-guide AAG GAC CCG AGA RDC 551 Guide/deletion FIG. 1d screen F TGT ACT GC (SEQ ID screening NO: 29) Lama2-guide TGG CCT GTT CTT RDC 552 screen R AGT GCT TG (SEQ ID NO: 30) Lama2-guide GTGCCGAATCTGCAA RDC 1474 Guide/deletion FIG. S1 screen1-F TCAGAACAGC (SEQ screening ID NO: 31) ddPCR Lama2-guide cactccacacatgtcaagactct RDC 1475 screen1-R gc (SEQ ID NO: 32) Lama2-guide GAAAAAGGACCCGAG RDC 1476 Guide/deletion FIG. S1 screen2-F ATGTACTGCAAG screening (SEQ ID NO: 33) Lama2-guide ctatatcaaaattgattttgaaat RDC 1477 screen2-R gcaatgaagctg (SEQ ID NO: 34) Lama2-ex1 Fw 2 CAG TCC CAA GCG RDC449 Exon inclusion FIG. 1d CAT CAA C (SEQ ID detection NO: 35) Lama2-ex5 Fw GAATGTAGCGAGCAG RDC 1478 AGGTGAATTC (SEQ ID NO: 36) Lama2-ex4 Rev TGG CTG CCT TCA RDC 450 Exon inclusion FIGS. 2b, 3a, CAA TTA CG (SEQ ID detection 3b, 3i ,4b, NO: 37) S1, S6, S7 Lama2-5′UTR-F aggagtggatctgctccgg RDC 1411 Generation of FIGS. 3c, 4c (SEQ ID NO: 38) PUC-Lama2 Lama2-ex7 Rev1 CACACTCAGACTTGG RDC 1479 TGAGGAAG (SEQ ID NO: 39) Lama2-ex1ex2- GCATCAACAGAGAGG RDC 1480 qPCR FIGS. 3c, 4c gqPCR-F TTTATTTCCTG (SEQ ID NO: 40) Lama2-ex3-qPCR- CAATAGCATTCGTAAT RDC 1481 R CGGGTGC (SEQ ID NO: 41) Lama2-ex4ctrl- GTAATTGTGAAGGCA RDC 1482 qPCR-F GCCAATTC (SEQ ID NO: 42) Lama2-ex4ctrl- TGCGGGGATAGATAT RDC 1483 qPCR-R TGTAGAGG (SEQ ID NO: 43)
[0163] Quantitative Droplet Digital PCR (ddPCR)
[0164] ddPCR was performed on temporal vein samples using a QX200™ Droplet Digital™ PCR System with QX200™ ddPCR EvaGreen Supermix (BioRad). Three-steps ddPCR was performed with the following conditions: 95° C. (10 mins), 94° C. (30 sec), 62° C. (30 sec), 72° C. (1 min) for 45 cycles, and 98° C. (10 mins) using Lama2 guide screen 1F (RDC 1474) and 1R (RDC 1475). The data were analyzed using QuantaSoft Pro (BioRad).
Example 2: Exon Inclusion Strategy Enables Restoration of the Full-Length Lama2 Expression
[0165] Mammalian donor splice sites for major U2-type introns have a 9 nucleotides long consensus sequence A/C-A-G-g-t-a/g-a-g-u and are mainly recognized by canonical base pairing to the 5′ end of U1 small nuclear RNA (25). The first two intronic nucleotides (gt) are essentially invariant, whereas certain flexibility is allowed at all other positions. Targeted CRISPR/Cas9-mediated removal of an intronic region containing a pathogenic mutation was used to enable joining of Lama2 exon 2 with a ‘gt’ dinucleotide, leading to reconstitution of a functional donor splice site.
[0166] A mouse model of MDC1A, dy.sup.2J/dy.sup.2J harboring a splice site mutation in intron 2 of Lama2, annotated as c.417+1 g.fwdarw.a, which results in skipping of exon 2 and truncation of the N-terminal domain of the protein (
[0167] Approximately 1400 nucleotides in a proximal region of intron 2 were evaluated for SaCas9 targets that would generate genomic deletions in combination with sgRNA1 and, following NHEJ, create potential donor splice sites (splice site-generating cut) (
[0168] RT-PCR analysis revealed the presence of an amplicon similar to the wild-type control for sgRNA2, sgRNA3 and sgRNA5, indicating an inclusion of the exon 2 of Lama2 after generation of intronic deletions (
Example 3: Restoration of Full-length Lama2 and Improvement of Muscle Pathology Following Intramuscular Administration of AAV9-CRISPR
[0169] To assess the impact of the full-length Lama2 restoration in skeletal muscle, Cas9 and sgRNAs were delivered into the right tibialis anterior (TA) muscles of 3-week old dy.sup.2J/dy.sup.2J mice using adeno-associated virus serotype 9 (AAV9) (
[0170] RT-PCR analysis successfully identified transcripts with inclusion of exon 2 in all right TA muscles treated with AAV9-Cas9-sgRNA1 and -sgRNA2, but not in contralateral muscles and AAV9-GFP-treated mice (
[0171] The expression of Lama2 was analyzed by western blot, and it was observed that in all dy.sup.2J/dy.sup.2J-isolated muscles, a truncated form of Lama2 was present, whereas a slightly larger full-length protein was expressed in the TA muscles injected with AAV9-Cas9-sgRNA1 and -sgRNA2 combination (
[0172] To assess the localization of the restored Lama2, immunofluorescence staining was performed on the treated TA muscles. Lama2 immunoreactivity was detected at the sarcolemma of muscles fibers, similar to the wild-type controls (
[0173] This demonstrated the successful inclusion of the previously missing exon 2 and restoration of the full-length Lama2 protein at the sarcolemma of the muscle, which led to overall improvement of muscle histopathology supporting the therapeutic potential of this treatment strategy in the dy.sup.2J/dy.sup.2J mice.
Example 4: Restoration of Lama2 in Neonatal Mice Through Intraperitoneal Administration of AAV9-Cas9-sgRNA1 and -sgRNA2
[0174] It was next asked whether the dystrophic pathology, which is already apparent at pre-weaning age (17), could be ameliorated by restoration of Lama2 in neonatal mice. P2 dy.sup.2J/dy.sup.2J mice were injected with AAV9-Cas9-sgRNA1 and -sgRNA2 via intraperitoneal route and compared to AAV9-GFP-treated controls (
[0175] RT-PCR analysis of gastrocnemius muscles isolated 10 weeks post-injection showed successful exon 2 inclusion (
[0176] Histological examination using hematoxylin and eosin (H&E) and picrosirius red staining revealed an overall improved morphology of muscles with restored Lama2 expression (
[0177] Morphometric analysis showed ˜50% reduction of fibrosis (
[0178] RT-PCR and immunofluorescence staining analyses of sciatic nerves showed no restoration of Lama2 transcript and protein (
[0179] This finding recapitulated an earlier study demonstrating that muscle-specific overexpression of Laminin-211 in merosin-deficient mice improved muscle pathology, but not the associated paralysis (7), indicating that correction of the peripheral neuropathy requires restoration of Lama2 beyond skeletal muscles. The previously reported feasibility of CMV-driven gene expression mediated by AAV9 into sciatic nerves has been demonstrated mostly via vascular administration routes (26, 27).
Example 5: Full-length Expression of Lama2 Post-temporal Vein Injection of AAV9-Cas9-sgRNA1 and -sgRNA2
[0180] To achieve a more robust tissue distribution, AAV9-Cas9-sgRNA1 and -sgRNA2 were injected into the temporal vein of P2 neonatal dy.sup.2J/dy.sup.2J mice (
[0181] At 30 weeks post-injection, the muscles and sciatic nerves were harvested and analyzed for targeted genomic DNA deletion and expression of full-length Lama2 mRNA and protein. The expected ˜493 bp genomic deletion was detected by PCR (
[0182] RT-PCR and western blot analyses of the TA muscles indicated expression of full-length Lama2 at transcript and protein levels (
[0183] Lama2 restoration was also achieved in sciatic nerve at the transcript (
[0184] Consequently, the degree of contracture and paralysis in the hind limbs and overall mobility of the AAV9-Cas9-sgRNA1 and -sgRNA2-treated mice were markedly improved compared to those of control dy.sup.2J/dy.sup.2J mice. Supporting this finding, open field test demonstrated significantly higher total distance travelled in the AAV9-Cas9-sgRNA1 and -sgRNA2-treated mice compared to the control AAV9-GFP-treated dy.sup.2J/dy.sup.2J (
[0185] In addition to the on-target activity, Cas9 nuclease has been shown to rarely generate DSB at non-intended off-target sites throughout the genome (16). Off-target effects are a safety concern for the eventual translation of gene editing therapies to humans. No off-target mutations at twenty potential off-target sites (ten for each sgRNA) in the mouse genome were observed (
Example 6: Bioinformatic Analysis of Targetable Exon/Intron Junctions and Intronic “GT” Motifs with Currently Available Cas9 Species
[0186] Positions of all exon-intron junctions were obtained from GENCODE gene annotation gtf file (version 19, https://www.gencodegenes.org/releases/19.html) using customized scripts. 5′ splice site (+1) mutations were extracted from the Human Gene Mutation Data database (HGMD, version 20140404). All of these mutations are pathogenic or disease-causing mutations (DM). As used herein, the first nucleotide of the intron is defined as the “+1” position. For each transcript, introns were extracted. For introns that were longer than 1000 base pairs, only the first 1000 base pairs were retained for analysis. For each gene, overlapping introns from different transcripts were merged. If a shorter intron overlaps with a larger one, then the shorter intron was used. For the introns containing a 5′ splice site (+1) mutation, it was first examined if a “mutation-eliminating cut” site could be found. As used herein, a “mutation-eliminating cut” is defined as when any of the available Cas9s cleavage site occurring after the following PAM sequences: NGG, NAG, NGA, NGCG, NNGRRT, NNNRRT, NNAGAA, NNNNRYAC, YG could lead to a cutting that results in the elimination of the mutation. For mutations occurring at the “+1” position, the cut has to be made immediately upstream of the “+1” position (“cut at “+1”). The search for PAM sequences was done on both strands. For the introns where a “mutation-eliminating cut” could be found, the search continued into the introns and it was examined if a″ “splice site-generating cut” site could be found. The first two nucleotides (“gt” at +1 and +2 sites) were required to be restored. Finally, “splice site-generating cut” sites that were at least 50 base pairs away from the next exon were filtered for, to avoid interference with the splicing machinery, although the splice site-generating cut site may be less than 50 base pairs away from the next exon (e.g., at least 25 base pairs away from a downstream (5′ to 3) or adjacent exon).
[0187] Splice Site Prediction
[0188] Splice sites were scored using (a) the MaxEntScan donor model, receiving in input 3 exonic bp and 6 intronic bp (22) centered at the splice site; (b) the DeepScan donor model, a proprietary predictor developed at Deep Genomics Inc., receiving in input 40 exonic bp and 40 intronic bp centered at the splice site. Both predictors were trained based on human annotated splice sites and genome sequence, and applied to the mouse sequence without further adjustments. MaxEntScan was previously shown to be one of the best performing publicly-available methods also for evaluating the deleteriousness of splice site altering variants (23). DeepScan is based on a deep neural network architecture (24) with convolutional layers and hidden layers consisting of rectified linear units trained using stochastic gradient descent to distinguish sequences of annotated splice sites from decoy splice sites found in introns; it achieved 98% area under the ROC curve for distinguishing held-out donor splice sites from held-out intronic sequences with the core donor dinucleotide GT, achieving a better performance than MaxEntScan as seen in Table 5:
TABLE-US-00005 TABLE 5 Splice Site Prediction Scores Using DeepScan and MaxEntScan Donor Models N# GT % GT N# GT % GT % exons Avg Prediction sites sites sites not- pred pred >=1 pred GT Score ann* and ann and ann and and GT site site <=1 kb Threshold threshold pred** pred pred ann in <=1 kb per exon explanation Deep Scan 0.0022 188,884 0.978 13,110,240 0.014 0.995 35.34 Ultra sensitivity DONOR (recovers all experimentally- positive Lama2/Cas9 splice sites) 0.4800 173,761 0.900 1,165,522 0.130 0.864 12.85 Optimal sensitivity (90% intronic ‘gt’ annotated as splice sites recovered) 0.9790 85,873 0.445 85,732 0.500 0.536 2.75 Optimal specificity (“only” 50% intronic ‘gt' not- annotated as splice sites recovered) MaxEntScan −20.0095 192,810 0.998 47,356,342 0.004 1.000 41.62 Ultra sensitivity DONOR (same criteria) 4.9000 173,409 0.900 3,023,988 0.054 0.893 11.36 Optimal sensitivity (same criteria) *ann: annotated **pred: predicted
[0189] To derive genome wide statistics on donor splice site prediction, all intronic GT sites were extracted for GENCODE Basic version 19 transcripts, using the reverse complement of the hg19/b37 human reference genome sequence for negative strand transcripts; only sites for which the intronic categorization was not ambiguous were retained (i.e. intronic in all transcripts); sites were then categorized as annotated splice site donors or not. Finally, prediction models (MaxEntScan donor model, DeepScan donor model) were used to score these sites, and derive genome wide statistics at different prediction score thresholds. Exon statistics for reconstituted splice site donors (percentage of exons with at least one GT site within 1 kb predicted as donor when deleting the intervening intronic sequence and retaining the exon sequence, average number of such sites per exon) were calculated with respect to the longest transcript isoform of each gene in GENCODE Basic version 19.
Example 7: Correction of a Second Splice Site Mutation in a Patient with MDC1A
[0190] A patient has two mutations in the LAMA2 gene, with one of them being +1 mutation (c3037+1G>T) in 5′SS of exon 21 (
[0191] Using Human Splicing Finder (HSF) (http://www.umd.be/HSF3/) a “strength” of a non-mutated 5′ss was determined. For that, an exon 21 (181 nt)+50 intronic nucleotides (231 nt total) were analyzed by HSF (
[0192] For mutated sequence (c3037+1G>T) HSF and MaxEnt scores dropped to 59.24 and −1.65, respectively, indicating that alteration of 5′ss will likely cause a splicing defect (
[0193] By modelling a double stranded break at any position within a sequence, a set of new exon/intron junctions are generated by removing an intervening sequence containing the mutation and linking the end of the exon 21 with intronic sequences starting with ‘gt’ in order to constitute a possible 5′ss. An example is shown in
TABLE-US-00006 TABLE 6 HSF Scores and MaxEnt Scores of Potential Splices Sites in the Intronic Sequence that Could Constitute a Possible 5′ss for Exon 21 of the Lama2 Gene Name Exonic sequence Intronic sequences HSF score MaxEnt score 1 Control GCTGGGACCTTTGGCCT gtctgtaaatatgacttaag 86.17 6.84 ACAATCAGCAAGGGGCT tcctacatattcactctgata GTGTTCCCTGCAACTGC gttttaggg (SEQ ID AATTCTTTTGGGTCTAAG NO: 45) 2 Patient TCATTCGACTGTGAAGA ttctgtaaatatgacttaagt 59.34 -1.65 GAGTGGACAATGTTGGT cctacatattcactctgata GCCAACCTGGAGTCACA gttttaggg (SEQ ID GGGAAGAAATGTGACCG NO: 46) 3 Del 4 nt CTGTGCCCACGGCTATT gtaaatatgacttaagtcct 99.44 8.76 TCAACTTCCAAGAAGGA acatattcactctgatagtttt GGCTGCACAG (SEQ ID agggtctg (SEQ ID NO: 44) NO: 47) 4 Del 19 nt gtcctacatattcactctgat 71.52 3.12 agttttagggtctgacttctgt aacagtt (SEQ ID NO: 48) 5 Del 41 nt gttttagggtctgacttctgta 71.52 0.68 acagtttgttcttttaaagga agcacag (SEQ ID NO: 49) 6 Del 49 nt gtctgacttctgtaacagttt 83.84 5.06 gttcttttaaaggaagcaca gattgatac (SEQ ID NO: 50) 7 Del 60 nt gtaacagtttgttcttttaaa 85.65 8.88 ggaagcacagattgatac aacgtatttaa (SEQ ID NO: 51) 8 Del 66 nt gtttgttcttttaaaggaagc 88.22 7.44 acagattgatacaacgtatt taacataat (SEQ ID NO: 52) 9 Del 102 nt gtatttaacataatgagtaa 99.2 7.51 tgtctacttatttttacgttactt aaaaaa (SEQ ID NO: 53) 10 Del 118 nt gtaatgtctacttatttttacgt 99.8 9.43 tacttaaaaaatgaaatac atgtcagt (SEQ ID NO: 54) 11 Del 123 nt gtctacttatttttacgttactt 71.83 0.92 aaaaaatgaaatacatgt cagtttgaa (SEQ ID NO: 55) 12 Del 139 nt gttacttaaaaaatgaaat 85.93 0.49 acatgtcagtttgaaacact gccttaataac (SEQ ID NO: 56) 13 Del 162 nt gtcagtttgaaacactgcct 94.98 8.94 taataactgtgtgcaagtac tcagtaaaaa (SEQ ID NO: 57) 14 Del 190 nt gtgtgcaagtactcagtaa 82.38 6.46 aaatcagcagatgctcgttt ggttgtgtccc (SEQ ID NO: 58) 15 Del 192 nt gtgcaagtactcagtaaaa 85.42 2.83 atcagcagatgctcgtttgg ttgtgtccc (SEQ ID NO: 59) 16 Del 198 nt gtactcagtaaaaatcagc 96.36 7.04 agatgctcgtttggttgtgtc cc (SEQ ID NO: 60) 17 Del 205 nt gtaaaaatcagcagatgct 99.18 8.59 cgtttggttgtgtccc (SEQ ID NO: 61) 18 Del 225 nt gtttggttgtgtccctccctgt 82.66 5.63 ctgtgactttgattcagcca gtcactca (SEQ ID NO: 62) 19 Del 230 nt gttgtgtccctccctgtctgt 89.36 -0.29 gactttgattcagccagtca ctca (SEQ ID NO: 63) 20 Del 233 nt gtgtccctccctgtctgtgac 83.19 3.16 tttgattcagccagtcactc a (SEQ ID NO: 64) 21 Del 235 nt gtccctccctgtctgtgacttt 74.16 0.68 gattcagccagtcactca (SEQ ID NO: 65) 22 Del 245 nt gtctgtgactttgattcagcc 86.17 6.84 agtcactcaaacactgcta cggcttgaaa (SEQ ID NO: 66) 23 Del 249 nt gtgactttgattcagccagt 89.22 6.6 cactcaaacactgctacg gcttgaaa (SEQ ID NO: 67) 24 Del 267 nt gtcactcaaacactgctac 82.96 3.41 ggcttgaaagttaaatgttc gatttttctgg (SEQ ID NO: 68) 25 Del 295 nt gttaaatgttcgatttttctgg 81.28 2.54 taccagtcttatcttagtttaa gctcaa (SEQ ID NO: 69) 26 Del 302 nt gttcgatttttctggtaccagt 85.59 5.36 cttatcttagtttaagctcaa (SEQ ID NO: 70) 27 Del 316 nt gtaccagtcttatcttagttta 99.62 7.88 agctcaaaccccacgcca atgcatttca (SEQ ID NO: 71) 28 Del 322 nt gtcttatcttagtttaagctca 71.52 0.67 aaccccacgccaatgcatt tca (SEQ ID NO: 72) 29 Del 333 nt gtttaagctcaaaccccac 80.76 1.88 gccaatgcatttcaactttc cctttttccta (SEQ ID NO: 73) 30 Del 394 nt gtatccggatcaagaaaa 100 7.91 gtcaaggtgttttttttttcctg gtaccccaa (SEQ ID NO: 74) 31 Del 412 gtcaaggtgttttttttttcctg 80.88 2.98 gtaccccaaaaacatctttt catctac (SEQ ID NO: 75) 32 Del 418 nt gtgtttttttttcctggtaccc 83.62 2.28 caaaaacatcttttcatcta c (SEQ ID NO: 76) 33 Del 420 nt gttttttttttcctggtacccca 84.35 2.93 aaaacatcttttcatctac (SEQ ID NO: 77) 34 Del 435 nt gtaccccaaaaacatctttt 99.39 8.63 catctacttttggctgccttg aaaactatg (SEQ ID NO: 78) 35 Del 487 nt gttctaagagagaaatag 83.18 0.74 gttggctctagatatttagttc atcaaaaaat (SEQ ID NO: 79) 36 Del 505 nt gttggctctagatatttagttc 5.52 88.07 atcaaaaaatgcaggcag aggtttctaa (SEQ ID NO: 80)
[0194] A set of the generated junctions within the first 505 nucleotides in the intron are scored as described above (Table 6). For the new junction depicted in
[0195] Although the disclosure has been described with reference to certain specific embodiments, various modifications thereof will be apparent to those skilled in the art. Any examples provided herein are included solely for the purpose of illustrating the disclosure and are not intended to limit the disclosure in any way. Any drawings provided herein are solely for the purpose of illustrating various aspects of the disclosure and are not intended to be drawn to scale or to limit the disclosure in any way. The scope of the claims appended hereto should not be limited by the preferred embodiments set forth in the above description, but should be given the broadest interpretation consistent with the present specification as a whole. The disclosures of all art recited herein are incorporated herein by reference in their entirety as if each piece of art was specifically and individually indicated to be incorporated by reference.
DOCUMENTS CITED
[0196] 1. Scotti, M. M., Swanson, M. S. RNA mis-splicing in disease. Nat Rev Genet 17, 19 (2016) [0197] 2. Bladen, C. L., Salgado, D., Monges, S. et al. The TREAT-NMD DMD Global Database: Analysis of More than 7,000 Duchenne Muscular Dystrophy Mutations. Hum Mutat 36, 395 (2015). [0198] 3. Ran, F. A., Cong, L., Yan, W. X. et al. In vivo genome editing using Staphylococcus aureus Cas9. Nature 520, 186 (2015). [0199] 4. Durbeej, M. Laminin-alpha2 Chain-Deficient Congenital Muscular Dystrophy: Pathophysiology and Development of Treatment. Curr Top Membr 76, 31 (2015). [0200] 5. Geranmayeh, F., Clement, E., Feng, L. H. et al. Genotype-phenotype correlation in a large population of muscular dystrophy patients with LAMA2 mutations. Neuromuscul Disord 20, 241 (2010). [0201] 6. Stenson, P. D., Mort, M., Ball, E. V. et al. The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine. Hum Genet 133, 1 (2014). [0202] 7. Kuang, W., Xu, H., Vachon, P. H. et al. Merosin-deficient congenital muscular dystrophy. Partial genetic correction in two mouse models. J Clin Invest 102, 844 (1998). [0203] 8. Yang, Y., Wang, L., Bell, P. et al. A dual AAV system enables the Cas9-mediated correction of a metabolic liver disease in newborn mice. Nat Biotechnol 34, 334 (2016). [0204] 9. Yin, H., Xue, W., Chen, S. et al. Genome editing with Cas9 in adult mice corrects a disease mutation and phenotype. Nat Biotechnol 32, 551 (2014), [0205] 10. Yin, H., Song, C. Q., Dorkin, J. R. et al. Therapeutic genome editing by combined viral and non-viral delivery of CRISPR system components in vivo. Nat Biotechnol 34, 328 (2016). [0206] 11. Ferdousi, L., Rocheteau, P., Chayot, R. et al. More efficient repair of DNA double-strand breaks in skeletal muscle stem cells compared to their committed progeny. Stem Cell Research 13, 492 (2014). [0207] 12. Guiraud, S., Aartsma-Rus, A., Vieira, N. M. et al. The Pathogenesis and Therapy of Muscular Dystrophies. Annu Rev Genomics Hum Genet (2015) [0208] 13. Jinek, M., Chylinski, K., Fonfara, I. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816 (2012). [0209] 14. Cong, L., Ran, F. A., Cox, D. et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819 (2013). [0210] 15. Kim, S., Kim, D., Cho, S. W. et al. Highly efficient RNA-guided genome editing in human cells via delivery of purified Cas9 ribonucleoproteins. Genome Res 24, 1012 (2014). [0211] 16. Hendel, A., Bak, R. O., Clark, J. T. et al. Chemically modified guide RNAs enhance CRISPR-Cas genome editing in human primary cells. Nat Biotechnol (2015). [0212] 17. Mehuron, T., Kumar, A., Duarte, L. et al. Dysregulation of matricellular proteins is an early signature of pathology in laminin-deficient muscular dystrophy. Skelet Muscle 4, 14 (2014). [0213] 18. Kemaladewi, D. U., de Gorter, D. J., Aartsma-Rus, A. et al. Cell-type specific regulation of myostatin signaling. FASEB J 26, 1462 (2012). [0214] 19. Vilquin, J. T., Vignier, N., Tremblay, J. P. et al. Identification of homozygous and heterozygous dy2J mice by PCR. Neuromuscul Disord 10, 59 (2000). [0215] 20. Hsu, P. D., Lander, E. S., Zhang, F. Development and applications of CRISPR-Cas9 for genome engineering. Cell 157, 1262 (2014). [0216] 21. Wojtal, D., Kemaladewi, D. U., Malam, Z. et al. Spell Checking Nature: Versatility of CRISPR/Cas9 for Developing Treatments for Inherited Disorders. Am J Hum Genet 98, 90 (2016). [0217] 22. Yeo, G., Burge, C. B. Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals. J Comput Biol 11, 377 (2004). [0218] 23. Jian, X., Boerwinkle, E., Liu, X. In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Res 42, 13534 (2014). [0219] 24. LeCun, Y., Bengio, Y., Hinton, G. Deep learning. Nature 521, 436 (2015). [0220] 25. Roca, X., Krainer, A. R., Eperon, I. C. Pick one, but be quick: 5′ splice sites and the problems of too many choices. Genes Dev 27, 129 (2013). [0221] 26. Tanguy, Y., Biferi, M. G., Besse, A. et al. Systemic AAVrh10 provides higher transgene expression than AAV9 in the brain and the spinal cord of neonatal mice. Front Mol Neurosci 8, 36 (2015). [0222] 27. Zincarelli, C., Soltys, S., Rengo, G. et al. Analysis of AAV serotypes 1-9 mediated gene expression and tropism in mice after systemic injection. Mol Ther 16, 1073 (2008). [0223] 28. Gombash Lampe, S. E., Kaspar, B. K., Foust, K. D. Intravenous injections in neonatal mice. J Vis Exp e52037 (2014). [0224] 29. Iyer, S. R., Valencia, A. P., Hernandez-Ochoa, E. O. et al. In Vivo Assessment of Muscle Contractility in Animal Studies. Methods Mol Biol 1460, 293 (2016). [0225] 30. Call, J. A., Lowe, D. A. Eccentric Contraction-Induced Muscle Injury: Reproducible, Quantitative, Physiological Models to Impair Skeletal Muscle's Capacity to Generate Force. Methods Mol Biol 1460, 3 (2016). [0226] 31. Tsai, S. Q., Zheng, Z., Nguyen, N. T. et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat Biotechnol 33, 187 (2015). [0227] 32. Sunada, Y., Bernier, S. M., Utani, A. et al. Identification of a novel mutant transcript of laminin alpha 2 chain gene responsible for muscular dystrophy and dysmyelination in dy2J mice. Hum Mol Genet 4, 1055 (1995). [0228] 33. Xu, H., Wu, X. R., Wewer, U. M. et al. Murine muscular dystrophy caused by a mutation in the laminin alpha 2 (Lama2) gene. Nat Genet 8, 297 (1994). [0229] 34. Tatem, K. S., Quinn, J. L., Phadke, A. et al. Behavioral and locomotor measurements using an open field activity monitoring system for skeletal muscle diseases. J Vis Exp 51785 (2014). [0230] 35. Hirano, H., Gootenberg, J. S., Horii, T. et al. Structure and Engineering of Francisella novicida Cas9. Cell 164, 950 (2016). [0231] 36. Kim, E., Koo, T., Park, S. W. et al. In vivo genome editing with a small Cas9 orthologue derived from Campylobacter jejuni. Nat Commun 8, 14500 (2017). [0232] 37. Kleinstiver, B. P., Prew, M. S., Tsai, S. Q. et al. Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition. Nat Biotechnol 33, 1293 (2015). [0233] 38. Kleinstiver, B. P., Prew, M. S., Tsai, S. Q. et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature 523, 481 (2015).