CRISPR-BASED TREATMENT OF FRIEDREICH ATAXIA
20200056206 · 2020-02-20
Inventors
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
A61P25/14
HUMAN NECESSITIES
A61K31/7088
HUMAN NECESSITIES
A61K31/7088
HUMAN NECESSITIES
C12N9/22
CHEMISTRY; METALLURGY
C12N2750/14143
CHEMISTRY; METALLURGY
A61K38/465
HUMAN NECESSITIES
A61K48/0016
HUMAN NECESSITIES
C12N15/113
CHEMISTRY; METALLURGY
A61K48/005
HUMAN NECESSITIES
A61K38/465
HUMAN NECESSITIES
A61P25/28
HUMAN NECESSITIES
A61K31/7105
HUMAN NECESSITIES
A61K2300/00
HUMAN NECESSITIES
A61K2300/00
HUMAN NECESSITIES
International classification
C12N15/90
CHEMISTRY; METALLURGY
A61P25/28
HUMAN NECESSITIES
C12N15/113
CHEMISTRY; METALLURGY
A61P25/14
HUMAN NECESSITIES
C12N9/22
CHEMISTRY; METALLURGY
A61K31/7105
HUMAN NECESSITIES
Abstract
Methods of modifying a frataxin gene are disclosed, comprising removing some or all of endogenous GAA trinucleotide repeats within the frataxin gene, e.g., within an intron (e.g., intron 1) of the frataxin gene. The removal may be effected using a CRISPR/CAS nuclease system. Such modification may be used to increase frataxin expression in the cell, and also to treat a subject suffering from Friedreich ataxia. Reagents, kits and uses of the method are also disclosed, for example to modify a frataxin gene and to treat a subject suffering from Friedreich ataxia.
Claims
1-60. (canceled)
61. A method of modifying within a cell, a frataxin (FXN) gene comprising a plurality of GAA trinucleotide repeats in an intron of said gene, the method comprising: (a) introducing a first cut within the intron of the FXN gene creating a first intron end, wherein said first cut is located upstream of or within the plurality of GAA trinucleotide repeats; (b) introducing a second cut within the intron of the FXN gene creating a second intron end, wherein said second cut is located downstream of or within the plurality of GAA trinucleotide repeats; wherein upon ligation of said first and second intron ends, said FXN gene is modified and some or all of said GAA trinucleotide repeats are removed.
62. The method of claim 61, wherein the first and second cuts are introduced by providing a cell with (i) at least one CRISPR nuclease; and (ii) a pair of gRNAs consisting of (a) a first gRNA which binds to a polynucleotide sequence within the intron of the FXN gene located upstream of the plurality of GAA trinucleotide repeats for introducing a first cut; (b) a second gRNA which binds to a polynucleotide sequence within the intron of the FXN gene located downstream of the plurality of GAA trinucleotide repeats for introducing the second cut.
63. The method of claim 61, wherein the FXN gene comprises at least 70 GAA trinucleotide repeats within the intron.
64. The method of claim 61, wherein said first cut is located for the removal of between 30 and 506 nucleotides upstream of the GAA trinucleotide repeats.
65. The method of claim 61, wherein the second cut is located for the removal of between 20 and 478 nucleotides downstream of the GAA trinucleotide repeats.
66. The method of claim 61, wherein: the first gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 6201-6633 and the second gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 7078-7161; the first gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 6201-6633 and the second gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 7078-7161; or the first gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 6201-6633 and the second gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 7078-7161; wherein the nucleotide positions are given with respect to the FXN polynucleotide gene sequence set forth in GenBank NG_00845 (SEQ ID NO: 4).
67. The method of claim 61, wherein: the first gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 6594-6633 and the second gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 6973-7163; the first gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 6594-6633 and the second gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 6973-7163; or the first gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 6594-6633 and the second gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 6973-7163; wherein the nucleotide positions are given with respect to the FXN polynucleotide gene sequence set forth in GenBank NG_00845 (SEQ ID NO: 4).
68. A gRNA pair for deleting a plurality of endogenous GAA trinucleotide repeats within an intron of a FXN gene within a cell, wherein said pair consists of a first gRNA and a second gRNA, wherein (a) said first gRNA binds to a first polynucleotide sequence within the intron of the FXN gene located upstream of or within the plurality of GAA trinucleotide repeats for introducing a first cut; and (b) said second gRNA binds to a second polynucleotide sequence within the intron of the FXN gene located downstream of or within the plurality of GAA trinucleotide repeats for introducing a second cut downstream from the first cut.
69. The gRNA pair of claim 68, wherein said first cut removes between 30 and 506 nucleotides upstream of the GAA trinucleotide repeats.
70. The gRNA pair of claim 68, wherein the second cut removes between 20 and 478 nucleotides downstream of the GAA trinucleotide repeats.
71. The gRNA pair of claim 68, wherein: the first gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 6201-6633 and the second gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 7078-7161; the first gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 6201-6633 and the second gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 7078-7161; or the first gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 6201-6633 and the second gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 7078-7161; wherein the nucleotide positions are given with respect to the FXN polynucleotide gene sequence set forth in GenBank NG_00845 (SEQ ID NO: 4).
72. The gRNA pair of claim 68, wherein: the first gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 6594-6633 and the second gRNA has a target sequence adjacent to a NGG PAM nucleotide sequence located within nts 6973-7163; the first gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 6594-6633 and the second gRNA has a target sequence adjacent to a NNGRRT PAM nucleotide sequence located within nts 6973-7163; or the first gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 6594-6633 and the second gRNA has a target sequence adjacent to a NNNNRYAC PAM nucleotide sequence located within nts 6973-7163; wherein the nucleotide positions are given with respect to the FXN polynucleotide gene sequence set forth in GenBank NG_00845 (SEQ ID NO: 4).
73. A nucleic acid comprising one or more polynucleotide sequences encoding one or both members of the gRNA pair of claim 68.
74. The nucleic acid of claim 73, further comprising a sequence encoding a CRISPR nuclease.
75. A nucleic acid comprising a modified FXN gene comprising ligated first and second intron ends as defined in claim 61.
76. A vector comprising the nucleic acid of claim 73.
77. A combination of vectors comprising: a gRNA vector comprising a first nucleic acid comprising a polynucleotide sequence encoding the first gRNA and a second nucleic acid comprising a polynucleotide sequence encoding the second gRNA, of the gRNA pair of claim 68; and a CRISPR nuclease vector comprising a third nucleic acid comprising a polynucleotide sequence encoding one or more CRISPR nucleases.
78. A cell comprising the vector of claim 76.
79. A method for treating Friedreich ataxia in a subject, comprising modifying a FXN gene and increasing FXN expression within a cell of said subject according to the method of claim 61.
80. A method for treating Friedreich ataxia in a subject, comprising contacting a cell of the subject with (i)(a) the gRNA pair of claim 68 or one or more nucleic acids encoding said gRNA pair and (b) a CRISPR nuclease polypeptide or a nucleic acid encoding a CRISPR nuclease polypeptide.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0061] In the appended drawings:
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS
[0080] ciRNAs
[0081] In order to cut DNA at a specific site, CRISPR nucleases require the presence of a gRNA and a protospacer adjacent motif (PAM), which immediately follows (or precedes) the gRNA target sequence in the targeted polynucleotide gene sequence. The PAM is located at one end (i.e., the 3 end or 5 end) of the gRNA target sequence but is not part of the gRNA guide sequence. Different CRISPR nucleases require a different PAM. Accordingly, selection of a specific polynucleotide target sequence (e.g., in the FXN gene nucleic acid sequence) by a gRNA is generally based on the CRISPR nuclease used. The PAM for the Streptococcus pyogenes Cas9 CRISPR system is 5-NRG-3, where R is either A or G, and characterizes the specificity of this system in human cells. The S. pyogenes Type II system naturally prefers to use an NGG sequence, where N can be any nucleotide, but also accepts other PAM sequences, such as NAG in engineered systems. The PAM of S. aureus is NNGRR (or NNGRRT wherein R is A or G). Similarly, the Cas9 derived from Neisseria meningitides (NmCas9) normally has a native PAM of NNNNGATT, but has activity across a variety of PAMs, including a highly degenerate NNNNGNNN PAM. Another Example is the Cas9 derived from Campylobacter jejuni (CjCas9) which is advantageously small and which generally recognizes a NNNNRYAC PAM (but also a NNNVRYAC or NNNNACAC), where N can be any nucleotide, R is a purine (G or A), and Y is a pyrimidine (T or C). CjCas9 also recognizes NNNVRYAC or NNNNACAC PAM, where V is A, G or C.
[0082] In a preferred embodiment, the PAM for a Cas9 or Cpf1 protein used in accordance with the present invention is a NGG (SpCas9), a NNGRRT (SaCas9), a NNNNRYAC, NNNVRYAC or NNNNACAC (CjCas9) or TTTN (AsCpf1 and LbCpf1) nucleotide-sequence. Table 1 below provides a list of non-limiting examples of CRISPR/nuclease systems with their respective PAM sequences.
TABLE-US-00001 TABLE 1 Non-exhaustive list of CRISPR-nuclease systems from different species (see. Mohanraju, P. et al. (58); Shmakov, S et al. (59); Zetsche, B. et al (60); and Shah et al., (63)). Also included are examples of engineered variants recognizing alternative PAM sequences (see Kleinstiver, BP. et al., (61) and (62)). CRISPR nuclease/subtype PAM Sequence Cut site Streptococcus pyogenes (SP); SpCas9 NGG + NAG (in 3) Blunt end; 3-4bp upstream of the PAM (subtype II) sequence SpCas9 D1135E variant (subtype II) NGG (in 3, reduced NAG binding) Blunt end; 3-4bp upstream of the PAM sequence SpCas9 VRER variant (subtype II) NGCG (in 3) Blunt end; 3-4bp upstream of the PAM sequence SpCas9 EQR variant (subtype II) NGAG (in 3) Blunt end; 3-4bp upstream of the PAM sequence SpCas9 VQR variant (subtype II) NGAN or NGNG (in 3) Blunt end; 3-4bp upstream of the PAM sequence Staphylococcus aureus (SA); SaCas9 NNGRRT or NNGRR(N), (in 3) (R = A Blunt end; 3-4bp upstream of the PAM (subtype II) or G) sequence SaCas9 KKH variant (subtype II) NNNRRT (in 3) (R = A or G) Blunt end; 3-4bp upstream of the PAM sequence Neisseria meningitidis (NM) NNNNGATT (in 3) Blunt end; 3-4bp upstream of the PAM sequence AsCpf1 TTTN (in 5) 5 nucleotide 5 overhang 18-23 bases away from the PAM. LbCpf1 TTTN (in 5) 5 nucleotide 5 overhang 18-23 bases away from the PAM. Campylobacter jejuni (Cj) NNNNRYAC, NNNVRYAC, or Blunt end; 3-4bp upstream of the PAM NNNNACAC (in 3) sequence
[0083] Other non-limiting examples of known CRISPR nucleases that may be used include CRISPR nucleases from Streptococcus thermophilus (subtype II-A, PAM: NNAGAAW (in 3) (W=A or T); Treponema denticola (PAM: NAAAAC (in 3); Streptococcus agalactiae (PAM: NGG (in 3)); Sulfolobus solfataricus (subtype I-Al, PAM: CNN); Sulfolobus solfataricus (subtype I-A2, PAM: TCN); Haloquadratum walsbyi (subtype I-B, PAM: TTC), Escherichia coli (subtype I-E, PAM: AWG); Escherichia coli (subtype I-F; PAM: CC); and Pseudomonas aeruginosa (subtype I-F, PAM: CC).
[0084] As used herein, the expression gRNA (which is used interchangeably with sgRNA) refers to a guide RNA which in an embodiment is a fusion between the gRNA guide sequence (or CRISPR targeting RNA or crRNA) and the CRISPR nuclease recognition sequence (tracrRNA). It provides both targeting specificity and scaffolding/binding ability for the CRISPR nuclease of the present invention. Alternatively, the gRNA may be provided as two separate entities (a tracrRNA and a gRNA guide sequence (i.e., target-specific sequence/crRNA)). gRNAs of the present invention do not exist in nature, i.e., they are non-naturally occurring nucleic acid(s).
[0085] A target region, target sequence or protospacer in the context of gRNAs and CRISPR system of the present invention are used herein interchangeably and refers to the region of the target gene which is targeted by the CRISPR/nuclease-based system, without the PAM. It refers to the sequence corresponding to the nucleotides that are adjacent to the PAM (i.e., in 5 or 3 of the PAM, depending of the CRISPR nuclease) in the genomic DNA. It is the DNA sequence that is included into a gRNA expression construct (e.g., vector/plasmid/AW). The CRISPR/nuclease-based system may include at least one (i.e., one or more) gRNAs, wherein each gRNA targets a different DNA sequence on the target gene. The target DNA sequences may be overlapping. The target sequence or protospacer is followed or preceded by a PAM sequence at an end (3 or 5 depending on the CRISPR nuclease used) of the protospacer. Generally, the target sequence is immediately adjacent (i.e., is contiguous) to the PAM sequence (it is located on the 5 end of the PAM for SpCas9-like nuclease and at the 3 end for Cpf1-like nuclease).
[0086] As used herein, the expression gRNA guide sequence refers to the corresponding RNA sequence of the gRNA target sequence. Therefore, it is the RNA sequence equivalent of the protospacer on the target polynucleotide gene sequence. It does not include the corresponding PAM sequence in the genomic DNA. It is the sequence that confers target specificity. The gRNA guide sequence is preferably linked to a CRISPR nuclease recognition sequence (transactivating CRISPR RNA, i.e., tracrRNA, scaffolding RNA) which binds to the nuclease (e.g., Cas9/Cpf1). Although it is advantageous that the tracrRNA sequence and gRNA guide sequence be provided as a single RNA, it is also possible to provide the tracrRNA as a separate entity. The gRNA guide sequence recognizes and binds to the targeted gene of interest. It hybridizes with (i.e., is complementary to) the opposite strand of a target gene sequence, which comprises the PAM (i.e., it hybridizes with the DNA strand opposite to the PAM). As noted above, the PAM is the nucleic acid sequence, that immediately follows (is contiguous to) the target sequence in the target polynucleotide but is not in the gRNA.
[0087] A CRISPR nuclease recognition sequence (e.g., Cas9/recognition sequence) refers to the portion of the gRNA guide sequence that binds to the CRISPR nuclease (tracrRNA, scaffolding RNA or other recognition sequence (e.g., SEQ ID NOs: 91 (SpCas9), 93 (SaCas9), 154, and 94 (Cpf1)). It leads the CRISPR nuclease to the target sequence so that it may bind and cut the target nucleic acid. It is adjacent the gRNA guide sequence (in 3 (e.g., Cas9) or 5 (Cpf1) depending on the CRISPR nuclease used). In embodiments, the CRISPR nuclease recognition sequence is a Cas9 recognition sequence having at least 65, 74, 76 or 77 nucleotides. In embodiments, the CRISPR nuclease recognition sequence is a Cpf1 recognition sequence (5 direct repeat) having about 20 nucleotides. In particular embodiments, the Cas9 recognition sequence (gRNA scaffold sequence derived from crRNA and tracrRNA-) comprises (or consists of) the sequence as set forth in SEQ ID NO: 92, 93 or 154. The gRNA of the present invention may comprise any variant of this sequence, provided that it allows for the binding of the CRISPR nuclease protein of the present invention to the FXN gene. In embodiments, the CRISPR nuclease (e.g., Cas9 or Cpf1) recognition sequence is a CRISPR nuclease recognition sequence having at least 65 nucleotides. In embodiments, the CRISPR nuclease recognition sequence is a CRISPR nuclease recognition sequence having at least 74, 76 or 77 nucleotides.
[0088] As noted above not all CRISPR nucleases require a tracrRNA to function. Cpf1 is a single crRNA-guided endonuclease. Unlike Cas9, which requires both an RNA guide sequence (crRNA) and a tracrRNA (or a fusion or both crRNA and tracrRNA) to mediate interference, Cpf1 processes crRNA arrays independent of tracrRNA, and Cpf1-crRNA complexes alone cleave target DNA molecules, without the requirement for any additional RNA species (see Zetsche et al. (60)). Therefore, in the case of Cpf1, the CRISPR recognition sequence only comprises the conserved portion of the crRNA (i.e., without the target sequence).
[0089] In embodiments, the gRNA may comprise a G at the 5 end of its polynucleotide sequence. The presence of a G in 5 is preferred when the gRNA is expressed under the control of the U6 promoter (Koo T. et al. (65)). The CRISPR/nuclease system of the present invention may use gRNAs of varying lengths. The gRNA may comprise a gRNA guide sequence of at least 10 nts, at least 11 nts, at least a 12 nts, at least a 13 nts, at least a 14 nts, at least a 15 nts, at least a 16 nts, at least a 17 nts, at least a 18 nts, at least a 19 nts, at least a 20 nts, at least a 21 nts, at least a 22 nts, at least a 23 nts, at least a 24 nts, at least a 25 nts, at least a 30 nts, or at least a 35 nts of a target sequence in the FXN gene (such target sequence is followed or preceded by a PAM in the FXN gene but is not part of the gRNA). In embodiments, the gRNA guide sequence or gRNA target sequence may be least 10 nucleotides long, preferably 10-40 nts long (e.g., 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39 or 40 nts long), more preferably 17-30 nts long, more preferably 17-22 nucleotides long. In embodiments, the gRNA guide sequence is 10-40, 10-30, 12-30, 15-30, 18-30, or 10-22 nucleotides long. In embodiments, the PAM sequence is NGG, where N can be any nucleotide. In embodiments, the PAM sequence is TTTN, where N can be any nucleotide. In embodiments, the PAM sequence is NNNNRYAC, NNNVRYAC or NNNNACAC, where N can be any nucleotide, V is A, G or C, R is a purine (G or A), and Y is a pyrimidine (T or C). gRNAs may target any region of a target gene (e.g., FXN) which is immediately adjacent (contiguous, adjoining, in 5 or 3) to a PAM (e.g., NGG/TTTN/NNNNRYAC/NNNVRYAC/NNNNACAC, or CCN/NAAA/GTRYNNNN/GTRYBNNN/GTGTNNNN, for a PAM that would be located on the opposite strand) sequence. In embodiments, the gRNA of the present invention has a target sequence which is located (wholly or partly) in an exon (the gRNA guide sequence consists of the RNA sequence of the target (DNA) sequence which is located in an exon) but the cut is preferably in an intron. In embodiments, the gRNA of the present invention has a target sequence which is located in an intron (the gRNA guide sequence consists of the RNA sequence of the target (DNA) sequence which is located in an intron). In embodiments, the gRNA may target any region (sequence) which is followed (or preceded, depending on the CRISPR nuclease used) by a PAM in the FXN gene which may be used to restore or increase FXN expression level and/or activity.
[0090] The number of gRNAs administered to or expressed in a target cell in accordance with the methods of the present invention may be at least 1 gRNA, at least 2 gRNAs, at least 3 gRNAs at least 4 gRNAs, at least 5 gRNAs, at least 6 gRNAs, at least 7 gRNAs, at least 8 gRNAs, at least 9 gRNAs, at least 10 gRNAs, at least 11 gRNAs, at least 12 gRNAs, at least 13 gRNAs, at least 14 gRNAs, at least 15 gRNAs, at least 16 gRNAs, at least 17 gRNAs, or at least 18 gRNAs. The number of gRNAs administered to or expressed in a cell may be between at least 1 gRNA and 15 gRNAs, 1 gRNA and least 10 gRNAs, 1 gRNA and 8 gRNAs, 1 gRNA and 6 gRNAs, 1 gRNA and 4 gRNAs, 1 gRNA and gRNAs, 2 gRNA and 5 gRNAs, or 2 gRNAs and 3 gRNAs.
[0091] Although a perfect match between the gRNA guide sequence and the DNA sequence on the targeted gene is preferred, a mismatch between a gRNA guide sequence and target sequence on the gene sequence of interest is also permitted as along as it still allows hybridization of the gRNA with the complementary strand of the gRNA target polynucleotide sequence on the targeted gene. A seed sequence of between 8-12 consecutive nucleotides in the gRNA, which perfectly matches a corresponding portion of the gRNA target sequence is preferred for proper recognition of the target sequence. The remainder of the guide sequence may comprise one or more mismatches. In general, gRNA activity is inversely correlated with the number of mismatches. Preferably, the gRNA of the present invention comprises 7 mismatches, 6 mismatches, 5 mismatches, 4 mismatches, 3 mismatches, more preferably 2 mismatches, or less, and even more preferably no mismatch, with the corresponding gRNA target gene sequence (less the PAM). Preferably, the gRNA nucleic acid sequence is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% and 99% identical to the gRNA target polynucleotide sequence in the gene of interest (e.g., FXN). Of course, the smaller the number of nucleotides in the gRNA guide sequence the smaller the number of mismatches tolerated. The binding affinity is thought to depend on the sum of matching gRNA-DNA combinations.
[0092] Any gRNA guide sequence can be selected in the target gene, as long as it allows introducing at the proper location, the desired modification(s) (e.g., spontaneous insertions/deletions or selected target modification(s) using one or more patch/donor sequence(s)). Accordingly, the gRNA guide sequence or target sequence of the present invention may be in coding or non-coding regions of the FXN gene (i.e., exons or introns, preferably intron 1). Of course the complementary strand of the sequence (e.g., reverse complement of SEQ ID NO: 4) may alternatively and equally be used to identify proper PAM and gRNA target/guide sequences.
CRISPR Nucleases
[0093] The recombinant CRISPR nuclease that may be used in accordance with the present invention is i) derived from a naturally occurring Cas or related nuclease (e.g., Cpf1); and ii) has a nuclease activity to introduce a DSB in cellular DNA when in the presence of appropriate gRNA(s). Thus, as used herein, the term CRISPR nuclease refers to a recombinant protein which is derived from a naturally occurring nuclease which has nuclease activity and which functions with the gRNAs of the present invention to introduce DSBs in the targets of interest, e.g., the FXN gene. In embodiments, the CRISPR nuclease is CjCas9, SpCas9 or SaCas9. In embodiments, the CRISPR nuclease is Cpf1. In a further embodiment, the Cas protein is a dCas9 protein fused with a dimerization-dependant FoKI nuclease domain. Exemplary CRISPR nucleases that may be used in accordance with the present invention are provided in Table 1 above. A variant of Cas9 can be a Cas9 nuclease that is obtained by protein engineering or by random mutagenesis (i.e., is non-naturally occurring). Such Cas9 variants remain functional and may be obtained by mutations (deletions, insertions and/or substitutions) of the amino acid sequence of a naturally occurring Cas9, such as that of S. pyogenes.
[0094] CRISPR nucleases such as Cas9 nucleases cut 3-4bp upstream of the PAM sequence. CRISPR nucleases such as Cpf1 on the other hand, generate a 5 overhang. The cut occurs 19 bp after the PAM on the targeted (+) strand and 23 bp on the opposite strand (62). Table 1 above provides the PAM sequence and cut site for exemplary CRISPR nucleases. There can be some off-target DSBs using wildtype Cas9. The degree of off-target effects depends on a number of factors, including: how closely homologous the off-target sites are compared to the on-target site, the specific site sequence, and the concentration of nuclease and guide RNA (gRNA). These considerations only matter if the PAM sequence is immediately adjacent to the nearly homologous target sites. The mere presence of additional PAM sequences should not be sufficient to generate off target DSBs; there needs to be extensive homology of the protospacer followed or preceded by PAM.
Optimization of Codon Degeneracy
[0095] Because CRISPR nuclease proteins are (or are derived from) proteins normally expressed in bacteria, it may be advantageous to modify their nucleic acid sequences for optimal expression in eukaryotic cells (e.g., mammalian cells) when designing and preparing CRISPR nuclease recombinant proteins.
[0096] Accordingly, the following codon chart (Table 2) may be used, in a site-directed mutagenic scheme, to produce nucleic acids encoding the same or slightly different amino acid sequences of a given nucleic acid:
TABLE-US-00002 TABLE2 Codonsencodingthesameaminoacid AminoAcids Codons Alanine Ala A GCAGCCGCGGCU Cysteine Cys C UGCUGU Asparticacid Asp D GACGAU Glutamicacid Glu E GAAGAG Phenylalanine Phe F UUCUUU Glycine Gly G GGAGGCGGGGGU Histidine His H CACCAU Isoleucine Ile I AUAAUGAUU Lysine Lys K AAAAAG Leucine Leu L UUAUUGCUACUCCUGCUU Methionine Met M AUG Asparagine Asn N AACAAU Proline Pro P CCACCCCCGCCU Glutamine Gln Q CAACAG Arginine Arg R AGAAGGCGACGCCGGCGU Serine Ser S AGCAGUUCAUCCUCGUCU Threonine Thr T ACAACCACGACU Valine Val V GUAGUCGUGGUU Tryptophan Trp W UGG Tyrosine Tyr Y UACUAU
Methods of Modifying Frataxin
[0097] The present invention provides a method of modifying within a cell, a frataxin (FXN) gene comprising a GAA repeat expansion. The method uses gRNAs in combination with a CRISPR nuclease and allow to introduce cuts (e.g., double stranded breaks with blunt ends or 5 overhang) into DNA at specific sites. The cuts introduced in the gene FXN gene in accordance with the present invention allow to remove all or some of the endogenous sequence comprising the abnormal number of GAA trinucleotide repeats in intron 1 which leads to reduced FXN protein expression and causes FRDA.
[0098] Accordingly, in embodiments, methods of the present invention comprise introducing cuts within intron 1 of the FXN gene comprising an endogenous abnormal number of GAA trinucleotide repeats causing FRDA (e.g., 70 or more, 100 or more, 150 or more, 300 or more, 500 or more 800 or more or 1000 or more GAA repeats). In embodiments, the position of each cut is selected from cuts set forth in Table 3, 5, 6 or 7.
[0099] Although the entire intron 1 could be removed in accordance with the present invention, preferably, only a portion of the nucleic acid sequence of intron 1 is deleted on each side of the GAA repeats. In embodiments, a first cut is introduced within about 2000, preferably about 1000 and more preferably within about 550 nts from the first nucleotide of the GAA repeats (e.g., in 5 or upstream of the beginning of the GAA repeats). Similarly, in embodiments, a second cut is introduced within about 2000, preferably about 1000 and more preferably within about 550 nts from the last nucleotide of the GAA repeats (e.g., in 3 or downstream of the beginning of the GAA repeat sequence). In embodiments, the first and second cuts are made as close as possible from each end of the GAA repeats so as to remove the smallest number of nucleotides from intron 1 (e.g., within 200, within 150, within 124, within 100, within 75, within 50, within 30, within 35, or within 20 nucleotides or less from each end of the GAA repeat sequence).
[0100] Under certain conditions, gRNAs of the present invention may cut within the GAA repeat expansion, such that a portion of the GAA repeat expansion may be removed (i.e., a subset of the GAA repeats). For example, if a target sequence of a gRNA is sufficiently close to (or overlaps) the 5 or 3 end of the GAA repeat expansion, the cut introduced by the CRISPR nuclease may be within the GAA repeat expansion. As known in the art, CRISPR nuclease cuts in 5 or 3 of the PAM. The distance of the cut from the PAM depends on the CRISPR nuclease used. Under these conditions, introduction of cuts within the FXN gene followed by NHEJ may generate a modified FXN gene in which a portion of the GAA repeats remain. The presence of a small number of GAA repeats (e.g., less than 70) is known to not significantly affect FXN expression. Therefore modified FRDA cells in which some GAA repeats have been removed and some GAA repeats remain would nevertheless express FXN to a level above the base level of FXN expression in the unmodified FRDA cells.
[0101] Accordingly, in embodiments, the first cut of the first gRNA is within the GAA repeat expansion, preferably near the 5 end of the GAA repeat expansion. In embodiments, the second cut of the second gRNA is within the GAA repeat expansion, preferably near the 3 end of the GAA repeat expansion, i.e., downstream from the first cut. In embodiments, ligation of the first and second intron ends in accordance with methods of the present invention generates a modified FXN gene having 150 or fewer GAA repeats. Preferably, ligation of the first and second intron ends in accordance with methods of the present invention generates a modified FXN gene having 70 or fewer GAA repeats. In preferred embodiments, methods of the present invention allow removal of the entire GAA repeat expansion, i.e. all the GAA repeats, in intron 1 of the FXN gene of FRDA cells. Preferably, ligation of the first and second intron ends in accordance with methods of the present invention occurs by non-homologous end joining (NHEJ).
[0102] In embodiments, gRNAs and CRISPR nucleases which are used in accordance with the present invention allow removal at least 10, at least 50, at least 100, at least 200, at least 300, at least 500, at least 600, at least 700, at least 800 GAA repeats in the FXN gene of a cell. In embodiments, gRNAs and CRISPR nucleases which are used in accordance with the present invention allow removal of at least 50%, 60%, 70%, 80% or 90% of the GAA repeats in the FXN gene of a cell. Preferably, gRNAs and CRISPR nucleases of the present invention a portion of the GAA repeat extension in FRDA cells which is sufficient to increase FXN expression above the base level of FXN expression in the unmodified FRDA cells. Preferably, the complete GAA repeat expansion within an intron of the FXN gene.
[0103] gRNAs of the present invention are preferably between 17 and 20 nucleotides long. Non-limiting examples of gRNA target sequences are provided in Tables 5, 6 and 7. Thus, gRNAs having a target sequence corresponding to at least 17 consecutive nucleotides of intron 1 of the FXN gene or of a gRNA target sequence listed in Tables 5, 6 and 7 and genetic variants thereof, can be used in accordance with the present invention. Of course the target sequence should also be suitably positioned with respect to the PAM of the selected CRISPR nuclease.
[0104] In embodiments, gRNAs of the present invention comprise a target sequence which is set forth in Table 5, 6 or 7. In particular embodiments, the polynucleotide sequence removed on each side of the GAA repeat expansion in the FXN gene comprises (or consists of) polynucleotide sequences set forth in SEQ ID NOs: 100-126 and 158-167 (see also Tables 5, 6 and 7).
[0105] Although any suitable combinations of gRNAs may be used in accordance with the present invention, Table 3 below shows exemplary combination of gRNAs allowing to remove GAA trinucleotide repeats from intron 1 of the FXN gene.
TABLE-US-00003 TABLE 3 Sequences removed in intron 1 of FXN gene using Exemplary combinations of gRNAs. Total of nts # nts removed #nts removed in removed apart gRNA in 5 of GAA 3 of GAA from GAA combination repeats repeats repeats AC1AC6 159 412 571 AC2AC6 93 412 505 AC3AC6 30 412 442 C1C11 142 20 162 C2C11 136 20 156 C1C20 142 403 545 C2C20 136 403 539 C15C18 506 478 984 C15C20 506 403 909 C16C18 457 478 935 C16C20 457 403 860 Cj1Cj6 321 323 644 Cj1Cj7 321 241 562 Cj1Cj8 321 286 607 Cj1Cj9 321 302 623 Cj1Cj10 321 346 667 Cj2Cj6 310 323 633 Cj2Cj7 310 241 551 Cj2Cj8 310 286 596 Cj2Cj9 310 302 612 Cj2Cj10 310 346 656 Cj3Cj6 264 323 587 Cj3Cj7 264 241 505 Cj3Cj8 264 286 550 Cj3Cj9 264 302 566 Cj3Cj10 264 346 610 Cj4Cj6 220 323 543 Cj4Cj7 220 241 461 Cj4Cj8 220 286 506 Cj4Cj9 220 302 522 Cj4Cj10 220 346 566 Cj5Cj6 201 323 524 Cj5Cj7 201 241 442 Cj5Cj8 201 286 487 Cj5Cj9 201 302 503 Cj5Cj10 201 346 547
[0106] In embodiments, the first cut in the FXN gene is within about 625 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 519 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 506 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 457 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 178 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 159 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 142 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 136 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 93 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 81 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 76 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 58 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence). In embodiments, the first cut in the FXN gene is within about 30 nucleotides from the end of the GAA repeats (i.e., upstream or 5 from the first nucleotide of the GAA repeat sequence).
[0107] In embodiments, the second cut in the FXN gene is within about 597 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 493 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 478 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 412 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 403 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 126 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 114 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 86 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 50 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 49 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 22 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence). In embodiments, the second cut in the FXN gene is within about 20 nucleotides from the end of the GAA repeats (i.e., downstream or 3 from the last nucleotide of the GAA repeat sequence).
[0108] In embodiments, gRNAs of the present invention have a target sequence adjacent to a NGG PAM nucleotide sequence in intron 1 of the FXN gene corresponding to the following nucleotide positions: a) nts 6579-6577; (b) nts 6592-6594; (c) nts 6543-6541; (d) nts 6670-6672; (e) nts 6645-6643; (f) nts 6647-6649; (g) nts 6761-6759; (h) nts 6832-6834; (i) nts 6888-6886; (j) nts 6853-6851; (k) nts 6766-6768; (I) nts 6872-6874; (m) nts 6202-6200; (n) nts 6103-6105; (o) nts 6221-6223; (p) nts 6264-6262; (q) nts 7232-7230; (r) nts 7324-7326; (s) nts 7336-7334; or (t) nts 7142-7141. In embodiments, gRNAs of the present invention have a target sequence adjacent to a NNGRRT PAM nucleotide sequence in intron 1 of the FXN gene corresponding to the following nucleotide positions: a) nts 6569-6574; (b) nts 6635-6640; (c) nts 6691-6686; (d) nts 6789-6784; (e) nts 7078-7073; or (f) nts 7158-7163. All nucleotides positions on the frataxin gene described herein are with respect to nucleotides comprised in intron 1 of the frataxin gene set forth in GenBank NG_00845 (SEQ ID NO: 4).
[0109] In embodiments, methods of the present invention generate a modified FXN gene (in which the GAA trinucleotide repeats have been removed) comprising in intron 1 a modified polynucleotide sequence as set forth in
[0110] As any other nucleic acid gene sequence, endogenous sequence variations in intron 1 of the FXN gene exist between individuals (allelic/genetic variants). Such variant nucleic acid sequences are retrievable from well-known databases and websites such as NCBI, Ensembl, Vega, OMIM and others (e.g., ClinVar and dbVar databases and NCBI variation viewer. See for example www.ncbi.nlm.nih.gov/gene/2395#variation). Accordingly, gRNAs of the present invention target any naturally occurring genetic variants of the FXN gene which can be found in a population. Thus, as used herein, the term frataxin (FXN) gene encompasses any frataxin gene found within a cell and includes variants (e.g., allelic/genetic variants) of the frataxin gene polynucleotide sequence in SEQ ID NO: 4.
[0111] As indicated above, nucleic acids encoding gRNAs and nucleases (e.g., Cas9 or Cpf1) of the present invention may be delivered into cells using one or more various vectors such as viral vectors. Accordingly, preferably, the above-mentioned vector is a viral vector for introducing the gRNA and/or nuclease of the present invention in a target cell. Non-limiting examples of viral vectors include retrovirus, lentivirus, Herpes virus, adenovirus or Adeno Associated Virus, as well known in the art.
[0112] The modified AAV vector preferably targets one or more cell types affected in FRDA subjects. In an embodiment, the cell type is a muscle cell, in a further embodiment, a myoblast. Accordingly, the modified AAV vector may have enhanced cardiac, skeletal muscle, neuronal, liver, and/or pancreatic tissue (Langerhans cells) tropism. The modified AAV vector may be capable of delivering and expressing the at least one gRNA and nuclease of the present invention in the cell of a mammal. For example, the modified AAV vector may be an AAV-SASTG vector (Piacentino et al. (2012) Human Gene Therapy 23:635-646). The modified AAV vector may deliver gRNAs and nucleases to neurons, skeletal and cardiac muscle, and/or pancreas (Langerhans cells) in vivo. The modified AAV vector may be based on one or more of several capsid types, including AAVI, AAV2, AAVS, AAV6, AAV8, and AAV9. The modified AAV vector may be based on AAV2 pseudotype with alternative muscle-tropic AAV capsids, such as AAV2/1, AAV2/6, AAV2/7, AAV2/8, AAV2/9, AAV2.5 and AAV/SASTG vectors that efficiently transduce skeletal muscle or cardiac muscle by systemic and local delivery. In an embodiment, the modified AAV vector is a AAV-DJ. In an embodiment, the modified AAV vector is a AAV-DJ8 vector. In an embodiment, the modified AAV vector is a AAV2-DJ8 vector. In an embodiment, the modified AAV vector is a AAV-PHP.B vector. In an embodiment, the modified AAV vector is a AAV-PHP.B, AAV-9 or AAV-DJ8 (PHP.B: PMID: 26829320, PMID: 27867348; AAV DJ-8: www.cellbiolabs.com/news/aav-helper-free-expression-systems-aav-dj-aav-dj8, http://www.cellbiolabs.com/aav-expression-and-packaging; www.cellbiolabs.com/scaav-dj8-helper-free-complete-expression-systems; and AAV9: PMID: 27637390, PMID: 16713360).
[0113] In another aspect, the present invention provides a composition (e.g., a pharmaceutical composition) comprising the above-mentioned gRNA and/or CRISPR nuclease (e.g., Cas9), or nucleic acid(s) encoding same or vector(s) comprising such nucleic acid(s). In an embodiment, the composition further comprises one or more pharmaceutically acceptable or biologically acceptable carriers, excipients, and/or diluents.
[0114] As used herein, pharmaceutically acceptable refers to materials characterized by the absence of (or limited) toxic or adverse biological effects in vivo. It refers to those compounds, compositions, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with the biological fluids and/or tissues and/or organs of a subject (e.g., human, animal) without excessive toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio. Biologically acceptable refers to materials characterized by the absence of (or limited) toxic or adverse biological effects in biological systems, e.g., in vitro or in vivo, i.e., compatible for use with living cells without excessive toxicity.
[0115] The present invention further provides a kit or package comprising at least one container means having disposed therein at least one of the above-mentioned gRNAs, nucleases, vectors, cells, targeting systems, combinations and/or compositions. In an embodiment, the kit or package further comprises instructions for removing the GAA repeat expansion in the FXN gene in a cell or for treatment of FRDA in a subject.
Definitions
[0116] In order to provide clear and consistent understanding of the terms in the instant application, the following definitions are provided.
[0117] The articles a, an and the are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article.
[0118] As used in this specification and claim(s), the words comprising (and any form of comprising, such as comprise and comprises), having (and any form of having, such as have and has), including (and any form of including, such as includes and include) or containing (and any form of containing, such as contains and contain) are inclusive or open-ended and do not exclude additional, un-recited elements or method steps and are used interchangeably with, the phrases including but not limited to and comprising but not limited to.
[0119] For the recitation of numeric ranges herein, each intervening number there between with the same degree of precision is explicitly contemplated. For example, for the range of 18-20, the numbers 18, 19 and 20 are explicitly contemplated, and for the range 6.0-7.0, the number 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, and 7.0 are explicitly contemplated. The terms such as are used herein to mean, and is used interchangeably with, the phrase such as but not limited to.
[0120] Unless otherwise defined herein, scientific and technical terms used in connection with the present disclosure shall have the meanings that are commonly understood by those of ordinary skill in the art. For example, any nomenclature used in connection with, and techniques of, cell and tissue culture, molecular biology, immunology, microbiology, genetics and protein and nucleic acid chemistry and hybridization described herein are those that are well known and commonly used in the art. The meaning and scope of the terms should be clear; in the event however of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular.
[0121] Practice of the methods, as well as preparation and use of the products and compositions disclosed herein employ, unless otherwise indicated, conventional techniques in molecular biology, biochemistry, chromatin structure and analysis, computational chemistry, cell culture, recombinant DNA and related fields as are within the skill of the art. These techniques are fully explained in the literature. See, for example, Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL, Second edition, Cold Spring Harbor Laboratory Press, 1989 and Third edition, 2001; Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, 1987 and periodic updates; the series METHODS IN ENZYMOLOGY, Academic Press, San Diego; Wolffe, CHROMATIN STRUCTURE AND FUNCTION, Third edition, Academic Press, San Diego, 1998; METHODS IN ENZYMOLOGY, Vol. 304, Chromatin (P. M. Wassarman and A. P. Wolffe, eds.), Academic Press, San Diego, 1999; and METHODS IN MOLECULAR BIOLOGY, Vol. 119, Chromatin Protocols (P. B. Becker, ed.) Humana Press, Totowa, 1999.
[0122] The terms nucleic acid, polynucleotide, and oligonucleotide are used interchangeably and refer to a deoxyribonucleotide or ribonucleotide polymer, in linear or circular conformation, and in either single- or double-stranded form. For the purposes of the present disclosure, these terms are not to be construed as limiting with respect to the length of a polymer. The terms can encompass known analogues of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties (e.g., phosphorothioate backbones). In general, an analogue of a particular nucleotide has the same base-pairing specificity; i.e., an analogue of A will base-pair with T.
[0123] The terms polypeptide, peptide and protein are used interchangeably to refer to a polymer of amino acid residues. The term also applies to amino acid polymers in which one or more amino acids are chemical analogues or modified derivatives of corresponding naturally-occurring amino acids.
[0124] Coding sequence or encoding nucleic acid as used herein means the nucleic acids (RNA or DNA molecule) that comprise a nucleotide sequence which encodes a protein or gRNA. The coding sequence can further include initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of an individual or mammal to which the nucleic acid is administered. The coding sequence may be codon optimized.
[0125] Complement or complementary as used herein refers to Watson-Crick (e.g., A-T/U and C-G) or Hoogsteen base pairing between nucleotides or nucleotide analogs of nucleic acid molecules. Complementarity refers to a property shared between two nucleic acid sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position will be complementary.
Sequence Similarity
[0126] Homology and homologous refers to sequence similarity between two peptides or two nucleic acid molecules. Homology can be determined by comparing each position in the aligned sequences. A degree of homology between nucleic acid or between amino acid sequences is a function of the number of identical or matching nucleotides or amino acids at positions shared by the sequences. As the term is used herein, a nucleic acid sequence is substantially homologous to another sequence if the two sequences are substantially identical and the functional activity of the sequences is conserved (as used herein, the term homologous does not infer evolutionary relatedness, but rather refers to substantial sequence identity, and thus is interchangeable with the terms identity/identical). Two nucleic acid sequences are considered substantially identical if, when optimally aligned (with gaps permitted), they share at least about 50% sequence similarity or identity, or if the sequences share defined functional motifs. In alternative embodiments, sequence similarity in optimally aligned substantially identical sequences may be at least 60%, 70%, 75%, 80%, 85%, 90% or 95%. For the sake of brevity, the units (e.g., 66, 67 . . . 81, 82, . . . 91, 92%, . . . ) have not systematically been recited but are considered, nevertheless, within the scope of the present invention.
[0127] Substantially complementary nucleic acids are nucleic acids in which the complement of one molecule is substantially identical to the other molecule. Two nucleic acid or protein sequences are considered substantially identical if, when optimally aligned, they share at least about 70% sequence identity. In alternative embodiments, sequence identity may for example be at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 98% or at least 99%. Optimal alignment of sequences for comparisons of identity may be conducted using a variety of algorithms, such as the local homology algorithm of Smith and Waterman, 1981, Adv. Appl. Math 2: 482, the homology alignment algorithm of Needleman and Wunsch, 1970, J. Mol. Biol. 48:443, the search for similarity method of Pearson and Lipman (Pearson and Lipman 1988), and the computerized implementations of these algorithms (such as GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, Madison, Wis., U.S.A.). Sequence identity may also be determined using the BLAST algorithm, described in Altschul et al. (Altschul et al. 1990), using the published default settings. Software for performing BLAST analysis may be available through the National Center for Biotechnology Information (through the internet at http://www.ncbi.nlm.nih.gov/). The BLAST algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence that either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold. Initial neighborhood word hits act as seeds for initiating searches to find longer HSPs. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Extension of the word hits in each direction is halted when the following parameters are met: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment. One measure of the statistical similarity between two sequences using the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. In alternative embodiments of the invention, nucleotide or amino acid sequences are considered substantially identical if the smallest sum probability in a comparison of the test sequences is less than about 1, preferably less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
[0128] An alternative indication that two nucleic acid sequences are substantially complementary is that the two sequences hybridize to each other under moderately stringent, or preferably stringent, conditions. Hybridization to filter-bound sequences under moderately stringent conditions may, for example, be performed in 0.5 M NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65 C., and washing in 0.2SSC/0.1% SDS at 42 C. (Ausubel 2010). Alternatively, hybridization to filter-bound sequences under stringent conditions may, for example, be performed in 0.5 M NaHPO4, 7% SDS, 1 mM EDTA at 65 C., and washing in 0.1SSC/0.1% SDS at 68 C. (Ausubel 2010). Hybridization conditions may be modified in accordance with known methods depending on the sequence of interest (Tijssen 1993). Generally, stringent conditions are selected to be about 5 C. lower than the thermal melting point for the specific sequence at a defined ionic strength and pH.
[0129] Promoter as used herein means a synthetic or naturally-derived nucleic acid molecule which is capable of conferring, modulating or controlling (e.g., activating, enhancing and/or repressing) expression of a nucleic acid in a cell. A promoter may comprise one or more specific transcriptional regulatory sequences to further enhance or repress expression and/or to alter the spatial expression and/or temporal expression of same. A promoter may also comprise distal enhancer or repressor elements, which may be located as much as several thousand base pairs from the start site of transcription. A promoter may be derived from sources including viral, bacterial, fungal, plants, insects, and animals. A promoter may regulate the expression of a gene component constitutively or differentially with respect to cell, the tissue or organ in which expression occurs or, with respect to the developmental stage at which expression occurs, or in response to external stimuli such as physiological stresses, pathogens, metal ions, or inducing agents. Representative examples of promoters include the bacteriophage T7 promoter, bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 late promoter, SV40 early promoter, RSV-LTR promoter, CMV, CMV IE promoter, SV40 early promoter or SV40 late promoter and the CMV IE promoter. In embodiments, the U6, Cbh, CMV and/or H1m promotor is used to express one or more gRNAs in a cell.
[0130] A WPRE sequence refers to the Woodchuck Hepatitis Virus (WHP) Posttranscriptional Regulatory Element (WPRE) which is a DNA sequence that, when transcribed, creates a tertiary structure which may enhance expression. The sequence is commonly used in molecular biology to increase expression of genes delivered by viral vectors. WPRE is a tripartite regulatory element with gamma, alpha, and beta components. The full tripartite sequence has 100% homology with base pairs 1093 to 1684 (SEQ ID NO: 170 or 196) of the Woodchuck hepatitis B virus (WHV8) genome. When used in the 3 untranslated region (UTR) of a mammalian expression cassette, it can significantly increase mRNA stability and protein yield.
[0131] Vector as used herein means a nucleic acid sequence containing an origin of replication. A vector may be a viral vector, bacteriophage, bacterial artificial chromosome or yeast artificial chromosome. A vector may be a DNA or RNA vector. A vector may be a self- replicating extrachromosomal vector, and preferably, is a DNA plasmid. For example, the vector may comprise nucleic acid sequence(s) that/which encode(s) a gRNA, a donor (or patch) nucleic acid, and/or a CRISPR nuclease (e.g., Cas9 or Cpf1) of the present invention. A vector for expressing one or more gRNA will comprise a DNA sequence of the gRNA.
[0132] Adeno-associated virus or AAV as used interchangeably herein refers to a small virus belonging to the genus Dependovirus of the Parvoviridae family that infects humans and some other primate species. AAV is not currently known to cause disease and consequently the virus causes a very mild immune response.
[0133] Subject and patient as used herein interchangeably refers to any vertebrate, including, but not limited to, a mammal (e.g., cow, pig, camel, llama, horse, goat, rabbit, sheep, hamsters, guinea pig, cat, dog, rat, and mouse, a non-human primate (for example, a monkey, such as a cynomolgus or rhesus monkey, chimpanzee, etc.) and a human). In some embodiments, the subject may be a human or a non-human. In an embodiment, the subject or patient may suffer from FRDA and has an abnormal GAA trinucleotide repeat expansion. The subject or patient may be undergoing other forms of treatment.
[0134] The present invention is illustrated in further details by the following non-limiting examples.
EXAMPLE 1
Materials and Methods
[0135] DNA constructs. Plasmids used in this study included the following: px330 as px330-U6-Chimeric_BB-CBh-hSpCas9 (Addgene plasmid #42230)(43), pxGFP or pxPuro as pSpCas9(BB)-2A-GFP or Puro (Addgene plasmids #48138/48139)(44) and px601 as px601-AAV-CMV::NLS-SaCas9-NLS-3xHA-bGHpA;U6::Bsal- gRNA (Addgene plasmid #61591) (31) which were provided by Feng Zhang (Department of Genetics, Harvard Medical School, Boston, Mass., USA). Plasmids used also included the pRGEN-CMV-CjCas9 plasmid (Addgene #89752) the pU6-Cj- gRNA plasmid (Addgene #89753), which were provided by Seokjoon Kim (ToolGen, Geumcheon-gu, Seoul, South Korea). Others plasmids included a recombinant AAV vector backbone modified from the pAAV_TALE-TF (VP64)-BB_V3 (Addgene#42581), provide by Feng Zhang (Department of Genetics, Harvard Medical School, Boston, Mass., USA). Oligonucleotides coding for guide RNAs were synthetized by Integrated DNA Technologies (IDT inc., Coralville, Iowa) and cloned into Bbsl (or Bsal for px601) restriction sites according to Zhang's guidelines (https://www.addgene.org/static/cms/filer_public/e6/5a/e65a9ef8-c8ac-4f88-98da-3b7d7960394c/zhang-lab-general-cloning-protocol.pdf). All DNA constructs were sent for sequencing using the primer U6F (5-GTCGGAACAGGAGAGCGCACGAGGGAG, SEQ ID NO: 5) or the H1F primer (5-TGTCGCTATGTGTTCTGGGA, SEQ ID NO: 211) to the Genomic sequencing and genotyping platform of the CHU de Qubec (Quebec City, QC, Canada).
[0136] When needed, PCR amplicons from plasmidic or genomic DNA were cloned into the linearized cloning vector pMiniT (NEB, 1pwisch, Mass.) and sequenced using the manufacturer provided forward and reverse primers.
[0137] Modifications of the original px601 vector were performed as follows. The CMV promoter (577 bp, located between Xhol and Agel sites) was replaced by short versions of 212 or 259 bp amplified from the pscAAV-GFP plasmid from John T. Gray (Addgene plasmid #32396) (45) and according to previous experimentations published by Senis and al. (32). The px601 polyA sequence (204 bp, included in the sequence between EcoRl and Kpnl sites) was replaced by a short version of 60 bp (32) cloned as a gBLOCK (IDT inc., Coralville, Iowa) while preserving the Kpnl restriction site. A sequence comprising the H1 minimal promoter (H1m), a selected cloned oligonucleotide gRNA-coding and the SaCas9 tracrRNA was amplified from the home made pGL3 H1m/BbsI/SaCas9 and cloned into the Kpnl site of the newly prepared px601 vector. Finally, and if not previously included in the plasmid, the second oligonucleotide coding for a gRNA was cloned into the Bsal site following the U6 promoter.
[0138] Modifications of the original pAAV_TALE-TF (VP64)-BB_V3 were performed as follows. A fragment containing the CjCas9 gene (amplified from the pRGEN-CMV-CjCas9 plasmid under the control of a CBh promoter, followed by a SV40 late poly A and a WPRE sequence and two gRNAs expressed from either the human U6 promoter or a minimal H1 promoter was clone into a gel purified Xbal/Notl digested pAAV plasmid.
[0139] All PCR amplifications were performed using the Phusion High Fidelity polymerase (Thermo Fisher Scientific inc., Waltham, Mass.). All cloning were performed using the In-Fusion HD cloning kit (Clontech Laboratories inc., Mountain View, Calif.). Plasmid design and sequencing analysis were done using the CLC main workbench software version 7.6 (CLC bio/Qiagen inc., Hilden, Germany).
[0140] Mouse cells and animal model. Mouse fibroblasts derived from the YG8R and YG8sR mouse models were obtained from Dr. MA Pook (Brunel University, London, UK). Characterization of these mice by Pook's group revealed that the YG8R fibroblasts carried two tandem copies of the human FXN gene with about 82 and 190 GAA trinucleotide sequence repeats (25) while the YG8sR has 190 GAA repeats. The Y47R cell line, which has been produced and isolated the same way as the YG8R cell line, contains however a single copy of the wild-type human FXN transgene, with about 9 GAA trinucleotide repeats (25). The mouse fibroblasts were cultured at 37 C., 5% CO.sub.2 in high glucose DMEM (Wisent inc., St-Bruno, QC, Canada) supplemented with 10% fetal bovine serum (GE healthcare Life Sciences inc., Mississauga, ON, Canada), 1 mM sodium pyruvate, 1 mM L-glutamine and 1X non-essential amino acids (Wisent inc.).
[0141] The mouse model YG8R (Fxntm1Mkn /Tg (FXN)YG8Pook/J)(25) homozygous for the Fxntm1Mkn (FXN) targeted allele and hemizygous for the Tg (FXN)YG8Pook (FXN, human) transgene was purchased from the Jackson Laboratory (stock number 012253, Bar Harbor, Minn).
[0142] Transfections and clonal expansion. Mouse YG8R or YG8sR fibroblasts or human 283T or FRDA cells were seeded and transfected at 70-80% confluence with DNA using Lipofectamine 2000 (Life Technologies inc., Carlsbad, Calif.) according to the manufacturer's instructions. Cells were harvested 48 hours later for DNA, RNA and protein analysis. For clonal expansion, puromycin (0.75 pg/ml) was added 24 h post-transfection and 48 h later, remaining cells were seeded in 96-well plates at 0.75 cell/well and expanded.
[0143] Five hundred thousand (510.sup.5) normal or FRDA fibroblasts (YG8R or YG8sR, passages10) were nucleofected using the Amaxa system and program P-022 for normal human dermal adult fibroblasts (VPD-1001, Lonza inc., Walkersville, Md.). Cells were harvested 72 hours later for genomic DNA or RNA transcriptional analysis. When needed, fluorescence from transfected cells was visualized using a Zeiss Axiovert 100-Inverted microscope (Zeiss inc., Oberkochen, Germany).
[0144] For experiments using human FRDA fibroblasts (Example 8), cells (#GM04078 and #GM03665) were purchased from the Coriell Institute (Boston, Mass.). One million (110.sup.6) cells (passages 10) were nucleofected using the Amaxa system as described above. Plasmids expressing the SpCas9 and C2C20 or C15C20 combinations or a ribonucleoproteic complex of 2.5 uM of SpCas9 protein (Feldan Therapeutics, Quebec, Canada) and 150 pmol of each gRNA in vitro transcribed (HiScribe T7 RNA high yield RNA synthesis kit #E2040S, New England Biolabs inc.) from DNA templates. Cells were harvested 72 hours later for genomic DNA. A primer-based assay using the F9 (5- TCCCGGTTGCATTTACACTG, SEQ ID NO: 9) and R3 (5-AGGGGGAGCTTAGGGTCAAT, SEQ ID NO: 11) primer set was used to amplify the corrected, GAA deleted, DNA molecules. The FRDA fibroblasts were cultured at 37 C., 5% CO2 in high glucose DMEM (Wisent inc., St-Bruno, QC, Canada) supplemented with 10% fetal bovine serum (GE healthcare Life Sciences inc., Mississauga, ON, Canada), 1 mM sodium pyruvate, 1 mM L-glutamine and 1X non-essential amino acids (Wisent inc.).
[0145] In vivo DNA electrotransfer. The electrotransfer was performed in the Tibialis anterior (TA) of adult YG8LR mice as previously described (46). Briefly, 40 g of DNA consisting of a mixture of two pxGFP plasmids (encoding for SpCas9 and two gRNAs) were electroporated into the TA muscle of YG8LR mice. The latter were euthanized 1 month later, TAs were collected and genomic DNA was extracted immediately or the TA was embedded in OTC and snap-frozen in liquid nitrogen. PCR amplification was performed to detect deletions, according to the gRNA pair used. All experiments involving animals were approved by the animal care committee of the Centre Hospitalier Universataire de Qubec-Universit Laval (CHUQ-Universit Laval).
[0146] AAV production and infection. Viruses were produced with the Plateforme d'outils molculaires at Centre de recherche Institut Universitaire en Sant Mentale Qubec. 7.510.sup.11 vector genomes of each AAV-Cas9 and AAV- gRNA C2C20 PHB.P-serotyped viruses were co-injected intravenously in month-old YG8sR. One month later, mice were euthanized and organs were collected (brain, medulla, spinal cord, dorsal root ganglia, liver, heart, Tibialis anterior and pancreas) and genomic DNA was extracted. A PCR was performed to detect the viruses in various samples using the Cas9 and the RSV primers. Digital droplet PCR (ddPCR) analysis of genomic DNA was performed using the following primers and probes to detect the non-edited molecules (Fw: GATTGGTTGCCAGTGCTTAAA, SEQ ID NO: 34; Rev: TCAGGTGATCCACCTTCCTA, SEQ ID NO: 35; Probe:5-(HEX)-TGCCCATAATCTCA-(IABkFQ)-3, SEQ ID NO: 36, HEX as the reporter and IOWA black FQ (IABkFQ) as the quencher) and edited molecules (Fw:GATTGGTTGCCAGTGCTTAAA, SEQ ID NO: 34; Rev:GTTGCAGTGAGCTGAGACT, SEQ ID NO: 37, Probe: 5-(FAM)-AGTGCAGTGGCT-(IABkFQ)-3 SEQ ID NO: 38, FAM as the reporter and IABkFQ as the quencher). Genomic DNA was digested using HindIII within the ddPCR pre-mix and droplets were generated using the droplet generator (Bio-Rad). Then, molecules were amplified as follows: 1 cycle at 95 C. 10 minutes then 40 cycles of 95 C. 30 seconds and 57 C. 45 seconds. Droplets were read using the droplet reader (Bio-Rad). Data analysis was performed using the Quantasoft software.
[0147] Genomic DNA analysis. Cells or tissues (TA) were harvested, resuspended in lysis buffer (50 mM EDTA pH 8.0, 10% Sarcosyl, 0.5 mg/ml Proteinase K) and genomic DNA was extracted using a standard phenol/chloroform and ethanol precipitation method. The polymerase chain reaction was done using primer sequences provided in Table 4. The conditions for PCR reactions, using the Phusion High Fidelity polymerase (Thermo Fisher Scientific inc., Waltham, Mass.) were: 35 cycles, denaturation at 98 C. for 10 sec, annealing at 60 C. for 10 sec, elongation at 72 C. for 90 sec. PCR products were visualized on agarose gel and if needed, submitted to the Surveyor Assay (Integrated DNA Technology inc., Coralville, Iowa) according to the manufacturer's instructions.
TABLE-US-00004 TABLE4 Primersusedinthisstudy Primername Sequence5-3 Species F1 AAGAATGGCTGTGGGGATGA Human F2 GTGGAAGCCCAATACGTGGC Human F3 GCTTTCCTGGAACGAGGTGA Human F4 GGATTTCCCAGCATCTCTGG Human F9 TCCCGGTTGCATTTACACTG Human F10 GGGTTGTCAGCAGAGTTGTG Human R3 AGGGGGAGCTTAGGGTCAAT Human R9 TGGCATCTTCAAGACCCTCA Human R10 GGAGAAAAGGGTGGGGAAGA Human FXNexons2/3F AAGCCATACACGTTTGAGGACTA Human FXNexons2/3R TTGGCGTCTGCTTGTTGATCA Human FXN5UTR/exon1F GGCGGAGCGGGCGGCAGAC Human FXN5UTR/exon1R GGGGCGTGCAGGTCGCATCG Human hFXNexon2F CCAACGTGGCCTCAACCAGAT Human hFXNexon2R GGGTGGCCCAAAGTTCCAGAT Human mFXNexon2F CATTTGAACCTCCACTACCTCCAGAT Mouse mFXNexon2R TGTCCAATGTCCCCAAGTTCCTC Mouse hFXNpromoterF GTTGCAGTAAGCCAGGACCAC Human hFXNpromoterR GATCCACCCGCCTCATTTATTTG Human mFXNpromoterF GAGGCCATATCCCAGAAGAAAACT Human mFXNpromoterR CAGGCAGCATGAATGGAGGAG Mouse HPRT1F CAGGACTGAAAGACTTGCTCGAGAT Mouse HPRT1R CAGCAGGTCAGCAAAGAACTTATAGC Mouse GAPDHF GGCTGCCCAGAACATCATCCCT Mouse GAPDHR ATGCCTGCTTCACCACCTTCTTG Mouse Cas9(fw) AGATGATCGCCAAGAGCGAG HumanizedS.p Cas9(rev) ATCCCCAGCAGCTCTTTCAC HumanizedS.p RSV(fw) TGCGGAATTCAGTGGTTCGT RSV RSV(rev) AGCTACAACAAGGCAAGGCT RSV
[0148] Copy number analysis. Oligoprimer pairs were designed by GeneTool 2.0 software (Biotools inc, Edmonton, AB, CA) and their specificity was verified by blast in the GenBank database. The synthesis was performed by IDT (Integrated DNA Technology inc., Coralville, Iowa, USA) (Table 4).
[0149] 40 ng of genomic DNA was used to perform fluorescent-based Realtime PCR quantification using the LightCycler 480 (Roche Diagnostics inc., Mannheim, Del.). Reagent LightCycler 480 SYBRGreen I Master (Roche Diagnostics inc., Indianapolis, Ind., USA) was used as described by the manufacturer. The conditions for PCR reactions were: 45 cycles, denaturation at 98 C. for 10 sec, annealing at 62 C. for 10 sec, elongation at 72 C. for 14 sec and reading for 5 sec. A melting curve was performed to assess non-specific signal. Relative quantity was calculated using the delta Ct method (47). Quantitative Real-Time PCR measurements were performed by the CHU de Qubec Research Center (CHUL) Gene Expression Platform, Quebec, Canada and were compliant with MIQE guidelines (48, 49).
[0150] RNA analysis. Cells were harvested, resuspended in Trizol and RNA was isolated. Total RNA was measured using a NanoDrop ND-1000 Spectrophotometer (NanoDrop Technologies inc., Wilmington, Del.) and total RNA quality was assayed on an Agilent BioAnalyzer 2100 (Agilent Technologies inc., Santa Clara, Calif.).
[0151] First-strand cDNA synthesis was done using 500 ng of isolated RNA in a reaction containing 200 U of Superscript III Rnase H-RT (Invitrogen Life Technologies inc., Burlington, ON, CA), 300 ng of oligo-dT18, 50 ng of random hexamers, 50 mM Tris-HCl pH 8.3, 75 mM KCl, 3 mM MgCl.sub.2, 500 M deoxynucleotides triphosphate, 5 mM dithiothreitol, and 40 U of Protector RNase inhibitor (Roche Diagnostics inc., Indianapolis, Ind.) in a final volume of 50 l. Reaction was incubated at 25 C. for 10 min, then at 50 C. for 1 h and PCR purification kit (Qiagen inc., Hilden, Del.) was used to purify cDNA.
[0152] cDNA corresponding to 20 ng of total RNA was used to perform fluorescent-based Realtime PCR quantification using the LightCycler 480 (Roche Diagnostics inc., Mannheim, DE). Reagent LightCycler 480 SYBRGreen I Master (Roche Diagnostics inc., Indianapolis, Ind.) was used as described by the manufacturer with 2% DMSO. The conditions for PCR reactions were: 45 cycles, denaturation at 95 C. for 10 secs, annealing at 58 C. for 10 secs, elongation at 72 C. for 14 secs and then 74 C. for 5 sec (reading) using primers described in Table 4. A melting curve was performed to assess non-specific signal. Calculation of the number of copies of each mRNA was performed using second derivative method and a standard curve of Cp versus logarithm of the quantity (50). The standard curve was established using known amounts of purified PCR products (10, 102, 103, 104, 105 and 106 copies) and a LightCycler 480 v1.5 program provided by the manufacturer (Roche Diagnostics inc., Mannheim, Del.). The CHU de Qubec Research Center (CR-CHUQ) Gene Expression Platform, Quebec, Canada, performed quantitative real-time PCR measurements.
[0153] Protein analysis. Cells were harvested and resuspended in lysis buffer (137 mM NaCl, 50 mM Tris-HCl pH8 and 0.1% Triton .sup.X100) supplemented with 1X protease inhibitor cocktail (Roche Diagnostics Canada inc., Mississauga, ON, Canada). Protein extracts were loaded onto 12.5% SDS-PAGE and wet transfer was performed onto PVDF membrane. The latter was blotted using primary anti-FXN (ab110328, Abcam inc., Cambridge, UK or sc-25820, Santa Cruz Biotechnologies inc., Santa Cruz, Calif.), anti-HA (H-3663) anti-FLAG M2 (F-1804) and -actin (A-1978) from Sigma-Aldrich inc. (St-Louis, Mo.) antibodies. Mouse and rabbit secondary antibodies were purchased from Jackson ImmunoResearch inc. (West Grove, Pa.).
EXAMPLE 2
Identification of gRNA Pairs Targeting Sequences Upstream and Downstream GAA TRINUCLEOTIDE Repeats
[0154] gRNAs targeting sequences located upstream (5) and downstream (3) of the GAA trinucleotide repeats in intron 1 of the FXN gene (NG_008845) were designed. Sequences adjacent to the S. pyogenes NGG PAM were first identified (
[0155] The rescued YG8 (YG8R) mouse model is model system to study FRDA (24-26) which has been known for many years. The YG8R mouse genome contains 2 null mouse FXN genes but also 2 copies in tandem, of a FXN transgene obtained from an FRDA patient. These human transgenes contain respectively 82 and 190 GAA repeats in intron 1 and thus a reduced amount of human FXN is produced leading to the development of FRDA symptoms in mouse.
[0156] Therefore, plasmids were first transfected in mouse YG8R fibroblasts. These cells contain two human FRDA FXN transgenes in tandem comprising about 82 and 190 GAA repeats respectively (29) (
TABLE-US-00005 TABLE5 Pre-andpost-GAArepeattargetsequencesforS.pyogenesCas9. PositionoffirstnucleotideofGAArepeats:6725andoflastnucleotideofGAA repeats:6742ofFXNgenesequencesetforthinSEQIDNO:4(NG_008845). gRNA Distance target ofcut gRNA sequence from Sequence Pre- sequence gRNAtarget gene PAMgene Cutsite firstor removed or (SEQID sequence(5-3) position position gene lastGAA (SEQ post- ID NO.) Strand (SEQIDNO.) (5-3) (5-3) position repeat IDNO.) GAA C1 SEQID Antisense ATGAGCCACCGCGTCCTGCC 6599-6580 PAM 6579-6577 6582-6583 142 SEQIDNO: Pre NO:65 SEQIDNO:39 100 C2 SEQID Sense GATTTCCTGGCAGGACGCGG 6572-6591 TGG 6592-6594 6588-6589 136 SEQIDNO: Pre NO:66 SEQIDNO:40 101 C3 SEQID Antisense AAGTCCTAACTTTTAAGCAC 6563-6544 TGG 6543-6541 6546-6547 178 SEQIDNO: Pre NO:67 SEQIDNO:41 102 C4 SEQID Sense TCCGGAGTTCAAGACTAACC 6650-6669 TGG 6670-6672 6666-6667 58 SEQIDNO: Pre NO:68 SEQIDNO:42 103 C5 SEQID Antisense AGTCTTGAACTCCGGACCTC 6665-6646 AGG 6645-6643 6648-6649 76 SEQIDNO: Pre NO:69 SEQIDNO:43 104 C6 SEQID Sense CTAGGAAGGTGGATCACCTG 6627-6646 AGG 6647-6649 6643-6644 81 SEQIDNO: Pre NO:70 SEQIDNO:44 105 C7 SEQID Antisense CAGGCGCGCGACACCACGCC 6781-6762 CGG 6761-6759 6764-6765 22 SEQIDNO: Post NO:71 SEQIDNO:45 106 C8 SEQID Sense GAGAATCGCTTGAGCCCGGG 6812-6831 AGG 6832-6834 6828-6829 86 SEQIDNO: Post NO:72 SEQIDNO:46 107 C9 SEQID Antisense CCGCAGCCTCTGGAGTAGCT 6808-6789 GGG 6888-6886 6891-6892 49 SEQIDNO: Post NO:73 SEQIDNO:47 108 C10 SEQID Antisense CGGAGTGCATTGGGCGATCT 6873-6854 TGG 6853-6851 6856-6857 114 SEQIDNO: Post NO:74 SEQIDNO:48 109 C11 SEQID Sense AAAGAAAAGTTAGCCGGGCG 6746-6765 TGG 6766-6768 6762-6763 20 SEQIDNO: Post NO:75 SEQIDNO:49 110 C12 SEQID Sense CAAGATCGCCCAATGCACTC 6852-6871 CGG 6872-6874 6868-6869 126 SEQIDNO: Post NO:76 SEQIDNO:50 111 C13 SEQID Antisense TTTCAAGCCGTGGCGTAAC 6221-6203 TGG 6202-6200 6205-6206 519 SEQIDNO: Pre NO:77 SEQIDNO:51 112 C14 SEQID Sense GACGCCCATTTTGCGGACC 6084-6102 TGG 6103-6105 6099-6100 625 HOIDNO: Pre NO:78 SEQIDNO:52 113 C15 SEQID Sense AGTTACGCCACGGCTTGAA 6202-6220 AGG 6221-6223 6217-6218 507 SEQIDNO: Pre NO:79 SEQIDNO:53 114 C16 SEQID Antisense ATACCATGTCCTCCCCTTG 6283-6265 AGG 6264-6262 6267-6268 457 SEQIDNOs: Pre NO:80 SEQIDNO:54 115and116 C17 SEQID Antisense ATAATCCCAGCTACTCGGG 7251-7233 AGG 7232-7230 7235-7236 493 SEQIDNO: Post NO:81 SEQIDNO:55 117 C18 SEQID Sense GTCTCGAACTCCCAACCTC 7305-7323 AGG 7324-7326 7320-7321 578 SEQIDNO: Post NO:82 SEQIDNO:56 118 C19 SEQID Antisense CACTTTGGGAGGGCGAGGT 7355-7337 GGG 7336-7334 7339-7340 597 SEQIDNO: Post NO:83 SEQIDNO:57 119 C20 SEQID Antisense TCCAGCCTGGGCAACAAGA 7161-7143 GGG 7142-7140 7145-7146 403 SEQIDNO: Post NO:84 SEQIDNO:58 1120
EXAMPLE 3
Deletion of the FXN Intronic GAA Repeats in YG8R Fibroblasts
[0157] Some gRNA pairs were selected and were cloned into pxPuro, which shares similarities with px330 but contains a puromycin gene for selection. These new plasmids were retested in YG8R cells (
EXAMPLE 4
Protein Analysis IN YG8R Clones
[0158] FXN protein levels were thus analyzed in samples from a heterogeneous gRNA/SpCas9 transfected YG8R cell population (
[0159] The surprising initial in vitro results in YG8R fibroblasts (where a reduction, rather than an increase in FXN expression level was generally detected) is explained by the presence of two FRDA transgenes in tandem (one with about 82 GAA repeats and the other with about 190 GAA repeats) in the YG8R mouse genome. Some gRNA pairs tested frequently removed not only the GAA repeats but also (through NHEJ) one complete copy of the hFXN transgene. Since only one functioning complete FXN transgene (including the promoter region) remained, no significant change in FXN levels or reduced FXN expression (compared with the untreated YG8R cells expressing FXN from two copies of the hFXN transgene) was detected.
EXAMPLE 5
Deletion of the FXN Intronic GAA Repeats in YG8SR Fibroblasts
[0160] Recently a new mouse model derived from the YG8R model has been described. During the course of breeding, some YG8R mice have lost one of the human transgene (27). This new model called YG8sR presents more severe symptoms than the original mouse model, including significant behavioral deficits, together with some level of glucose intolerance and insulin hypersensitivity. These symptoms are also associated with significantly reduced expression of FAST-1 and FXN, and the presence of pathological vacuoles within neurons of the dorsal root ganglia (DRG). The YG8sR model thus represents more closely the symptoms observed in more severely affected FRDA subjects.
[0161] Three (3) new mouse fibroblast cell lines (called YG8sR-6, YG8sR-8 and YG8sR-39) derived from 3 different YG8sR mice were used for further experiments. Each cell line contained only one copy of the human FXN transgene with about 190 GAA repeats within intron 1 (28) (
[0162] Since the YG8sR cells contain only one copy of a mutated human FXN transgene, only one rearrangement is possible by NHEJ recombination following cuts on both sides of the GAA repeat expansion (
EXAMPLE 6
Identification of YG8SR Clones Expressing High Amounts of FXN Protein
[0163] Analysis of YG8sR C2C20 clones. Protein extracts by western blot revealed an increase in FXN protein levels in two C2C20 clones (
[0164] The copy number of hFXN transgene in C2C20 clones was also measured. As expected no change was found, in almost all clones, compared to the YG8sR untreated population (
EXAMPLE 7
Electroporation of Plasmid DNA into the Tibialis Anterior Shows in Vivo Correction
[0165] Three gRNA combinations were tested in vivo. Briefly, plasmids coding for SpCas9 and a pair of gRNAs (either C2C20, C15C20 or C16C20) were electrotransfered into the Tibialis anterior (TA) of YG8R mouse muscles (
EXAMPLE 8
AAV-Encoded S. aureus cas9 Plasmid Generate cuts in Mouse Fibroblasts
[0166] As FRDA is a neuro-muscular degenerative disease involving mainly the brain, the spinal ganglia, the heart and the pancreas, a viral vector was used to deliver pairs of gRNAs in target tissues in vivo. The gRNAs were redesigned to provide an adeno-associated virus (AAV) encoding the recently available S. aureus (Sa) Cas9 protein, which requires a NNGRRT PAM sequence (31). Target sequences were thus adjusted in order to be recognized by the humanized S. aureus Cas9 (see Table 6 below).
[0167] SaCas9 PAM sequences located close to previously identified SpCas9 PAMs were selected, i.e., the C2 and C20 sites (
TABLE-US-00006 TABLE6 Pre-andpost-GAArepeattargetsequencesforS.aureusCas9. gRNA Distanceof target cutfrom gRNA sequence firstor Sequence Pre- sequence gRNAtarget gene PAMgene Cutsite lastnuc- removed or (SEQID sequence(5-3) position position gene leotidein (SEQ post- ID NO.) Strand SEQIDNO. (5-3) PAM (5-3) position GAArepeat IDNO.) GAA AC1 SEQID Sense TAAAAGTTAGGACTTAGAAA 6549-6568 ATGGAT 6569-6574 6565-6566 159 SEQID Pre NO:85 SEQIDNO:59 NO:121 AC2 SEQID Sense ACTTTGGGAGGCCTAGGAAG 6615-6634 GTGGAT 6635-6640 6631-6632 93 SEQID Pre NO:86 SEQIDNO:60 NO:122 AC3 SEQID Antisense TTTGTATTTTTTAGTAGATA 6711-6692 CTGGGT 6691-6686 6694-6695 30 SEQID Pre NO:87 SEQIDNO:61 NO:123 AC4 SEQID Antisense GCCGCAGCCTCTGGAGTAGC 6809-6790 TGGGAT 6789-6784 6792-6793 50 SEQID Post NO:88 SEQIDNO:62 NO:124 ACS SEQID Antisense CCCATGCTGTCCACACAGGC 7093-7074 AGGGGT 7078-7073 7076-7077 334 SEQID Post NO:89 SEQIDNO:63 NO:125 AC6 SEQID Sense TTCCCTCTTGTTGCCCAGGC 7138-7157 TGGAGT 7158-7163 7154-7155 412 SEQID Post NO:90 SEQIDNO:64 NO:126
EXAMPLE 9
Single Intravenous Injection of AAV Vectors Coding for SPCAS9 and gRNAS in 1 Month-Old YG8SR Mice Enables Removal of GAA Repeats Intron 1 and Correction of FXN Gene in Liver Cells
[0168] In vivo correction of intron 1 of the FXN gene using the CRISPR/Cas system was further assessed the YG8sR mouse model. Two AAV viruses were used (
[0169] AAV viruses were injected in one-month old mice. The mice were euthanized one month later. Genomic DNA was extracted from brain, medulla, spinal cord, dorsal root ganglia, liver, heart, Tibialis anterior and pancreas. DNA carrying the SpCas9 gene and the RSV promoter (from the gRNA plasmid) were detected in all analyzed tissues, including the brain. A digital droplet PCR approach was used to detect the correction.
[0170] Analysis reveals about 0.6-2% correction in the liver and lower percentages in other tissues. It is expected that longer infection periods will increase the number of corrected cells (
EXAMPLE 10
Correction of GAA Trinucleotide Repeats in Intron 1 of the FXN Gene in Human FRDA Primary Fibroblasts
[0171] The efficiency of the method was further tested in human primary fibroblasts of FRDA patients. Two different techniques were used to achieve correction of the FXN gene using the CRISPR/Cas system: 1) nucleofection of SpCas9 and gRNA expression plasmids; or 2) nucleofection of a ribonucleoproteic complex (SpCas9 protein and gRNAs). Both methods allowed to remove GM repeats and correct the FXN gene, resulting in smaller amplicons (See
[0172] The scope of the claims should not be limited by the preferred embodiments set forth in the examples, but should be given the broadest interpretation consistent with the description as a whole.
EXAMPLE 11
C. jejuni can be used to Delete GAA Repeats frome Intron 1 of the FXN Gene and its Small Size Gene Allows Packaging of Optimized Molecular Components in a Single AAV Vector
[0173] The CjCas9 (SEQ ID NO: 155, 156 or 157) was selected because of its smaller gene size compared to SpCas9 and SaCas9 (
[0174] A new AAV vector was constructed to introduce the CjCas9 gene (amplified from the pRGEN-CMV-CjCas9 plasmid (Addgene #89752)) under the control of a CBh promoter. A WPRE sequence was added to enhance expression. Two gRNAs can be expressed at the same time; one from a human U6 promoter and the other from a minimal H1 promoter (
EXAMPLE 12
Preferred Target Regions for Deletion of GAA Repeats were Identified within Pre- and Post-GAA Regions
[0175] The best pre-GAA region (see
[0176] Table 8 below summarizes the gRNAs tested and their efficiency, together with a CRISPR nuclease, in targeting and cutting the FXN gene.
TABLE-US-00007 TABLE7 Pre-andpost-GAArepeattargetsequencesforC.jejuniCas9. Distance gRNA ofcut target fromfirst gRNA sequence orlast Sequence Pre- sequence gRNAtarget gene PAMgene Cutsite nucleotide removed or (SEQ. sequence(5-3) position position gene inGAA (SEQ post- ID IDNO.) Strand SEQIDNO. (5-3) PAM (5-3) position repeat IDNO.) GAA Cj1 SEQID Anti- CTTTCATCTCCCCTAATACATG 6422-6401 CGGCGTAC 6400-6393 6403-6404 321 158 Pre NO:197 sense SEQIDNO:144 Cj2 SEQID Anti- GTGGCCTGCCTCTTTCATCTCC 6433-6412 CCTAATAC 6411-6404 6414-6415 310 159 Pre NO:198 sense SEQIDNO:145 Cj3 SEQID Sense CATATTTGTGTTGCTCTCCGGA 6442-6463 GTTTGTAC 6464-6471 6460-6461 264 160 Pre NO:199 SEQIDNO:146 Cj4 SEQID Anti- TCTTCAAACACAATGTGGGCCA 6525-6502 AATAACAC 6501-6494 6504-6505 220 161 Pre NO:200 sense SEQIDNO:147 Cj4 SEQID Anti- GGCAACCAATCCCAAAGTTTCT 6542-6521 TCAAACAC 6520-6513 6523-6524 201 162 Pre NO:201 sense SEQIDNO:148 Cj6 SEQID Anti- TCCACACAGGCAGGGGTGGAAG 7084-7063 CCCAATAC 7062-7055 7065-7066 323 163 Post NO:202 sense SEQIDNO:149 Cj7 SEQID Anti- GAGGAGATCTAAGGACCATCAT 7002-6981 GGCCACAC 6980-6973 6984-6985 241 164 Post NO:203 sense SEQIDNO:150 Cj8 SEQID Sense GCAGACATTTATTACTTGGCTT 7010-7031 CTGTGCAC 7032-7039 7029-7030 286 165 Post NO:204 SEQIDNO:151 Cj9 SEQID Anti- GCCCAATACGTGGCAGCTCAGA 7063-7042 TAGTGCAC 7041-7034 7044-7044 302 166 Post NO:205 sense SEQIDNO:152 Cj10 SEQID Anti- AACTCTGCTGACAACCCATGCT 7107-7086 GTCCACAC 7085-7078 7088-7089 346 167 Post NO:206 sense SEQIDNO:153
TABLE-US-00008 TABLE 8 Summary of gRNAs tested and their efficiency Cuts (yes (y) gRNA or no (n)) Efficiency AC1 y +++ AC2 y ++++ AC3 y ++ AC4 n AC5 y ++ AC6 y ++++ C1 y ++ C2 y ++++ C10 n C11 y ++ C12 y + C13 n C14 n C15 y +++ C16 y +++ C17 y ++ C18 y +++ C19 y +++ C20 y ++++ Cj1 y +++ Cj2 y +++ Cj3 y ++ Cj4 y ++++ Cj5 y +++ Cj6 n Cj7 y ++++ Cj8 n Cj9 n Cj10 y ++++ C20 y ++++
[0177] gRNAs C3-C9 were also prepared and tested. Preliminary results regarding efficacy of these gRNAs were uncertain due to technical problems encountered during the tests. Accordingly their efficacy could not be determined with certainty.
TABLE-US-00009 TABLE 9 Sequences described herein SEQ ID NO(s) Description 1 FXN isoform 1 (210aa) from NP_000135.2 2 FXN isoform 2 (196aa) from NP_852090 3 FXN isoform 3 (171aa) from NP_001155178 4 FXN gene sequence from NCBI reference number NG_008845.2. Intron 1 extends from nts 5644 to nts15822. 5-38 Primer sequences listed in Example 1 39-64 gRNA target sequences in FXN intron 1 gene (Tables 4 and 5) 65-90 gRNA RNA sequences corresponding to the target sequences of SEQ ID NOs: 30-54 listed in Tables 5 and 6 91 S. pyogenes Cas9 RNA recognition sequence/scaffold sequence (derived from TracrRNA/crRNA) 92 S. aureus Cas9 RNA recognition sequence/scaffold sequence (derived from tracrRNA) 93 recognition sequence/scaffold sequence from Cpf1 tracrRNA 94 Protein sequence of humanized Cas9 from S. pyogenes (without NLS and without TAG) 95 Protein sequence of humanized Cas9 from S. pyogenes (with NLS and without TAG) 96 Protein sequence of humanized Cas9 from S. pyogenes (with NLS and with TAG, from Addgene plasmid #71814) 97 Protein sequence of humanized Cas9 from S. aureus (without NLS and without TAG) 98 Protein sequence of humanized Cas9 from S. aureus (with NLS and without TAG) 99 Protein sequence of humanized Cas9 from S. aureus (with NLS and with TAG, from Addgene plasmid #61591) 100-126 Polynucleotide sequence removed by Cas9/gRNAs in intron 1 of the FXN gene. SEQ ID NOs: 100-114 (gRNAs C1 to C15); SEQ ID NOs: 115 and 116 (gRNA C16-alternative cuts detected); SEQ ID NO: 117-120 (gRNAs C17-C20); and SEQ ID NOs: 121-126 (gRNAs AC1-AC6). 127-130 Promoter polynucleotide sequences for expressing gRNAs and CRISPR nucleases (see Example 8) 131-137 Partial sequences of corrected intron 1 of FXN gene following cuts with gRNA combinations C15C20 (SEQ ID NOs: 131-133); C2C11 (SEQ ID NO: 134); C2C20 (SEQ ID NOs: 135 and 136) and C16C20 (SEQ ID NO: 137). See also FIGS. 14A-D. 138-142 Partial sequences of corrected intron 1 of FXN gene following cuts with gRNA combinations C15C18 (SEQ ID NO: 138); C16C18 (SEQ ID NO: 139); C1C20 (SEQ ID NO: 140); AC1AC6 (SEQ ID NO: 141); and AC2AC6 (SEQ ID NO: 142). See also FIGS. 15A-E. 143 Forward primer F1 used to amplify upstream of the pre-GAA repeat (see Table 4) 144-153 gRNA target sequence/gRNA DNA sequence for gRNAs Cj1-Cj10 (see Table 7) 154 Cas9 recognition sequence from C. jejuni (i.e., gRNA scaffold sequence derived from crRNA and tracrRNA) 155 Humanized Cas9 protein sequence from C. jejuni (without NLS and without TAG) 156 Humanized Cas9 protein sequence from C. jejuni (NLS and without TAG) 157 Protein sequence of humanized high specific Cas9 from C. jejuni (with NLS and with TAG; from Addgene plasmid #89752) (1003 aa)-HA TAG (C-term) 158-167 Nucleotide sequence removed following cut by each of Cj1-Cj10 in intron 1 of FXN gene 168 H1 minimal promoter sequence 169 CBh (or CBA hybrid intron): CBA promoter with a hybrid intron composed of a 5 donor splice site from the chicken -actin 5 UTR and a 3 acceptor splice site from MVM (Minute virus of mice). 170 WPREL sequence: Sequence containing SV40 late poly A (135 bp) and Woodchuck post transcriptional region gamma and alpha elements (247 bp) 171-195 Partial sequence of frataxin intron 1 following cuts with Cj1Cj6 (SEQ ID NO: 171); Cj1Cj7 (SEQ ID NO: 172); Cj1Cj8 (SEQ ID NO: 173); Cj1Cj9 (SEQ ID NO: 174); Cj1Cj10 (SEQ ID NO: 175); Cj2Cj6 (SEQ ID NO: 176); Cj2Cj7 (SEQ ID NO: 177); Cj2Cj8 (SEQ ID NO: 178); Cj2Cj9 (SEQ ID NO: 179); Cj2Cj10 (SEQ ID NO: 180); Cj3Cj6 (SEQ ID NO: 181); Cj3Cj7 (SEQ ID NO: 182); Cj3Cj8 (SEQ ID NO: 183); Cj3Cj9 (SEQ ID NO: 184); Cj3Cj10 (SEQ ID NO: 185); Cj4Cj6 (SEQ ID NO: 186); Cj4Cj7 (SEQ ID NO: 187); Cj4Cj8 (SEQ ID NO: 188); Cj4Cj9 (SEQ ID NO: 189); Cj4Cj10 (SEQ ID NO: 190); Cj5Cj6 (SEQ ID NO: 191); Cj5Cj7 (SEQ ID NO: 192); Cj5Cj8 (SEQ ID NO: 193); Cj5Cj9 (SEQ ID NO: 194); and Cj5Cj10 (SEQ ID NO: 195) 196 WPRE sequence comprising alpha, beta and gamma elements 197-206 gRNA sequence of Cj1-Cj10 of Table 7 (CjCas9) 207-208 Fragments of human FXN gene intron 1 corresponding to effective subregions discussed in Example 12 (SEQ ID NO. 207 corresponds to a subregion upstream of GAA repeats and SEQ ID NO: 208 corresponds to a subregion downstream of GAA repeats). 209-210 Fragments of human FXN gene intron 1 corresponding to effective regions identified in FIG. 18 (pre GAA, (6201-6633, SEQ ID NO: 208) and post GAA (7078-7161, SEQ ID NO: 210) 211 Primer H1F Example 1
REFERENCES
[0178] 1. Babady N E, Carelle N, Wells R D, Rouault T A, Hirano M, Lynch D R, et al. Advancements in the pathophysiology of Friedreich's Ataxia and new prospects for treatments. Mol Genet Metab. 2007; 92(1-2):23-35. [0179] 2. Cooper J M, Schapira A H. Friedreich's Ataxia: disease mechanisms, antioxidant and Coenzyme Q10 therapy. Biofactors. 2003; 18(1-4):163-71. [0180] 3. Harding A E. Friedreich's ataxia: a clinical and genetic study of 90 families with an analysis of early diagnostic criteria and intrafamilial clustering of clinical features. Brain. 1981; 104(3):589-620. [0181] 4. Lynch D R, Farmer J M, Balcer L J, Wilson R B. Friedreich ataxia: effects of genetic understanding on clinical evaluation and therapy. Arch Neurol. 2002; 59(5):743-7. [0182] 5. Pandolfo M. Molecular pathogenesis of Friedreich ataxia. Arch Neurol. 1999; 56(10):1201-8. [0183] 6. Pandolfo M. Friedreich ataxia: the clinical picture. J Neurol. 2009; 256 Suppl 1:3-8. [0184] 7. Pandolfo M. Friedreich ataxia. Handbook of clinical neurology (Chapter 17)/edited by PJ Vinken and GW Bruyn. 2012; 103:275-94. [0185] 8. Campuzano V, Montermini L, Molto M D, Pianese L, Cossee M, Cavalcanti F, et aL Friedreich's ataxia: autosomal recessive disease caused by an intronic GAA triplet repeat expansion. Science. 1996; 271(5254):1423-7. [0186] 9. Pandolfo M. The molecular basis of Friedreich ataxia. Adv Exp Med Biol. 2002; 516:99-118. [0187] 10. Campuzano V, Montermini L, Lutz Y, Cova L, Hindelang C, Jiralerspong S, et al. FXN is reduced in Friedreich ataxia patients and is associated with mitochondrial membranes. Hum Mol Genet. 1997; 6(11):1771-80. [0188] 11. Pandolfo M. Iron and Friedreich ataxia. J Neural Transm Suppl. 2006(70):143-6. [0189] 12. Coppola G, Choi S H, Santos M M, Miranda C J, Tentler D, Wexler E M, et al. Gene expression profiling in FXN deficient mice: microarray evidence for significant expression changes without detectable neurodegeneration. Neurobiol Dis. 2006; 22(2):302-11. [0190] 13. Coppola G, Marmolino D, Lu D, Wang Q, Cnop M, Rai M, et al. Functional genomic analysis of FXN deficiency reveals tissue-specific alterations and identifies the PPARgamma pathway as a therapeutic target in Friedreich's ataxia. Hum Mol Genet. 2009; 18(13):2452-61. [0191] 14. Gerber J, Muhlenhoff U, Lill R. An interaction between FXN and Isu1/Nfs1 that is crucial for Fe/S cluster synthesis on Isu1. EMBO Rep. 2003; 4(9):906-11. [0192] 15. Wiedenheft B, Sternberg S H, Doudna J A. RNA-guided genetic silencing systems in bacteria and archaea. Nature. 2012; 482(7385):331-8. [0193] 16. Bhaya D, Davison M, Barrangou R. CRISPR-Cas systems in bacteria and archaea: versatile small RNAs for adaptive defense and regulation. Annu Rev Genet. 2011; 45:273-97. [0194] 17. Terns M P, Terns R M. CRISPR-based adaptive immune systems. Curr Opin Microbiol. 2011; 14(3):321-7. [0195] 18. Mali P, Yang L, Esvelt K M, Aach J, Guell M, DiCarlo J E, et al. RNA-guided human genome engineering via Cas9. Science. 2013; 339(6121):823-6. [0196] 19. Mojica FJ, Diez-Villasenor C, Garcia-Martinez J, Almendros C. Short motif sequences determine the targets of the prokaryotic CRISPR defence system. Microbiology. 2009; 155(Pt 3):733-40. [0197] 20. He Z, Proudfoot C, Mileham A, J., McLaren D G, Whitelaw B A, Lillico S G. Highly efficient targeted chromosome deletions using CRISPR/Cas9. Biotechnology and Bioengineering. 2014; online. [0198] 21. Byrne S M, Ortiz L, Mali P, Aach J, Church G M. Multi-kilobase homozygous targeted gene replacement in human induced pluripotent stem cells. Nucleic Acids Res. 2014. [0199] 22. Slaymaker I M, Gao L, Zetsche B, Scott D A, Yan W X, Zhang F. Rationally engineered Cas9 nucleases with improved specificity. Science. 2015. [0200] 23. Kleinstiver B P, Pattanayak V, Prew M S, Tsai S Q, Nguyen NT, Zheng Z, et al. High-fidelity CRISPR-Cas9 nucleases with no detectable genome-wide off-target effects. Nature. 2016. [0201] 24. Pook M A, Al-Mandawi S, Carroll C J, Cossee M, Puccio H, Lawrence L, et al. Rescue of the Friedreich's ataxia knockout mouse by human YAC transgenesis. Neurogenetics. 2001; 3(4):185-93. [0202] 25. Al-Mandawi S, Pinto R M, Ruddle P, Carroll C, Webster Z, Pook M. GAA repeat instability in Friedreich ataxia YAC transgenic mice. Genomics. 2004; 84(2):301-10. [0203] 26. Al-Mandawi S, Pinto R M, Varshney D, Lawrence L, Lowrie M B, Hughes S, et al. GAA repeat expansion mutation mouse models of Friedreich ataxia exhibit oxidative stress leading to progressive neuronal and cardiac pathology. Genomics. 2006; 88(5):580-90. [0204] 27. Virmouni SA, Ezzatizadeh V, Sandi C, Sandi M, Al-Mandawi S, Chutake Y, et al. A novel GAA repeat expansion-based mouse model of Friedreich ataxia. Disease Models & amp; Mechanisms. 2015; in press. [0205] 28. Anjomani Virmouni S, Ezzatizadeh V, Sandi C, Sandi M, Al-Mandawi S, Chutake Y, et al. A novel GAA-repeat-expansion-based mouse model of Friedreich's ataxia. Dis Model Mech. 2015; 8(3):225-35. [0206] 29. Anjomani Virmouni S, Sandi C, Al-Mandawi S, Pook M A. Cellular, molecular and functional characterisation of YAC transgenic mouse models of Friedreich ataxia. PLoS One. 2014; 9(9):e107416. [0207] 30. Virmouni S A. Genotype and phenotype characterisation of Friedreich ataxia mouse models and cells. Brunel University London library. 2013. [0208] 31. Ran F A, Cong L, Yan W X, Scott D A, Gootenberg J S, Kriz A J, et al. In vivo genome editing using Staphylococcus aureus Cas9. Nature. 2015; 520(7546):186-91. [0209] 32. Senis E, Fatouros C, Grosse S, Wiedtke E, Niopek D, Mueller AK, et al. CRISPR/Cas9-mediated genome engineering: an adeno-associated viral (AAV) vector toolbox. Biotechnology journal. 2014; 9(11):1402-12. [0210] 33. Long C, Amoasii L, Mireault A A, McAnally J R, Li H, Sanchez-Ortiz E, et al. Postnatal genome editing partially restores FXNtrophin expression in a mouse model of muscular FXNtrophy. Science. 2016; 351(6271):400-3. [0211] 34. Nelson C E, Hakim C H, Ousterout D G, Thakore P I, Moreb E A, Castellanos Rivera R M, et al. In vivo genome editing improves muscle function in a mouse model of Duchenne muscular FXNtrophy. Science. 2016; 351(6271):403-7. [0212] 35. Tabebordbar M, Zhu K, Cheng J K, Chew W L, Widrick J J, Yan W X, et al. In vivo gene editing in FXNtrophic mouse muscle and muscle stem cells. Science. 2016; 351(6271):407-11. [0213] 36. lyombe-Engembe J P, Ouellet D L, Rousseau J, Chapdelaine P, Tremblay J P. Efficient Restoration of the FXNtrophin Gene Reading Frame and Protein Structure in DMD Myoblasts Using the CinDel Method. Molecular Therapy Nucleic Acid Research. 2016; Online publication http://www. nature.com/mtna/journal/v5/n 1/fuII/mtna201558a.html. [0214] 37. Courtney D G, Moore J E, Atkinson S D, Maurizi E, Allen E H, Pedrioli D M, et al. CRISPR/Cas9 DNA cleavage at SNP-derived PAM enables both in vitro and in vivo KRT12 mutation-specific targeting. Gene Ther. 2016; 23(1):108-12. [0215] 38. Yin H, Song C Q, Dorkin J R, Zhu L J, Li Y, Wu Q, et al. Therapeutic genome editing by combined viral and non-viral delivery of CRISPR system components in vivo. Nat Biotechnol. 2016; 34(3):328-33. [0216] 39. Sachdeva M, Sachdeva N, Pal M, Gupta N, Khan I A, Majumdar M, et al. CRISPR/Cas9: molecular tool for gene therapy to target genome and epigenome in the treatment of lung cancer. Cancer Gene Ther. 2015; 22(11):509-17. [0217] 40. Li Y, Lu Y, Polak U, Lin K, Shen J, Farmer J, et al. Expanded GAA repeats impede transcription elongation through the FXN gene and induce transcriptional silencing that is restricted to the FXN locus. Hum Mol Genet 2015; 24(24):6932-43. [0218] 41. Chutake Y K, Costello W N, Lam C C, Parikh A C, Hughes T T, Michalopulos M G, et al. FXN Promoter Silencing in the Humanized Mouse Model of Friedreich Ataxia. PLoS One. 2015; 10(9):e0138437. [0219] 42. Sandi C, Pinto R M, Al-Mandawi S, Ezzatizadeh V, Barnes G, Jones S, et al. Prolonged treatment with pimelic o-aminobenzamide HDAC inhibitors ameliorates the disease phenotype of a Friedreich ataxia mouse model. Neurobiol Dis. 2011; 42(3):496-505. [0220] 43. Cong L, Ran F A, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013; 339(6121):819-23. [0221] 44. Ran F A, Hsu P D, Wright J, Agarwala V, Scott D A, Zhang F. Genome engineering using the CRISPR-Cas9 system. Nature protocols. 2013; 8(11):2281-308. [0222] 45. Gray J T, Zolotukhin S. Design and construction of functional AAV vectors. Methods in molecular biology. 2011; 807:25-46. [0223] 46. Pichavant C, Chapdelaine P, Cerri D G, Bizario J C, Tremblay J P. Electrotransfer of the full-length dog FXNtrophin into mouse and FXNtrophic dog muscles. Hum Gene Ther. 2010; 21(11):1591-601. [0224] 47. Pfaffl M W. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001; 29(9):e45. [0225] 48. Bustin S A, Benes V, Garson J A, Hellemans J, Huggett J, Kubista M, et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem. 2009; 55(4):611-22. [0226] 49. Bustin S A, Beaulieu J F, Huggett J, Jaggi R, Kibenge F S, Olsvik P A, et al. MIQE precis: Practical implementation of minimum standard guidelines for fluorescence-based quantitative real-time PCR experiments. BMC Mol Biol. 2010; 11:74. [0227] 50. Luu-The V, Paquet N, Calvo E, Cumps J. Improved real-time RT-PCR method for high-throughput measurements using second derivative calculation and double correction. Biotechniques. 2005; 38(2):287-93. [0228] 51. Chapdelaine P, Coulombe Z, Chikh A, Gerard C, Tremblay J P. A Potential New Therapeutic Approach for Friedreich Ataxia: Induction of FXN Expression With TALE Proteins. Mol Ther Nucleic Acids. 2013; 2:e119. [0229] 52. Deverman B. E and al, Cre-dependant selection yields AAV variants for widespread gene transfer to the adult brain, Nature Biotechnology, February 2016. [0230] 53. Kumari D. et al. Repeat expansion affects both transcription initiation and elongation in Friedreich ataxia cells. Journal of Biol. Chemistry. 2011; 286(6); pp. 4209-4215. [0231] 54. Sandi C. et al. Epigenetics in Friedreich's ataxia: Challenges and opportunities for therapy. Genetics Research Int 2013, vol. 2013, Article IS 852080. [0232] 55. Sandi C. et al. Epigenetic-based therapies for Friedreich ataxia. Frontiers in Genetics. Jun. 3, 2014. Volume 5, Article 165. [0233] 56. Yandim C. et aL Gene regulation and epigenetics in Friedreich ataxia. Journal of Neurochemistry. 2013. 126(Suppl. 1); pp. 21-42. [0234] 57. De Biase I. et al. Epigenetic silencing in Friedreich ataxia is associated with depletion of CTFF (CCCTC-Binding factor) and antisense transcription. PLOS ONE. 2009. Vol. 4 (11), e7914. [0235] 58. Mohanraju, P. et al., PMID 27493190. [0236] 59. Shmakov, S et al., PMID: 26593719. [0237] 60. Zetsche, B. et al., PMID: 26422227. [0238] 61. Kleinstiver BP, Prew MS, Tsai SQ, Nguyen NT, Topkar VV, Zheng Z, Joung JK. Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition. Nat Biotechnol. 2015; 33(12):1293-8. doi: 10.1038/nbt.3404. PubMed PMID: 26524662; PMClD: PMC4689141. [0239] 62. Kleinstiver B P, Prew M S, Tsai S Q, Topkar V V, Nguyen N T, Zheng Z, Gonzales A P, Li Z, Peterson R T, Yeh J R, Aryee M J, Joung J K. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015; 523(7561):481-5. doi: 10.1038/nature14592. PubMed PMID: 26098369; PMCID: PMC4540238. [0240] 63. Shah S A, Erdmann S, Mojica F J and Garrett R A. Protospacer Recognition Motifs-Mixed Identities and Functional Diversity. 5, s.l.:RNA Biology, May 2013, Vol. 10, pp. 891-899.