MODIFIED CASCADE RIBONUCLEOPROTEINS AND USES THEREOF
20210171926 · 2021-06-10
Assignee
Inventors
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12N15/74
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
C07K2319/60
CHEMISTRY; METALLURGY
A61K48/005
HUMAN NECESSITIES
C12N15/66
CHEMISTRY; METALLURGY
C07K2319/80
CHEMISTRY; METALLURGY
C12N15/70
CHEMISTRY; METALLURGY
C12N15/82
CHEMISTRY; METALLURGY
C07K2319/71
CHEMISTRY; METALLURGY
C12Y301/21004
CHEMISTRY; METALLURGY
C12N15/86
CHEMISTRY; METALLURGY
International classification
C12N9/22
CHEMISTRY; METALLURGY
A61K48/00
HUMAN NECESSITIES
C12N15/66
CHEMISTRY; METALLURGY
C12N15/70
CHEMISTRY; METALLURGY
C12N15/74
CHEMISTRY; METALLURGY
C12N15/82
CHEMISTRY; METALLURGY
C12N15/86
CHEMISTRY; METALLURGY
C12N15/90
CHEMISTRY; METALLURGY
Abstract
A clustered regularly interspaced short palindromic repeat (CRISPR)-associated complex for adaptive antiviral defence (Cascade); the Cascade protein complex comprising at least CRISPR-associated protein subunits Cas7, Cas5 and Cash which includes at least one subunit with an additional amino acid sequence possessing nucleic acid or chromatin modifying, visualising, transcription activating or transcription repressing activity. The Cascade complex with additional activity is combined with an RNA molecule to produce a ribonucleoprotein complex. The RNA molecule is selected to have substantial complementarity to a target sequence. Targeted ribonucleoproteins can be used as genetic engineering tools for precise cutting of nucleic acids in homologous recombination, non-homologous end joining, gene modification, gene integration, mutation repair or for their visualisation, transcriptional activation or repression. A pair of ribonucleotides fused to FokI dimers may be used to generate double-strand breakages in the DNA to facilitate these applications in a sequence-specific manner.
Claims
1.-46. (canceled)
47. A method of modifying a target nucleic acid comprising: contacting the target nucleic acid with a Type I CRISPR composition comprising: a Type I CASCADE protein complex comprising a Cse1 subunit protein, having an N-terminus and a C-terminus, a Cas3 mutant protein, and a CRISPR-derived RNA (crRNA) molecule comprising a spacer sequence complementary to the target nucleic acid; wherein the Cas3 mutant protein comprises an Escherichia coli Cas3 mutant protein and the Cas3 mutant protein is fused to the N-terminus of the Cse1 subunit protein by a linker polypeptide.
48. The method of claim 47, wherein the method of modifying a target nucleic acid is carried out in vitro in a cell-free environment.
49. The method of claim 47, wherein the linker polypeptide comprises a Streptomyces griseus linker polypeptide.
50. The method of claim 47, wherein the Type I CRISPR composition further comprises a nuclear localization signal.
51. The method of claim 47, wherein the Cas3 mutant protein comprises one or more mutation in the helicase domain.
52. The method of claim 47, wherein the Cas3 mutant protein comprises one or more mutation in the nuclease domain.
53. The method of claim 47, wherein the method of modifying a target nucleic results in binding of the target nucleic acid.
54. The method of claim 47, wherein the method of modifying a target nucleic results in cleaving of the target nucleic acid.
55. The method of claim 54, wherein the cleaving introduces a nick in the target nucleic acid.
56. The method of claim 47, wherein the spacer sequence is 32 residues long.
57. The method of claim 47, wherein the target nucleic acid comprises a double-stranded DNA (dsDNA).
58. The method of claim 57, wherein the dsDNA is plasmid DNA.
59. The method of claim 47, wherein the Escherichia coli Cas3 mutant protein is selected form the group consisting of Cas3 H74A, Cas3 D75A, Cas3 K78A, Cas3 K320N, Cas3 D452N, and Cas3 S483A/T485A.
60. The method of claim 59, wherein the Escherichia coli Cas3 mutant protein comprises Cas3 K320N.
61. The method of claim 47, wherein the Type I CASCADE protein complex further comprises a Type I CRISPR associated subunit protein selected from the group consisting of a Cas7 subunit protein, a Cas5 subunit protein, and a Cas6 subunit protein.
62. The method of claim 61, further comprising a Type I CRISPR associated Cse2 subunit protein.
63. The method of claim 62, wherein the Type I CRISPR associated subunit proteins have a stoichiometry selected from the group consisting of Cse1.sub.1Cse2.sub.2Cas7.sub.6Cas5.sub.1Cas6.sub.1 and Cse1.sub.1Cse2.sub.2Cas7.sub.6Cas5.sub.1Cas6e.sub.1.
64. The method of claim 47, wherein the CRISPR-derived RNA (crRNA) molecule has a length between 35 to 75 nucleotides.
65. The method of claim 64, wherein the CRISPR-derived RNA (crRNA) molecule has a length of 61 nucleotides.
Description
[0115] The invention will now be described in detail and with reference to examples and drawings in which:
[0116]
[0117]
[0118]
[0119]
[0120]
[0121]
[0122]
[0123]
[0124]
[0125]
[0126]
[0127]
[0128]
[0129]
[0130]
EXAMPLES—MATERIALS AND METHODS USED
[0131] Strains, Gene Cloning, Plasmids and Vectors
[0132] E. coli BL21-AI and E. coli BL21 (DE3) strains were used throughout. Table 1 lists all plasmids used in this study. The previously described pWUR408, pWUR480, pWUR404 and pWUR547 were used for production of Strep-tag II R44-Cascade, and pWUR408, pWUR514 and pWUR630 were used for production of Strep-tag II J3-Cascade (Jore et al., (2011) Nature Structural & Molecular Biology 18, 529-536; Semenova et al., (2011) Proceedings of the National Academy of Sciences of the United States of America 108, 10098-10103.) pUC-λ (pWUR610) and pUC-p7 (pWUR613) have been described elsewhere (Jore et al., 2011; Semenova et al., 2011). The C85Venus protein is encoded by pWUR647, which corresponds to pET52b (Novagen) containing the synthetic GA1070943 construct (Table 2) (Geneart) cloned between the BamHI and NotI sites. The N155Venus protein is encoded by pWUR648, which corresponds to pRSF1b (Novagen) containing the synthetic GA1070941 construct (Table 2) (Geneart) cloned between the NotI and XhoI sites. The Cas3-C85Venus fusion protein is encoded by pWUR649, which corresponds to pWUR647 containing the Cas3 amplification product using primers BG3186 and BG3213 (Table 3) between the NcoI and BamHI sites. The CasA-N155Venus fusion protein is encoded by pWUR650, which corresponds to pWUR648 containing the CasA amplification product using primers BG3303 and BG3212 (Table 3) between the NcoI and BamHI sites. CRISPR 7Tm is encoded by pWUR651, which corresponds to pACYCDuet-1 (Novagen) containing the synthetic GA1068859 construct (Table 2) (Geneart) cloned between the NcoI and KpnI sites. The Cascade encoding pWUR400, the CascadeΔCse1 encoding WUR401 and the Cas3 encoding pWUR397 were described previously (Jore et al., 2011). The Cas3H74A encoding pWUR652 was constructed using site directed mutagenesis of pWUR397 with primers BG3093, BG3094 (Table 3).
TABLE-US-00003 TABLE 1 Plasmids used Description and order Restriction Plasmids of genes (5’-3’) sites Primers Source pWUR397 cas3 in pR5F-1b, no 1 tags pWUR400 casA-casB-casC-casD- 1 casE in pCDF-1b, no tags pWUR401 casB-casC-casD-casE 1 in pCDF-1b, no tags pWUR404 casE in pCDF-1b, no 1 tags pWUR408 casA in pRSF-1b, no 1 tags pWUR480 casB with Strep-tag II 1 (N-term)-casC-casD in pET52b pWUR514 casB with Strep-tag II 2 (N-term)-casC-casD- CasE in pET52b pWUR547 E. coli R44 CRISPR, 7x 2 spacer nr. 2, in pACYCDuet-1 pWUR613 pUC-p7; pUC19 2 containing R44- protospacer on a 350 bp phage P7 amplicon pWUR630 CRISPR poly J3, 5x NcoI/KpnI This spacer J3 in study pACYCDuet-1 pWUR610 pUC-λ.; pUC19 3 containing J3- protospacer on a 350 bp phage λ amplicon pWUR647 C85Venus; GA1070943 BamHI/ This (Table S1) in pET52b NotI study pWUR648 N155Venus; NotI/XhoI This GA1070941 (Table S1) study in pRSF 1b pWUR649 cas3-C85Venus; NcoI/ BG3186 + This pWUR647 containing BamHI BG3213 study cas3 amplicon pWUR650 casA-N155Venus NcoI/NotI BG3303 + This pWUR648 containing BG3212 study casA amplicon pWUR651 CRISPR 7Tm; NcoI/KpnI This GA1068859 (Table S1) study in pACYCDuet-1 casB with Strep-tag II This (N-term)-casC-casD- study CasE in pCDF-1b cas3-casA fusion This study cas3H74A-CasA fusion This study cas3D75A-CasA fusion This study cas3K320N-CasA This fusion study cas3D452N-CasA This fusion study
[0133] Source 1 in the table above is Brouns et al (2008) Science 321, 960-964.
[0134] Source 2 in the table above is Jore et al (2011) Nature Structural & Molecular Biology 18: 529-537.
TABLE-US-00004 TABLE 2 Synthetic Constructs GA1070943 ACTGGAAAGCGGGCAGTGAAAGGAAGGCCCATGAGGCCAGTTAATTAAGC GGATCCTGGCGGCGGCAGCGGCGGCGGCAGCGACAAGCAGAAGAACGGCA TCAAGGCGAACTTCAAGATCCGCCACAACATCGAGGACGGCGGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAGCGGCCGCGGCGCG CCTAGGCCTTGACGGCCTTCCTTCAATTCGCCCTATAGTGAG [SEQ ID NO: 6] GA1070941 CACTATAGGGCGAATTGGCGGAAGGCCGTCAAGGCCGCATTTAATTAAGC GGCCGCAGGCGGCGGCAGCGGCGGCGGCAGCATGGTGAGCAAGGGCGAGG AGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTA AACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTA CGGCAAGCTGACCCTGAAGCTCATCTGCACCACCGGCAAGCTGCCCGTGC CCTGGCCCACCCTCGTGACCACCCTCGGCTACGGCCTGCAGTGCTTCGCC CGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCC CGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACT ACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGC ATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCA CAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCACGGCCTAAC TCGAGGGCGCGCCCTGGGCCTCATGGGCCTTCCGCTCACTGCCCGCTTTC CAG [SEQ ID NO: 7] GA1068859 CACTATAGGGCGAATTGGCGGAAGGCCGTCAAGGCCGCATGAGCTCCATG GAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGG TTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACT TTTAGAGAGTTCCCCGCGCCAGCGGGGATAAACCGGGCCGATTGAAGGTC CGGTGGATGGCTTAAAAGAGTTCCCCGCGCCAGCGGGGATAAACCGCCGC AGGTACAGCAGGTAGCGCAGATCATCAAGAGTTCCCCGCGCCAGCGGGGA TAAACCGACTTCTCTCCGAAAAGTCAGGACGCTGTGGCAGAGTTCCCCGC GCCAGCGGGGATAAACCGCCTACGCGCTGAACGCCAGCGGTGTGGTGAAT GAGTTCCCCGCGCCAGCGGGGATAAACCGGTGTGGCCATGCACGCCTTTA ACGGTGAACTGGAGTTCCCCGCGCCAGCGGGGATAAACCGCACGAACTCA GCCAGAACGACAAACAAAAGGCGAGTTCCCCGCGCCAGCGGGGATAAACC GGCACCAGTACGCGCCCCACGCTGACGGTTTCTGAGTTCCCCGCGCCAGC GGGGATAAACCGCAGCTCCCATTTTCAAACCCAGGTACCCTGGGCCTCAT GGGCCTTCCGCTCACTGCCCGCTTTCCAG [SEQ ID NO: 8] GA1047360 GAGCTCCCGGGCTGACGGTAATAGAGGCACCTACAGGCTCCGGTAAAACG GAAACAGCGCTGGCCTATGCTTGGAAACTTATTGATCAACAAATTGCGGA TAGTGTTATTTTTGCCCTCCCAACACAAGCTACCGCGAATGCTATGCTTA CGAGAATGGAAGCGAGCGCGAGCCACTTATTTTCATCCCCAAATCTTATT CTTGCTCATGGCAATTCACGGTTTAACCACCTCTTTCAATCAATAAAATC ACGCGCGATTACTGAACAGGGGCAAGAAGAAGCGTGGGTTCAGTGTTGTC AGTGGTTGTCACAAAGCAATAAGAAAGTGTTTCTTGGGCAAATCGGCGTT TGCACGATTGATCAGGTGTTGATTTCGGTATTGCCAGTTAAACACCGCTT TATCCGTGGTTTGGGAATTGGTAGATCTGTTTTAATTGTTAATGAAGTTC ATGCTTACGACACCTATATGAACGGCTTGCTCGAGGCAGTGCTCAAGGCT CAGGCTGATGTGGGAGGGAGTGTTATTCTTCTTTCCGCAACCCTACCAAT GAAACAAAAACAGAAGCTTCTGGATACTTATGGTCTGCATACAGATCCAG TGGAAAATAACTCCGCATATCCACTCATTAACTGGCGAGGTGTGAATGGT GCGCAACGTTTTGATCTGCTAGCGGATCCGGTACC [SEQ ID NO: 9]
TABLE-US-00005 TABLE 3 Primers BG3186 ATAGCGCCATGGAACCTTTTAAATATATATGCCATTA [SEQ ID NO: 10] BG3213 ACAGTGGGATCCGCTTTGGGATTTGCAGGGATGACTC TGGT [SEQ ID NO: 11] BG3303 ATAGCGTCATGAATTTGCTTATTGATAACTGGATTCC TGTACG [SEQ ID NO: 12] BG3212 ACAGTGGCGGCCGCGCCATTTGATGGCCCTCCTTGCG GTTTTAA [SEQ ID NO: 13] BG3076 CGTATATCAAACTTTCCAATAGCATGAAGAGCAATGA AAAATAAC [SEQ ID NO: 14] BG3449 ATGATACCGCGAGACCCACGCTC [SEQ ID NO: 15] BG3451 CGGATAAAGTTGCAGGACCACTTC [SEQ ID NO: 16]
[0135] Protein Production and Purification
[0136] Cascade was expressed and purified as described (Jore et al., 2011). Throughout purification a buffer containing 20 mM HEPES pH 7.5, 75 mM NaCl, 1 mM DTT, 2 mM EDTA was used for resuspension and washing. Protein elution was performed in the same buffer containing 4 mM desthiobiotin. The Cascade-Cas3 fusion complex was expressed and purified in the same manner, with washing steps being performed with 20 mM HEPES pH 7.5, 200 mM NaCl and 1 mM DTT, and elution in 20 mM HEPES pH 7.5, 75 mM NaCl, 1 mM DTT containing 4 mM desthiobiotin.
[0137] Electrophoretic Mobility Shift Assay
[0138] Purified Cascade or Cascade subsomplexes were mixed with pUC-λ in a buffer containing 20 mM HEPES pH 7.5, 75 mM NaCl, 1 mM DTT, 2 mM EDTA, and incubated at 37° C. for 15 minutes. Samples were run overnight on a 0.8% TAE Agarose gel and post-stained with SybR safe (Invitrogen) 1:10000 dilution in TAE for 30 minutes. Cleavage with BsmI (Fermentas) or nicking with Nt.BspQI (New England Biolabs) was performed in the HEPES reaction buffer supplemented with 5 mM MgCl.sub.2.
[0139] Scanning Force Microscopy
[0140] Purified Cascade was mixed with pUC-λ. (at a ratio of 7:1, 250 nM Cascade, 35 nM DNA) in a buffer containing 20 mM HEPES pH 7.5, 75 mM NaCl, 0.2 mM DTT, 0.3 mM EDTA and incubated at 37° C. for 15 minutes. Subsequently, for AFM sample preparation, the incubation mixture was diluted 10× in double distilled water and MgCl.sub.2 was added at a final concentration of 1.2 mM. Deposition of the protein-DNA complexes and imaging was carried out as described before (Dame et al., (2000) Nucleic Acids Res. 28: 3504-3510).
[0141] Fluorescence Microscopy
[0142] BL21-AI cells carrying CRISPR en cas gene encoding plasmids, were grown overnight at 37° C. in Luria-Bertani broth (LB) containing ampicillin (100 μg/ml), kanamycin (50 μg/ml), streptomycin (50 μg/ml) and chloramphenicol (34 μg/ml). Overnight culture was diluted 1:100 in fresh antibiotic-containing LB, and grown for 1 hour at 37° C. Expression of cas genes and CRISPR was induced for 1 hour by adding L-arabinose to a final concentration of 0.2% and IPTG to a final concentration of 1 mM. For infection, cells were mixed with phage Lambda at a Multiplicity of Infection (MOI) of 4. Cells were applied to poly-L-lysine covered microscope slides, and analyzed using a Zeiss LSM510 confocal laser scanning microscope based on an Axiovert inverted microscope, with a 40× oil immersion objective (N.A. of 1.3) and an argon laser as the excitation source (514 nm) and detection at 530-600 nm. The pinhole was set at 203 μm for all measurements.
[0143] pUC-λ Transformation Studies
[0144] LB containing kanamycin (50 μg/ml), streptomycin (50 μg/ml) and chloramphenicol (34 μg/ml) was inoculated from an overnight pre-inoculum and grown to an OD.sub.600 of 0.3. Expression of cas genes and CRISPR was induced for 45 minutes with 0.2% L-arabinose and 1 mM IPTG. Cells were collected by centrifugation at 4° C. and made competent by resuspension in ice cold buffer containing 100 mM RbCl.sub.2, 50 mM MnCl.sub.2, 30 mM potassium acetate, 10 mM CaCl.sub.2 and 15% glycerol, pH 5.8. After a 3 hour incubation, cells were collected and resuspended in a buffer containing 10 mM MOPS, 10 mM RbCl, 75 mM CaCl.sub.2, 15% glycerol, pH 6.8. Transformation was performed by adding 80 ng pUC-λ, followed by a 1 minute heat-shock at 42° C., and 5 minute cold-shock on ice. Next cells were grown in LB for 45 minutes at 37° C. before plating on LB-agar plates containing 0.2% L-arabinose, 1 mM IPTG, ampicillin (100 μg/ml), kanamycin (50 μg/ml), streptomycin (50 μg/ml) and chloramphenicol (34 μg/ml).
[0145] Plasmid curing was analyzed by transforming BL21-AI cells containing cas gene and CRISPR encoding plasmids with pUC-λ, while growing the cells in the presence of 0.2% glucose to suppress expression of the T7-polymerase gene. Expression of cas genes and CRISPR was induced by collecting the cells and re-suspension in LB containing 0.2% arabinose and 1 mM IPTG. Cells were plated on LB-agar containing either streptomycin, kanamycin and chloramphenicol (non-selective for pUC-λ) or ampicillin, streptomycin, kanamycin and chloramphenicol (selective for pUC-λ). After overnight growth the percentage of plasmid loss can be calculated from the ratio of colony forming units on the selective and non-selective plates.
[0146] Phage Lambda Infection Studies
[0147] Host sensitivity to phage infection was tested using a virulent phage Lambda (λ.sub.vir), as in (Brouns et al (2008) Science 321, 960-964.). The sensitivity of the host to infection was calculated as the efficiency of plaquing (the plaque count ratio of a strain containing an anti-λ CRISPR to that of the strain containing a non-targeting R44 CRISPR) as described in Brouns et al (2008).
Example 1—Cascade Exclusively Binds Negatively Supercoiled Target DNA
[0148] The 3 kb pUC19-derived plasmid denoted pUC-λ, contains a 350 bp DNA fragment corresponding to part of the J gene of phage λ, which is targeted by J3-Cascade (Cascade associated with crRNA containing spacer J3 (Westra et al (2010) Molecular Microbiology 77, 1380-1393). The electrophoretic mobility shift assays show that Cascade has high affinity only for negatively supercoiled (nSC) target plasmid. At a molar ratio of J3-Cascade to pUC-λ of 6:1 all nSC plasmid was bound by Cascade, (see
[0149] J3-Cascade was unable to bind relaxed target DNA with measurable affinity, such as nicked (see
[0150] To distinguish non-specific binding from specific binding, the BsmI restriction site located within the protospacer was used. While adding BsmI enzyme to pUC-λ gives a linear product in the presence of R44-Cascade (see
[0151] Cascade binding to nSC pUC-λ was followed by nicking with Nt.BspQI, giving rise to an OC topology. Cascade is released from the plasmid after strand nicking, as can be seen from the absence of a mobility shift (see
Example 2—Cascade Induces Bending of Bound Target DNA
[0152] Complexes formed between purified Cascade and pUC-λ were visualized. Specific complexes containing a single bound J3-Cascade complex were formed, while unspecific R44-Cascade yields no DNA bound complexes in this assay under identical conditions. Out of 81 DNA molecules observed 76% were found to have J3-Cascade bound (see
Example 3—Naturally Occurring Fusions of Cas3 and Cse1: Cas3 Interacts with Cascade Upon Protospacer Recognition
[0153]
Example 4—Bimolecular Fluorescence Complementation (BiFC) Shows how a Cse1 Fusion Protein Forming Part of Cascade Continues to Interact with Cas3
[0154] BiFC experiments were used to monitor interactions between Cas3 and Cascade in vivo before and after phage λ infection. BiFC experiments rely on the capacity of the non-fluorescent halves of a fluorescent protein, e.g., Yellow Fluorescent Protein (YFP) to refold and to form a fluorescent molecule when the two halves occur in close proximity. As such, it provides a tool to reveal protein-protein interactions, since the efficiency of refolding is greatly enhanced if the local concentrations are high, e.g., when the two halves of the fluorescent protein are fused to interaction partners. Cse1 was fused at the C-terminus with the N-terminal 155 amino acids of Venus (Cse1-N155Venus), an improved version of YFP (Nagai et al (2002) Nature Biotechnology 20, 87-90). Cas3 was C-terminally fused to the C-terminal 85 amino acids of Venus (Cas3-C85Venus).
[0155] BiFC analysis reveals that Cascade does not interact with Cas3 in the absence of invading DNA (
[0156] These results also show quite clearly that a fusion of Cse1 with an heterologous protein does not disrupt the ribonucleoprotein formation of Cascade and crRNA, nor does it disrupt the interaction of Cascade and Cas3 with the target phage DNA, even when the Cas3 itself is also a fusion protein.
Example 5—Preparing a Designed Cas3-Cse1 Fusion Gives a Protein with In Vivo Functional Activity
[0157] Providing in vitro evidence for Cas3 DNA cleavage activity required purified and active Cas3. Despite various solubilization strategies, Cas3 overproduced (Howard et al (2011) Biochem. J. 439, 85-95) in E. coli BL21 is mainly present in inactive aggregates and inclusion bodies. Cas3 was therefore produced as a Cas3-Cse1 fusion protein, containing a linker identical to that of the Cas3-Cse1 fusion protein in S. griseus (see
[0158] Since the J3-Cascade-Cas3 fusion-complex was functional in vivo, in vitro DNA cleavage assays were carried out using this complex. When J3-Cascade-Cas3 was incubated with pUC-λ in the absence of divalent metals, plasmid binding was observed at molar ratios similar to those observed for Cascade (
[0159] Interestingly, the J3-Cascade-Cas3 fusion complex displays magnesium dependent endonuclease activity on nSC target plasmids. In the presence of 10 mM Mg.sup.2+ J3-Cascade-Cas3 nicks nSC pUC-λ (
[0160] The inventors have found that Cascade alone is unable to bind protospacers on relaxed DNA. In contrast, the inventors have found that Cascade efficiently locates targets in negatively supercoiled DNA, and subsequently recruits Cas3 via the Cse1 subunit. Endonucleolytic cleavage by the Cas3 HD-nuclease domain causes spontaneous release of Cascade from the DNA through the loss of supercoiling, remobilizing Cascade to locate new targets. The target is then progressively unwound and cleaved by the joint ATP-dependent helicase activity and HD-nuclease activity of Cas3, leading to complete target DNA degradation and neutralization of the invader.
[0161] Referring to
Example 6—Preparation of Artificial Cas-Strep Tag Fusion Proteins and Assembly of Cascade Complexes
[0162] Cascade complexes are produced and purified as described in Brouns et al (2008) Science 321: 960-4 (2008), using the expression plasmids listed in Supplementary Table 3 of Jore et al (2011) Nature Structural & Molecular Biology 18: 529-537. Cascade is routinely purified with an N-terminal Strep-tag II fused to CasB (or CasC in CasCDE). Size exclusion chromatography (Superdex 200 HR 10/30 (GE)) is performed using 20 mM Tris-HCl (pH 8.0), 0.1 M NaCl, 1 mM dithiotreitol. Cascade preparations (˜0.3 mg) are incubated with DNase I (Invitrogen) in the presence of 2.5 mM MgCl.sub.2 for 15 min at 37° C. prior to size exclusion analysis. Co-purified nucleic acids are isolated by extraction using an equal volume of phenol:chloroform:isoamylalcohol (25:24:1) pH 8.0 (Fluka), and incubated with either DNase I (Invitrogen) supplemented with 2.5 mM MgCl.sub.2 or RNase A (Fermentas) for 10 min at 37° C. Cas subunit proteins fused to the amino acid sequence of Strep-Tag are produced.
[0163] Plaque assays showing the biological activity of the Strep-Tag Cascade subunits are performed using bacteriophage Lambda and the efficiency of plaquing (EOP) was calculated as described in Brouns et al (2008).
[0164] For purification of crRNA, samples are analyzed by ion-pair reversed-phased-HPLC on an Agilent 1100 HPLC with UV.sub.260 nm detector (Agilent) using a DNAsep column 50 mm×4.6 mm I. D. (Transgenomic, San Jose, Calif.). The chromatographic analysis is performed using the following buffer conditions: A) 0.1 M triethylammonium acetate (TEAA) (pH 7.0) (Fluka); B) buffer A with 25% LC MS grade acetonitrile (v/v) (Fisher). crRNA is obtained by injecting purified intact Cascade at 75° C. using a linear gradient starting at 15% buffer B and extending to 60% B in 12.5 min, followed by a linear extension to 100% B over 2 min at a flow rate of 1.0 ml/min. Hydrolysis of the cyclic phosphate terminus was performed by incubating the HPLC-purified crRNA in a final concentration of 0.1 M HCl at 4° C. for 1 hour. The samples are concentrated to 5-10 μl on a vacuum concentrator (Eppendorf) prior to ESI-MS analysis.
[0165] Electrospray Ionization Mass spectrometry analysis of crRNA is performed in negative mode using an UHR-TOF mass spectrometer (maXis) or an HCT Ultra PTM Discovery instrument (both Bruker Daltonics), coupled to an online capillary liquid chromatography system (Ultimate 3000, Dionex, UK). RNA separations are performed using a monolithic (PS-DVB) capillary column (200 μm×50 mm I.D., Dionex, UK). The chromatography is performed using the following buffer conditions: C) 0.4 M 1,1,1,3,3,3,-Hexafluoro-2-propanol (HFIP, Sigma-Aldrich) adjusted with triethylamine (TEA) to pH 7.0 and 0.1 mM TEAA, and D) buffer C with 50% methanol (v/v) (Fisher). RNA analysis is performed at 50° C. with 20% buffer D, extending to 40% D in 5 min followed by a linear extension to 60% D over 8 min at a flow rate of 2 μl/min.
[0166] Cascade protein is analyzed by native mass spectrometry in 0.15 M ammonium acetate (pH 8.0) at a protein concentration of 5 μM. The protein preparation is obtained by five sequential concentration and dilution steps at 4° C. using a centrifugal filter with a cut-off of 10 kDa (Millipore). Proteins are sprayed from borosilicate glass capillaries and analyzed on a LCT electrospray time-of-flight or modified quadrupole time-of-flight instruments (both Waters, UK) adjusted for optimal performance in high mass detection (see Tahallah N et al (2001) Rapid Commun Mass Spectrom 15: 596-601 (2001) and van den Heuvel, R. H. et al. Anal Chem 78: 7473-83 (2006). Exact mass measurements of the individual Cas proteins were acquired under denaturing conditions (50% acetonitrile, 50% MQ, 0.1% formic acid). Sub-complexes in solution were generated by the addition of 2-propanol to the spray solution to a final concentration of 5% (v/v). Instrument settings were as follows; needle voltage ˜1.2 kV, cone voltage ˜175 V, source pressure 9 mbar. Xenon was used as the collision gas for tandem mass spectrometric analysis at a pressure of 1.5 10.sup.−2 mbar. The collision voltage varied between 10-200 V.
[0167] Electrophoretic mobility shift assays (EMSA) are used to demonstrate the functional activity of Cascade complexes for target nucleic acids. EMSA is performed by incubating Cascade, CasBCDE or CasCDE with 1 nM labelled nucleic acid in 50 mM Tris-Cl pH 7.5, 100 mM NaCl. Salmon sperm DNA (Invitrogen) is used as competitor. EMSA reactions are incubated at 37° C. for 20-30 min prior to electrophoresis on 5% polyacrylamide gels. The gels are dried and analyzed using phosphor storage screens and a PMI phosphor imager (Bio-Rad). Target DNA binding and cleavage activity of Cascade is tested in the presence of 1-10 mM Ca, Mg or Mn-ions.
[0168] DNA targets are gel-purified long oligonucleotides (Isogen Life Sciences or Biolegio), listed in Supplementary Table 3 of Jore et al (2011). The oligonucleotides are end-labeled using γ.sup.32P-ATP (PerkinElmer) and T4 kinase (Fermentas). Double-stranded DNA targets are prepared by annealing complementary oligonucleotides and digesting remaining ssDNA with Exonuclease I (Fermentas). Labelled RNA targets are in vitro transcribed using T7 Maxiscript or T7 Mega Shortscript kits (Ambion) with α.sup.32P-CTP (PerkinElmer) and removing template by DNase I (Fermentas) digestion. Double stranded RNA targets are prepared by annealing complementary RNAs and digesting surplus ssRNA with RNase T1 (Fermentas), followed by phenol extraction.
[0169] Plasmid mobility shift assays are performed using plasmid pWUR613 containing the R44 protospacer. The fragment containing the protospacer is PCR-amplified from bacteriophage P7 genomic DNA using primers BG3297 and BG 3298 (see Supplementary Table 3 of Jore et al (2011). Plasmid (0.4 μg) and Cascade were mixed in a 1:10 molar ratio in a buffer containing 5 mM Tris-HCl (pH 7.5) and 20 mM NaCl and incubated at 37° C. for 30 minutes. Cascade proteins were then removed by proteinase K treatment (Fluka) (0.15 U, 15 min, 37° C.) followed by phenol/chloroform extraction. RNA-DNA complexes were then treated with RNaseH (Promega) (2 U, 1 h, 37° C.).
[0170] Strep-Tag-Cas protein subunit fusions which form Cascade protein complexes or active sub-complexes with the RNA component (equivalent to a crRNA), have the expected biological and functional activity of scanning and specific attachment and cleavage of nucleic acid targets. Fusions of the Cas subunits with the amino acid chains of fluorescent dyes also form Cascade complexes and sub-complexes with the RNA component (equivalent to crRNA) which retains biological and functional activity and allows visualisation of the location of a target nucleic acid sequence in ds DNA for example.
Example 7—a Cascade-Nuclease Pair and Test of Nuclease Activity In Vitro
[0171] Six mutations designated “Sharkey” have been introduced by random mutagenesis and screening to improve nuclease activity and stability of the non-specific nuclease domain from Flavobacterium okeanokoites restriction enzyme FokI (see Guo, J., et al. (2010) J. Mol. Biol. 400: 96-107). Other mutations have been introduced that reduce off-target cleavage activity. This is achieved by engineering electrostatic interactions at the FokI dimer interface of a ZFN pair, creating one FokI variant with a positively charged interface (KKR, E490K, I538K, H537R) and another with a negatively charged interface (ELD, Q486E, I499L, N496D) (see Doyon, Y., et al. (2011) Nature Methods 8: 74-9). Each of these variants is catalytically inactive as a homodimer, thereby reducing the frequency of off-target cleavage.
[0172] Cascade-Nuclease Design
[0173] We translationally fused improved FokI nucleases to the N-terminus of Cse1 to generate variants of Cse1 being FokI′-Cse1 and FokI.sup.ELD-Cse1, respectively. These two variants are co-expressed with Cascade subunits (Cse2, Cas7, Cas5 and Cas6e), and one of two distinct CRISPR plasmids with uniform spacers. This loads the Cascade.sup.KKR complex with uniform P7-crRNA, and the Cascade.sup.ELD complex with uniform M13 g8-crRNA. These complexes are purified using the N-terminally StrepII-tagged Cse2 as described in Jore, M. M., et al., (2011) Nat. Struct. Mol. Biol. 18(5): 529-536. Furthermore an additional purification step can be carried out using an N-terminally HIS-tagged FokI, to ensure purifying full length and intact Cascade-nuclease fusion complexes.
[0174] The nucleotide and amino acid sequences of the fusion proteins used in this example were as follows:
TABLE-US-00006 >nucleotide sequence of FokI-(Sharkey-ELD)-Cse1 [SEQ ID NO: 18] ATGGCTCAACTGGTTAAAAGCGAACTGGAAGAGAAAAAAAGTGAACTGCGCCACAAACTGAAA TATGTGCCGCATGAATATATCGAGCTGATTGAAATTGCACGTAATCCGACCCAGGATCGTATT CTGGAAATGAAAGTGATGGAATTTTTTATGAAAGTGTACGGCTATCGCGGTGAACATCTGGGT GGTAGCCGTAAACCGGATGGTGCAATTTATACCGTTGGTAGCCCGATTGATTATGGTGTTATT GTTGATACCAAAGCCTATAGCGGTGGTTATAATCTGCCGATTGGTCAGGCAGATGAAATGGAA CGTTATGTGGAAGAAAATCAGACCCGTGATAAACATCTGAATCCGAATGAATGGTGGAAAGTT TATCCGAGCAGCGTTACCGAGTTTAAATTCCTGTTTGTTAGCGGTCACTTCAAAGGCAACTAT AAAGCACAGCTGACCCGTCTGAATCATATTACCAATTGTAATGGTGCAGTTCTGAGCGTTGAA GAACTGCTGATTGGTGGTGAAATGATTAAAGCAGGCACCCTGACCCTGGAAGAAGTTCGTCGC AAATTTAACAATGGCGAAATCAACTTTGCGGATCCCACCAACCGCGCGAAAGGCCTGGAAGCG GTGAGCGTGGCGAGCatgaatttgcttattgataactggattcctgtacgcccgcgaaacggg gggaaagtccaaatcataaatctgcaatcgctatactgcagtagagatcagtggcgattaagt ttgccccgtgacgatatggaactggccgctttagcactgctggtttgcattgggcaaattatc gccccggcaaaagatgacgttgaatttcgacatcgcataatgaatccgctcactgaagatgag tttcaacaactcatcgcgccgtggatagatatgttctaccttaatcacgcagaacatccatta tgcagaccaaaggtgtcaaagcaaatgatgtgactccaatggaaaaactgttggctggggtaa gcggcgcgacgaattgtgcatttgtcaatcaaccggggcagggtgaagcattatgtggtggat gcactgcgattgcgttattcaaccaggcgaatcaggcaccaggtfttggtggtggtfttaaaa gcggtttacgtggaggaacacctgtaacaacgttcgtacgtgggatcgatcttcgttcaacgg tgttactcaatgtcctcacattacctcgtcttcaaaaacaatttcctaatgaatcacatacgg aaaaccaacctacctggattaaacctatcaagtccaatgagtctatacctgatcgtcaattgg gtftgtccgtggtctattctggcaaccagcgcatattgaattatgcgatcccattgggattgg taaatgttatgctgtggacaggaaagcaatttgcgttataccggttftcttaaggaaaaattt acctttacagttaatgggctatggccccatccgcattcccatgtctggtaacagtcaagaaag gggaggttgaggaaaaatttcttgctttcaccacctccgcaccatcatggacacaaatcagcc gagttgtggtagataagattattcaaaatgaaaatggaaatcgcgtggcggcggttgtgaatc aattcagaaatattgcgccgcaaagtcctcttgaattgattatggggggatatcgtaataatc aagcatctattcttgaacggcgtcatgatgtgttgatgtttaatcaggggtggcaacaatacg gcaatgtgataaacgaaatagtgactgttggtttgggatataaaacagccttacgcaaggcgt tatatacctttgcagaagggfttaaaaataaagacttcaaaggggccggagtctctgttcatg agactgcagaaaggcatttctatcgacagagtgaattattaattcccgatgtactggcgaatg ttaatttttcccaggctgatgaggtaatagctgatttacgagacaaacttcatcaattgtgtg aaatgctatttaatcaatctgtagctccctatgcacatcatcctaaattaataagcacattag cgcttgcccgcgccacgctatacaaacatttacgggagttaaaaccgcaaggagggccatcaa atggctga >protein sequence of FokI-(Sharkey-ELD)-Cse1 [SEQ ID NO: 19] MAQLVKSELEEKKSELRHKLKYVPHEYIELIEIARNPTQDRILEMKVMEFFMKVYGYRGEHLG GSRKPDGAIYTVGSPIDYGVIVDTKAYSGGYNLPIGQADEMERYVEENQTRDKHLNPNEWWKV YPSSVTEFKFLFVSGHFKGNYKAQLTRLNHITNCNGAVLSVEELLIGGEMIKAGTLTLEEVRR KFNNGEINFADPTNRAKGLEAVSVASMNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLS LPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPF MQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFK SGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVK KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRN NQASILERRHDVL1VIENQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGV SVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLI STLALARATLYKHLRELKPQGGPSNG* >nucleotide sequence of FokI-(Sharkey-KKR)-Cse1 [SEQ ID NO: 20] ATGGCTCAACTGGTTAAAAGCGAACTGGAAGAGAAAAAAAGTGAACTGCGCCACAAACTGAAA TATGTGCCGCATGAATATATCGAGCTGATTGAAATTGCACGTAATCCGACCCAGGATCGTATT CTGGAAATGAAAGTGATGGAATTTTTTATGAAAGTGTACGGCTATCGCGGTGAACATCTGGGT GGTAGCCGTAAACCGGATGGTGCAATTTATACCGTTGGTAGCCCGATTGATTATGGTGTTATT GTTGATACCAAAGCCTATAGCGGTGGTTATAATCTGCCGATTGGTCAGGCAGATGAAATGCAG CGTTATGTGAAAGAAAATCAGACCCGCAACAAACATATTAACCCGAATGAATGGTGGAAAGTT TATCCGAGCAGCGTTACCGAGTTTAAATTCCTGTTTGTTAGCGGTCACTTCAAAGGCAACTAT AAAGCACAGCTGACCCGTCTGAATCGTAAAACCAATTGTAATGGTGCAGTTCTGAGCGTTGAA GAACTGCTGATTGGTGGTGAAATGATTAAAGCAGGCACCCTGACCCTGGAAGAAGTTCGTCGC AAATTTAACAATGGCGAAATCAACTTTGCGGATCCCACCAACCGCGCGAAAGGCCTGGAAGCG GTGAGCGTGGCGAGCatgaatttgcttattgataactggattcctgtacgcccgcgaaacggg gggaaagtccaaatcataaatctgcaatcgctatactgcagtagagatcagtggcgattaagt ttgccccgtgacgatatggaactggccgctttagcactgctggtttgcattgggcaaattatc gccccggcaaaagatgacgttgaatttcgacatcgcataatgaatccgctcactgaagatgag tttcaacaactcatcgcgccgtggatagatatgttctaccttaatcacgcagaacatccatta tgcagaccaaaggtgtcaaagcaaatgatgtgactccaatggaaaaactgttggctggggtaa gcggcgcgacgaattgtgcatttgtcaatcaaccggggcagggtgaagcattatgtggtggat gcactgcgattgcgttattcaaccaggcgaatcaggcaccaggttftggtggtggttttaaaa gcggtttacgtggaggaacacctgtaacaacgttcgtacgtgggatcgatcttcgttcaacgg tgttactcaatgtcctcacattacctcgtcttcaaaaacaatttcctaatgaatcacatacgg aaaaccaacctacctggattaaacctatcaagtccaatgagtctatacctgcttcgtcaattg ggtttgtccgtggtctattctggcaaccagcgcatattgaattatgcgatcccattgggattg gtaaatgttcttgctgtggacaggaaagcaatttgcgttataccggttttcttaaggaaaaat ttacctttacagttaatgggctatggccccatccgcattccccttgtctggtaacagtcaaga aaggggaggttgaggaaaaatttcttgctttcaccacctccgcaccatcatggacacaaatca gccgagttgtggtagataagattattcaaaatgaaaatggaaatcgcgtggcggcggttgtga atcaattcagaaatattgcgccgcaaagtcctcttgaattgattatggggggatatcgtaata atcaagcatctattcttgaacggcgtcatgatgtgttgatgtttaatcaggggtggcaacaat acggcaatgtgataaacgaaatagtgactgttggtttgggatataaaacagccttacgcaagg cgttatatacctttgcagaagggtttaaaaataaagacttcaaaggggccggagtctctgttc atgagactgcagaaaggcatttctatcgacagagtgaattattaattcccgatgtactggcga atgttaatttttcccaggctgatgaggtaatagctgatttacgagacaaacttcatcaattgt gtgaaatgctatttaatcaatctgtagctccctatgcacatcatcctaaattaataagcacat tagcgcttgcccgcgccacgctatacaaacatttacgggagttaaaaccgcaaggagggccat caaatggctga >protein sequence of FokI-(Sharkey-KKR)-Cse1 [SEQ ID NO: 21] MAQLVKSELEEKKSELRHKLKYVPHEYIELIEIARNPTQDRILEMKVMEFFMKVYGYRGEHLG GSRKPDGAIYTVGSPIDYGVIVDTKAYSGGYNLPIGQADEMQRYVKENQTRNKHINPNEWWKV YPSSVTEFKFLFVSGHFKGNYKAQLTRLNRKTNCNGAVLSVEELLIGGEMIKAGTLTLEEVRR KFNNGEINFADPTNRAKGLEAVSVASMNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLS LPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPF MQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFK SGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVK KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELEVIGGYR NNQASILERRHDVL1VIENQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAG VSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKL ISTLALARATLYKHLRELKPQGGPSNG* >nuclotide sequence of His.sub.6-Dual-monopartite NLS SV40-FokI-(Sharkey-KKR)-Cse1 (“His.sub.6” disclosed as SEQ ID NO: 48) [SEQ ID NO: 22] ATGcatcaccatcatcaccacCCGAAAAAAAAGCGCAAAGTGGATCCGAAGAAAAAACGTAAA GTTGAAGATCCGAAAGACATGGCTCAACTGGTTAAAAGCGAACTGGAAGAGAAAAAAAGTGAA CTGCGCCACAAACTGAAATATGTGCCGCATGAATATATCGAGCTGATTGAAATTGCACGTAAT CCGACCCAGGATCGTATTCTGGAAATGAAAGTGATGGAATTTTTTATGAAAGTGTACGGCTAT CGCGGTGAACATCTGGGTGGTAGCCGTAAACCGGATGGTGCAATTTATACCGTTGGTAGCCCG ATTGATTATGGTGTTATTGTTGATACCAAAGCCTATAGCGGTGGTTATAATCTGCCGATTGGT CAGGCAGATGAAATGCAGCGTTATGTGAAAGAAAATCAGACCCGCAACAAACATATTAACCCG AATGAATGGTGGAAAGTTTATCCGAGCAGCGTTACCGAGTTTAAATTCCTGTTTGTTAGCGGT CACTTCAAAGGCAACTATAAAGCACAGCTGACCCGTCTGAATCGTAAAACCAATTGTAATGGT GCAGTTCTGAGCGTTGAAGAACTGCTGATTGGTGGTGAAATGATTAAAGCAGGCACCCTGACC CTGGAAGAAGTTCGTCGCAAATTTAACAATGGCGAAATCAACTTTGCGGATCCCACCAACCGC GCGAAAGGCCTGGAAGCGGTGAGCGTGGCGAGCatgaatttgcttattgataactggattcct gtacgcccgcgaaacggggggaaagtccaaatcataaatctgcaatcgctatactgcagtaga gatcagtggcgattaagtttgccccgtgacgatatggaactggccgctttagcactgctggtt tgcattgggcaaattatcgccccggcaaaagatgacgttgaatttcgacatcgcataatgaat ccgctcactgaagatgagtttcaacaactcatcgcgccgtggatagatatgttctaccttaat cacgcagaacatccctttatgcagaccaaaggtgtcaaagcaaatgatgtgactccaatggaa aaactgttggctggggtaagcggcgcgacgaattgtgcatttgtcaatcaaccggggcagggt gaagcattatgtggtggatgcactgcgattgcgttattcaaccaggcgaatcaggcaccaggt tttggtggtggttttaaaagcggtttacgtggaggaacacctgtaacaacgttcgtacgtggg atcgatcttcgttcaacggtgttactcaatgtcctcacattacctcgtcttcaaaaacaattt cctaatgaatcacatacggaaaaccaacctacctggattaaacctatcaagtccaatgagtct atacctgcttcgtcaattgggtttgtccgtggtctattctggcaaccagcgcatattgaatta tgcgatcccattgggattggtaaatgttcttgctgtggacaggaaagcaatttgcgttatacc ggttftcttaaggaaaaatttacctttacagttaatgggctatggccccatccgcattcccct tgtctggtaacagtcaagaaaggggaggttgaggaaaaatttcttgctttcaccacctccgca ccatcatggacacaaatcagccgagttgtggtagataagattattcaaaatgaaaatggaaat cgcgtggcggcggttgtgaatcaattcagaaatattgcgccgcaaagtcctcttgaattgatt atggggggatatcgtaataatcaagcatctattatgaacggcgtcatgatgtgttgatgttta atcaggggtggcaacaatacggcaatgtgataaacgaaatagtgactgttggtttgggatata aaacagccttacgcaaggcgttatatacctttgcagaagggtttaaaaataaagacttcaaag gggccggagtctctgttcatgagactgcagaaaggcatttctatcgacagagtgaattattaa ttcccgatgtactggcgaatgttaatttttcccaggctgatgaggtaatagctgatttacgag acaaacttcatcaattgtgtgaaatgctatttaatcaatctgtagctccctatgcacatcatc ctaaattaataagcacattagcgcttgcccgcgccacgctatacaaacatttacgggagttaa aaccgcaaggagggccatcaaatggctga >protein sequence of His.sub.6-Dual-monopartite NLS SV40-FokI-(Sharkey-KKR)-Cse1 (“His6” disclosed as SEQ ID NO: 48) [SEQ ID NO: 23] MHHHHHHPKKKRKVDPKKKRKVEDPKDMAQLVKSELEEKKSELRHKLKYVPHEYIELIEIARN PTQDRILEMKVMEFFMKVYGYRGEHLGGSRKPDGAIYTVGSPIDYGVIVDTKAYSGGYNLPIG QADEMQRYVKENQTRNKHINPNEWWKVYPSSVTEFKFLFVSGHFKGNYKAQLTRLNRKTNCNG AVLSVEELLIGGEMIKAGTLTLEEVRRKFNNGEINFADPTNRAKGLEAVSVASMNLLIDNWIP VRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMN PLTEDEFQQLIAPWID1VIFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPG QGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQK QFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLR YTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNEN GNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGL GYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIAD LRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSNG* >nucleotide sequence of His.sub.6-Dual-monopartite NLS SV40-FokI (Sharkey-ELD)-Cse1 (“His.sub.6” disclosed as SEQ ID NO: 48) [SEQ ID NO: 24] ATGcatcaccatcatcaccacCCGAAAAAAAAGCGCAAAGTGGATCCGAAGAAAAAACGTAAA GTTGAAGATCCGAAAGACATGGCTCAACTGGTTAAAAGCGAACTGGAAGAGAAAAAAAGTGAA CTGCGCCACAAACTGAAATATGTGCCGCATGAATATATCGAGCTGATTGAAATTGCACGTAAT CCGACCCAGGATCGTATTCTGGAAATGAAAGTGATGGAATTTTTTATGAAAGTGTACGGCTAT CGCGGTGAACATCTGGGTGGTAGCCGTAAACCGGATGGTGCAATTTATACCGTTGGTAGCCCG ATTGATTATGGTGTTATTGTTGATACCAAAGCCTATAGCGGTGGTTATAATCTGCCGATTGGT CAGGCAGATGAAATGGAACGTTATGTGGAAGAAAATCAGACCCGTGATAAACATCTGAATCCG AATGAATGGTGGAAAGTTTATCCGAGCAGCGTTACCGAGTTTAAATTCCTGTTTGTTAGCGGT CACTTCAAAGGCAACTATAAAGCACAGCTGACCCGTCTGAATCATATTACCAATTGTAATGGT GCAGTTCTGAGCGTTGAAGAACTGCTGATTGGTGGTGAAATGATTAAAGCAGGCACCCTGACC CTGGAAGAAGTTCGTCGCAAATTTAACAATGGCGAAATCAACTTTGCGGATCCCACCAACCGC GCGAAAGGCCTGGAAGCGGTGAGCGTGGCGAGCatgaatttgcttattgataactggattcct gtacgcccgcgaaacggggggaaagtccaaatcataaatctgcaatcgctatactgcagtaga gatcagtggcgattaagtttgccccgtgacgatatggaactggccgctttagcactgctggtt tgcattgggcaaattatcgccccggcaaaagatgacgttgaatttcgacatcgcataatgaat ccgctcactgaagatgagtttcaacaactcatcgcgccgtggatagatatgttctaccttaat cacgcagaacatccctttatgcagaccaaaggtgtcaaagcaaatgatgtgactccaatggaa aaactgttggctggggtaagcggcgcgacgaattgtgcatttgtcaatcaaccggggcagggt gaagcattatgtggtggatgcactgcgattgcgttattcaaccaggcgaatcaggcaccaggt tttggtggtggttttaaaagcggtttacgtggaggaacacctgtaacaacgttcgtacgtggg atcgatcttcgttcaacggtgttactcaatgtcctcacattacctcgtcttcaaaaacaattt cctaatgaatcacatacggaaaaccaacctacctggattaaacctatcaagtccaatgagtct atacctgcttcgtcaattgggtttgtccgtggtctattctggcaaccagcgcatattgaatta tgcgatcccattgggattggtaaatgttcttgctgtggacaggaaagcaatttgcgttatacc ggttttcttaaggaaaaatttacctttacagttaatgggctatggccccatccgcattcccct tgtctggtaacagtcaagaaaggggaggttgaggaaaaatttcttgctttcaccacctccgca ccatcatggacacaaatcagccgagttgtggtagataagattattcaaaatgaaaatggaaat cgcgtggcggcggttgtgaatcaattcagaaatattgcgccgcaaagtcctcttgaattgatt atggggggatatcgtaataatcaagcatctattcttgaacggcgtcatgatgtgttgatgttt aatcaggggtggcaacaatacggcaatgtgataaacgaaatagtgactgttggtttgggatat aaaacagccttacgcaaggcgttatatacctttgcagaagggtttaaaaataaagacttcaaa ggggccggagtctctgttcatgagactgcagaaaggcatttctatcgacagagtgaattatta attcccgatgtactggcgaatgttaatttttcccaggctgatgaggtaatagctgatttacga gacaaacttcatcaattgtgtgaaatgctatttaatcaatctgtagctccctatgcacatcat cctaaattaataagcacattagcgcttgcccgcgccacgctatacaaacatttacgggagtta aaaccgcaaggagggccatcaaatggctga >protein sequence of His6-Dual-monopartite NLS SV40-FokI(Sharkey-ELD)-Cse1 (“His.sub.6” disclosed as SEQ ID NO: 48) [SEQ ID NO: 25] MHHEIHHHPKKKRKVDPKKKRKVEDPKDMAQLVKSELEEKKSELRHKLKYVPHEYIELIEIAR NPTQDRILEMKVMEFFMKVYGYRGEHLGGSRKPDGAIYTVGSPIDYGVIVDTKAYSGGYNLPI GQADEMERYVEENQTRDKHLNPNEWWKVYPSSVTEFKFLFVSGHFKGNYKAQLTRLNHITNCN GAVLSVEELLIGGEMIKAGTLTLEEVRRKFNNGEINFADPTNRAKGLEAVSVASMNLLIDNWI PVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIM NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQ GEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQ FPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRY TGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENG NRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLG YKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADL RDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSNG*
[0175] DNA Cleavage Assay
[0176] The specificity and activity of the complexes was tested using an artificially constructed target plasmid as a substrate. This plasmid contains M13 and P7 binding sites on opposing strands such that both FokI domains face each other (see
[0177] The sequences of the target plasmids used are as follows. The number indicated the distance between the M13 and P7 target sites. Protospacers are shown in bold, PAMs underlined:
[0178] Sequences of the target plasmids. The number indicates the distance between the M13 and P7 target sites. (protospacers in bold, PAMs underlined)
TABLE-US-00007 >50 bp [SEQ ID NO: 26] gaattcACAACGGTGAGCAAGTCACTGTTGGCAAGCCAGGATCTGAACAA TACCGTCTTGCTTTCGAGCGCTAGCTCTAGAACTAGTCCTCAGCCTAGGC CTCGTTCCGAAGCTGTCTTTCGCTGCTGAGGGTGACGATCCCGCATAGGC GGCCTTTAACTCggatcc >45 bp [SEQ ID NO: 27] gaattcACAACGGTGAGCAAGTCACTGTTGGCAAGCCAGGATCTGAACAA TACCGTCTTTTCGAGCGCTAGCTCTAGAACTAGTCCTCAGCCTAGGCCTC GTTCAAGCTGTCTTTCGCTGCTGAGGGTGACGATCCCGCATAGGCGGCCT TTAACTCggatcc >40 bp [SEQ ID NO: 28] gaattcACAACGGTGAGCAAGTCACTGTTGGCAAGCCAGGATCTGAACAA TACCGTCTTCGAGCGCTAGCTCTAGAACTAGTCCTCAGCCTAGGCCTCGA AGCTGTCTTTCGCTGCTGAGGGTGACGATCCCGCATAGGCGGCCTTTAAC TCggatcc >35 bp [SEQ ID NO: 29] gaattcACAACGGTGAGCAAGTCACTGTTGGCAAGCCAGGATCTGAACAA TACCGTCTTGCGCTAGCTCTAGAACTAGTCCTCAGCCTAGGCCTAAGCTG TCTTTCGCTGCTGAGGGTGACGATCCCGCATAGGCGGCCTTTAACTCgga tcc >30 bp [SEQ ID NO: 30] gaattcACAACGGTGAGCAAGTCACTGTTGGCAAGCCAGGATCTGAACAA TACCGTCTTGCTAGCTCTAGAACTAGTCCTCAGCCTAGGAAGCTGTCTTT CGCTGCTGAGGGTGACGATCCCGCATAGGCGGCCTTTAACTCggatcc >25 bp [SEQ ID NO: 31] gaattcACAACGGTGAGCAAGTCACTGTTGGCAAGCCAGGATCTGAACAA TACCGTCTTCTCTAGAACTAGTCCTCAGCCTAGGAAGCTGTCTTTCGCTG CTGAGGGTGACGATCCCGCATAGGCGGCCTTTAACTCggatcc
[0179] Cleavage of the target plasmids was analysed on agarose gels, where negatively supercoiled (nSC) plasmid can be distinguished from linearized- or nicked plasmid. The cleavage site of the Cascade.sup.KKR/ELD pair in a target vector was determined by isolating linear cleavage products from an agarose gel and filling in the recessed 3′ ends left by FokI cleavage with the Klenow fragment of E. coli DNA polymerase to create blunt ends. The linear vector was self-ligated, transformed, amplified, isolated and sequenced. Filling in of recessed 3′ ends and re-ligation will lead to extra nucleotides in the sequence that represents the overhang left by FokI cleavage. By aligning the sequence reads to the original sequence, the cleavage sites can be found on a clonal level and mapped. Below, the additional bases incorporated into the sequence after filling in recessed 3′ ends left by FokI cleavage are underlined:
TABLE-US-00008 FokI cleavage 5′ CTTGCGCTAGCTCTAGAA CTAGTCCTCAGCCTAGGCCTAAG 3′ 3′ GAACGCGATCGAGATCTTGATC AGGAGTCGGATCCGGATTC 5′ 3′ fill in, ligation 5′ CTTGCGCTAGCTCTAGAACTAG - CTAGTCCTCAGCCTAGGCCTAAG 3′ 3′ GAACGCGATCGAGATCTTGATC - GATCAGGAGTCGGATCCGGATTC 5′
[0180] Reading from top to bottom, the 5′-3′ sequences above are SEQ ID NOs: 32-35, respectively.
[0181] Cleavage of a Target Locus in Human Cells
[0182] The human CCR5 gene encodes the C-C chemokine receptor type 5 protein, which serves as the receptor for the human immunodeficiency virus (HIV) on the surface of white blood cells. The CCR5 gene is targeted using a pair of Cascade.sup.KKR/ELD nucleases in addition to an artificial GFP locus. A suitable binding site pair is selected on the coding region of CCR5. Two separate CRISPR arrays containing uniform spacers targeting each of the binding sites are constructed using DNA synthesis (Geneart).
[0183] The human CCR5 target gene selection and CRISPR designs used are as follows:
TABLE-US-00009 >Part of genomic human CCR5 sequence, containing whole ORF (position 347-1446). [SEQ ID NO: 36] GGTGGAACAAGATGGATTATCAAGTGTCAAGTCCAATCTATGACATCAAT TATTATACATCGGAGCCCTGCCAAAAAATCAATGTGAAGCAAATCGCAGC CCGCCTCCTGCCTCCGCTCTACTCACTGGTGTTCATCTTTGGTTTTGTGG GCAACATGCTGGTCATCCTCATCCTGATAAACTGCAAAAGGCTGAAGAGC ATGACTGACATCTACCTGCTCAACCTGGCCATCTCTGACCTGTTTTTCCT TCTTACTGTCCCCTTCTGGGCTCACTATGCTGCCGCCCAGTGGGACTTTG GAAATACAATGTGTCAACTCTTGACAGGGCTCTATTTTATAGGCTTCTTC TCTGGAATCTTCTTCATCATCCTCCTGACAATCGATAGGTACCTGGCTGT CGTCCATGCTGTGTTTGCTTTAAAAGCCAGGACGGTCACCTTTGGGGTGG TGACAAGTGTGATCACTTGGGTGGTGGCTGTGTTTGCGTCTCTCCCAGGA ATCATCTTTACCAGATCTCAAAAAGAAGGTCTTCATTACACCTGCAGCTC TCATTTTCCATACAGTCAGTATCAATTCTGGAAGAATTTCCAGACATTAA AGATAGTCATCTTGGGGCTGGTCCTGCCGCTGCTTGTCATGGTCATCTGC TACTCGGGAATCCTAAAAACTCTGCTTCGGTGTCGAAATGAGAAGAAGAG GCACAGGGCTGTGAGGCTTATCTTCACCATCATGATTGTTTATTTTCTCT TCTGGGCTCCCTACAACATTGTCCTTCTCCTGAACACCTTCCAGGAATTC TTTGGCCTGAATAATTGCAGTAGCTCTAACAGGTTGGACCAAGCTATGCA GGTGACAGAGACTCTTGGGATGACGCACTGCTGCATCAACCCCATCATCT ATGCCTTTGTCGGGGAGAAGTTCAGAAACTACCTCTTAGTCTTCTTCCAA AAGCACATTGCCAAACGCTTCTGCAAATGCTGTTCTATTTTCCAGCAAGA GGCTCCCGAGCGAGCAAGCTCAGTTTACACCCGATCCACTGGGGAGCAGG AAATATCTGTGGGCTTGTGACACGGACTCAAGTGGGCTGGTGACCCAGTC
[0184] Red1/2: chosen target sites (distance: 34 bp, PAM 5′-CTT-3′). “Red 1 is first appearing underlined sequence in the above. Red2 is the second underlined sequence.
TABLE-US-00010 >CRISPR array red1 (italics = spacers, bold = repeats) [SEQ ID NO: 37] ccatggTAATACGACTCACTATAGGGAGAATTAGCTGATCTTTAATAATA AGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCT TTAAGAACAAATGTATACTTTTAGAGAGTTCCCCGCGCCAGCGGGGATAA ACCGCAAACACAGCATGGACGACAGCCAGGTACCTAGAGTTCCCCGCGCC AGCGGGGATAAACCGCAAACACAGCATGGACGACAGCCAGGTACCTAGAG TTCCCCGCGCCAGCGGGGATAAACCGCAAACACAGCATGGACGACAGCCA GGTACCTAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAAACAAAAGGCT CAGTCGGAAGACTGGGCCTTTTGTTTTAACCCCTTGGGGCCTCTAAACGG GTCTTGAGGGGTTTTTTGggtacc >CRISPR array red2 (italics: spacers, bold: repeats) [SEQ ID NO: 38] ccatggTAATACGACTCACTATAGGGAGAATTAGCTGATCTTTAATAATA AGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCT TTAAGAACAAATGTATACTTTTAGAGAGTTCCCCGCGCCAGCGGGGATAA ACCGTGTGATCACTTGGGTGGTGGCTGTGTTTGCGTGAGTTCCCCGCGCC AGCGGGGATAAACCGTGTGATCACTTGGGTGGTGGCTGTGTTTGCGTGAG TTCCCCGCGCCAGCGGGGATAAACCGTGTGATCACTTGGGTGGTGGCTGT GTTTGCGTGAGTTCCCCGCGCCAGCGGGGATAAACCGAAAACAAAAGGCT CAGTCGGAAGACTGGGCCTTTTGTTTTAACCCCTTGGGGCCTCTAAACGG GTCTTGAGGGGTTTTTTGggtacc
[0185] Delivery of Cascade.sup.KKR/ELD into the Nucleus of Human Cells
[0186] Cascade is very stable as a multi-subunit protein-RNA complex and is easily produced in mg quantities in E. coli. Transfection or micro-injection of the complex in its intact form as purified from E. coli is used as methods of delivery (see
[0187] To deliver Cascade into human cells, protein transfection reagents are used from various sources including Pierce, NEB, Fermentas and Clontech. These reagents have recently been developed for the delivery of antibodies, and are useful to transfect a broad range of human cell lines with efficiencies up to 90%. Human HepG2 cells are transfected. Also, other cell lines including CHO-K1, COS-7, HeLa, and non-embryonic stem cells, are transfected.
[0188] To import the Cascade.sup.KKR/ELD nuclease pair into the nucleus, a tandem monopartite nuclear localisation signal (NLS) from the large T-antigen of simian virus 40 (SV40) is fused to the N-terminus of FokI. This ensures import of only intact Cascade.sup.ELD/KKR into the nucleus. (The nuclear pore complex translocates RNA polymerases (550 kDa) and other large protein complexes). As a check prior to transformations, the nuclease activity of the Cascade.sup.KKR/ELD nuclease pair is checked in vitro using purified complexes and CCR5 PCR amplicons to exclude transfecting non-productive Cascade.sup.KKR/ELD nuclease pairs.
[0189] Surveyor Assay
[0190] Transfected cells are cultivated and passaged for several days. The efficiency of in vivo target DNA cleavage is then assessed by using the Surveyor assay of Guschin, D. Y., et al (2010) Methods Mol. Biol., 649: 247-256. Briefly, PCR amplicons of the target DNA locus will be mixed 1:1 with PCR amplicons from untreated cells. These are heated and allowed to anneal, giving rise to mismatches at target sites that have been erroneously repaired by NHEJ. A mismatch nuclease is then used to cleave only mismatched DNA molecules, giving a maximum of 50% of cleavage when target DNA cleavage by Cascade.sup.KKR/ELD is complete. This procedure was then followed up by sequencing of the target DNA amplicons of treated cells. The assay allows for rapid assessment and optimization of the delivery procedure.
[0191] Production of Cascade-Nuclease Pairs
[0192] The Cascade-nuclease complexes were constructed as explained above. Affinity purification from E. coli using the StrepII-tagged Cse2 subunit yields a complex with the expected stoichiometry when compared to native Cascade. Referring to
[0193] Apart from an intact FokI-Cse1 fusion protein, we observed that a fraction of the FokI-Cse1-fusion protein is proteolytically cleaved, resulting in a Cse1 protein with only the linker and a small part of FokI attached to it (as confirmed by Mass Spectrometry, data not shown). In most protein isolations the fraction of degraded fusion protein is approximately 40%. The isolated protein is stably stored in the elution buffer (20 mM HEPES pH 7.5, 75 mM NaCl, 1 mM DTT, 4 mM desthiobiotin) with additional 0.1% Tween 20 and 50% glycerol at −20° C. Under these storage conditions, integrity and activity of the complex have been found stable for at least three weeks (data not shown).
[0194] Introduction of a His.sub.6-Tag (SEQ ID NO: 48) and NLS to the Cascade-Nuclease
[0195] The Cascade nuclease fusion design was modified to incorporate a Nucleolar Localization Signal (NLS) to enable transport into the nucleus of eukaryotic cells. For this a tandem monopartite NLS from the large T-antigen of Simian Virus SV40 (sequence: PKKKRKVDPKKKRKV) (SEQ ID NO: 49) was translationally fused to the N-terminus of the FokI-Cse1 fusion protein, directly preceded by a His.sub.6-tag at the N-terminus. The His.sub.6-tag (sequence: MHHHHHH) (“His.sub.6” disclosed as SEQ ID NO: 48 and “MHHHHHH” disclosed as SEQ ID NO: 50) allows for an additional Ni.sup.2+-resin affinity purification step after StrepII purification. This additional step ensures the isolation of only full-length Cascade-nuclease fusion complex, and increases the efficiency of cleavage by eliminating the binding of non-intact Cascade complexes to the target site forming an unproductive nuclease pair.
[0196] In Vitro Cleavage Assay
[0197] Cascade.sup.KKR/ELD activity and specificity was assayed in vitro as described above.
[0198] Cleavage Conditions
[0199] To assess the optimal buffer conditions for cleavage assays, and to estimate whether activity of the complex is expected at physiological conditions, the following two buffers were selected: (1) NEB4 (New England Biolabs, 50 mM potassium acetate, 20 mM Tris-acetate, 10 mM magnesium acetate, 1 mM dithiothreitol, pH 7.9) and (2) Buffer 0 (Fermentas, 50 mM Tris-HCl, 10 mM MgCl.sub.2, 100 mM NaCl, 0.1 mg/mL BSA, pH 7.5). Of the two, NEB4 is recommended for optimal activity of the commercial intact FokI enzyme. Buffer 0 was chosen from a quick screen to give good activity and specificity (data not shown).
[0200] Cleavage Site
[0201] The site of cleavage in the target plasmid with a spacing of 35 bp (pTarget35) was determined.
[0202]
[0203] Cleaving a target locus in human cells.
[0204] Cascade.sup.KKR/ELD nucleases were successfully modified to contain an N-terminal His.sub.6-tag (SEQ ID NO: 48) followed by a dual mono-partite Nucleolar Localisation Signal. These modified Cascade nuclease fusion proteins were co-expressed with either one of two synthetically constructed CRISPR arrays, each targeting a binding site in the human CCR5 gene. First the activity of this new nuclease pair is validated in vitro by testing the activity on a plasmid containing this region of the CCR5 gene. The nuclease pair is transfected to a human cell line, e.g. HeLa cell line. Efficiency of target cleavage is assessed using the Surveyor assay as described above.