VERSATILE METHOD FOR THE DETECTION OF MARKER-FREE PRECISION GENOME EDITING AND GENETIC VARIATION
20210283567 · 2021-09-16
Inventors
Cpc classification
B01J2219/00378
PERFORMING OPERATIONS; TRANSPORTING
B01J2219/00369
PERFORMING OPERATIONS; TRANSPORTING
C12Q2521/313
CHEMISTRY; METALLURGY
C12Q2525/185
CHEMISTRY; METALLURGY
C12Q2521/313
CHEMISTRY; METALLURGY
B01J19/0046
PERFORMING OPERATIONS; TRANSPORTING
C12Q2525/185
CHEMISTRY; METALLURGY
C40B40/06
CHEMISTRY; METALLURGY
International classification
B01J19/00
PERFORMING OPERATIONS; TRANSPORTING
C40B40/06
CHEMISTRY; METALLURGY
Abstract
The present disclosure provides, inter alia, specially designed DNA adaptors and methods of preparing the same. Methods and kits for carrying out and detecting marker-free precision genome editing and genetic variation using such adaptors are also provided.
Claims
1. A method for detecting a genetic modification in a DNA sequence of interest, comprising the steps of: (a) amplifying the DNA sequence of interest using a specially designed Type IIS restriction enzyme-tagging primer, comprising: (i) obtaining the DNA sequence of interest from a biological sample; (ii) synthesizing the Type IIS restriction enzyme-tagging primer based on the DNA sequence of interest; (iii) amplifying the DNA sequence of interest using the Type IIS restriction enzyme-tagging primer and a reverse primer; and (iv) purifying a Type IIS restriction enzyme-tagged amplicon; (b) digesting the Type IIS restriction enzyme-tagged amplicon with the Type IIS restriction enzyme; (c) isolating the smaller DNA fragment containing the genetic modification exposed in a 3′ single-stranded overhang; (d) capturing the genetic modification, comprising: (i) preparing a library of 16 DNA adaptors, wherein each DNA adaptor comprises one strand with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAG CTCGGTACCCG-3′ (SEQ ID NO: 230); and one complementary strand with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTAC CCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246), and wherein each DNA adaptor has a different “NN”; (ii) incubating the isolated smaller DNA fragment containing the 3′ overhang with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (e) amplifying the ligated product to detect the presence of the genetic modification, wherein the DNA sequence of interest is a genomic locus or corresponds to a genomic locus of an RNA virus variant.
2. The method of claim 1, wherein the DNA sequence of interest corresponds to a genomic locus of an RNA virus variant, and wherein obtaining the DNA sequence of interest comprises obtaining the RNA sequence from the RNA virus variant and converting it to the corresponding DNA sequence by reverse transcription PCR (RT-PCR).
3. The method of claim 2, wherein the RNA virus is SARS-CoV-2.
4. The method of claim 1, wherein the Type IIS restriction enzyme is selected from AcuI, BpmI, BpuEI, BsgI, MmeI and NmeAIII.
5. The method of claim 4, wherein the Type IIS restriction enzyme is AcuI.
6. The method of claim 1, wherein the Type IIS restriction enzyme-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; (b) a complementary sequence of the genomic locus of interest on the 5′ side; (c) a recognition motif of the Type IIS restriction enzyme that is positioned at a predicted distance from its cleavage site to generate the genomic signature of interest; and (d) a complementary sequence of the genomic locus of interest on the 3′ side.
7. A kit for detecting a genetic modification of interest, comprising a specially designed Type IIS restriction enzyme-tagging primer according to claim 6, and a library of DNA adaptors according to claim 1, packaged together with instructions for its use.
8. The method of claim 5, wherein the AcuI-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; and (b) a complementary sequence of the genomic locus of interest containing an AcuI motif (5′-CTGAAG-3′) positioned 14 bp upstream from the genomic locus of interest.
9. The method of claim 8, wherein the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
10. The method of claim 8, wherein the non-complementary handle sequence is 25 bp.
11. The method of claim 8, wherein the complementary sequence has the structure of: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C, depending on the DNA sequence of the genomic locus of interest.
12. The method of claim 8, wherein the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3) and the complementary sequence is 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C.
13. A kit for detecting a genetic modification, comprising a specially designed AcuI-tagging primer and a library of DNA adaptors according to claim 1, packaged together with instructions for its use.
14. A method for quantifying a genomic variant in a biological system, comprising the steps of: (a) obtaining a sample from the biological system; (b) amplifying a DNA sequence of interest using a specially designed AcuI-tagging primer, wherein the DNA sequence of interest is a genomic locus or corresponds to a genomic locus of an RNA virus variant, comprising: (i) obtaining the DNA sequence of interest by (1) genomic extraction or (2) obtaining the RNA sequence from the RNA virus variant and converting it to the corresponding DNA sequence by reverse transcription PCR (RT-PCR); (ii) synthesizing the AcuI-tagging primer based on the DNA sequence of interest; (iii) amplifying the DNA sequence of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged amplicon; (c) digesting the AcuI-tagged amplicon with restriction enzyme AcuI; (d) isolating the smaller DNA fragment containing the genomic variant of interest produced by the AcuI-digestion; (e) capturing the genomic variant of interest, comprising: (i) preparing the library of DNA adaptors according to claim 1; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (f) quantifying the genomic variant and determining its relative abundance.
15. The method of claim 14, wherein the genomic variant is generated by precision genome editing.
16. The method of claim 15, wherein the precision genome editing is CRISPER-dependent homology-directed repair, base editing or prime editing.
17. The method of claim 14, wherein the quantification in step (f) is carried out by quantitative PCR (qPCR).
18. A method for identifying and quantifying an oncogenic mutation of interest in a biological sample, comprising the steps of: (a) obtaining a biological sample; (b) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged genomic amplicon; (c) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI; (d) isolating the smaller DNA fragment containing the oncogenic mutation of interest produced by the AcuI-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors according to claim 1; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; (f) amplifying the ligated product to identify the presence of the oncogenic mutation of interest; and (g) quantifying the oncogenic mutation of interest, if present, and determining its frequency.
19. The method of claim 18, wherein the biological sample is obtained from a cancer animal model, a patient-derived xenograft (PDX), or a human cancer patient sample.
20. The method of claim 18, wherein the quantification in step (g) is carried out by quantitative PCR (qPCR).
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0019] The application file contains at least one photograph executed in color. Copies of this patent application with color photographs will be provided by the Office upon request and payment of the necessary fee.
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
[0054]
[0055]
[0056]
[0057]
[0058]
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103]
[0104]
[0105]
[0106]
[0107]
[0108]
[0109]
[0110]
DETAILED DESCRIPTION OF THE DISCLOSURE
[0111] The present disclosure provides a versatile method that uses standard molecular biology techniques to detect variants introduced by precision genome editing or resulting from genetic variation. This detection method, designated Dinucleotide signaTurE CapTure (DTECT), enables accurate and sensitive quantification of marker-free precision genome editing events induced by CRISPR-dependent HDR, base editing and prime editing. In addition, we show that DTECT can readily identify oncogenic mutations in cancer mouse models, patient-derived xenograft models and cancer patient samples. These studies establish a cost-effective method for the rapid detection of genetic variants, which will aid the generation of marker-free cellular and animal models of human disease and expedite the detection of pathogenic variants for clinical applications.
[0112] Accordingly, one embodiment of the present disclosure is a DNA adaptor comprising: (a) one strand with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACC CG-3′ (SEQ ID NO: 230); and (b) one complementary strand with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCC CAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246).
[0113] In some embodiments, the DNA adaptor is labeled with a detection molecule. Non-limiting examples of the detection molecule include a radiolabel, a fluorescent label, a biotinylated label, a non-fluorescent label, an enzyme, a hapten, a phosphorescent molecule, a chemiluminescent molecule, a chromophore, a luminescent molecule, a photoaffinity molecule, a color particle or a ligand.
[0114] Another embodiment of the present disclosure is a method of preparing a DNA adaptor disclosed herein, comprising: (a) synthesizing one constant oligonucleotide with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACC CG-3′ (SEQ ID NO: 230); (b) synthesizing one complementary oligonucleotide with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCC CAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246); (c) mixing the constant and complementary oligonucleotides; and (d) annealing the mixture to obtain the DNA adaptor.
[0115] Another embodiment of the present disclosure is a library of DNA adaptors prepared by methods disclosed herein, the library comprises 16 DNA adaptors, wherein each DNA adaptor has a different “NN”.
[0116] Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Type IIS restriction enzyme-tagging primer, comprising: (i) extracting genomic DNA from a biological sample of interest; (ii) synthesizing the Type IIS restriction enzyme-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Type IIS restriction enzyme-tagging primer and a reverse primer; and (iv) purifying a Type IIS restriction enzyme-tagged genomic amplicon; (b) digesting the Type IIS restriction enzyme-tagged genomic amplicon with the Type IIS restriction enzyme; (c) isolating the smaller DNA fragment containing a genomic signature of interest exposed in a 3′ single-stranded overhang; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment containing the 3′ overhang signature with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (e) amplifying the ligated product to detect the presence of the genetic modification.
[0117] In some embodiments, the genetic modification is selected from a base change, a deletion, or an insertion. In some embodiments, the genetic modification is selected from a single genomic change or multiple genomic changes. In some embodiments, the multiple genomic changes can occur within a single locus or distinct loci.
[0118] In some embodiments, the Type IIS restriction enzyme is selected from AcuI, BpmI, BpuEI, BsgI, MmeI and NmeAIII. In some embodiments, the Type IIS restriction enzyme is selected from AcuI and BpuEI. In some embodiments, the Type IIS restriction enzyme is AcuI.
[0119] In some embodiments, the Type IIS restriction enzyme-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; (b) a complementary sequence of the genomic locus of interest on the 5′ side; (c) a recognition motif of the Type IIS restriction enzyme that is positioned at a predicted distance from its cleavage site to generate the genomic signature of interest; and (d) a complementary sequence of the genomic locus of interest on the 3′ side.
[0120] In some embodiments, the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
[0121] In some embodiments, the non-complementary handle sequence can have any suitable length. In some embodiments, the non-complementary handle sequence is 25 bp. In some embodiments, the non-complementary handle sequence can have any suitable sequence. In some embodiments, the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3).
[0122] In some embodiments, the ligation in step (d)(ii) of the methods disclosed above is carried out by T4 DNA ligase.
[0123] A further embodiment of the present disclosure is a kit for detecting a genetic modification of interest, comprising a specially designed Type IIS restriction enzyme-tagging primer disclosed herein, and a library of DNA adaptors disclosed herein, packaged together with instructions for its use. In some embodiments, the Type IIS restriction enzyme is AcuI.
[0124] Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged genomic amplicon; (b) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI; (c) isolating the smaller DNA fragment containing a genomic signature of interest produced by AcuI-digestion; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (e) amplifying the ligated product to detect the presence of the genetic modification.
[0125] In some embodiments, the AcuI-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; and (b) a complementary sequence of the genomic locus of interest containing an AcuI motif (5′-CTGAAG-3′) positioned 14 bp upstream from the genomic locus of interest.
[0126] In some embodiments, the AcuI-tagging primer can have any suitable length. In some embodiments, the AcuI-tagging primer is 60 bp.
[0127] In some embodiments, the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
[0128] In some embodiments, the non-complementary handle sequence can have any suitable length. In some embodiments, the non-complementary handle sequence is 25 bp.
[0129] In some embodiments, the complementary sequence has the structure of: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C, depending on the DNA sequence of the genomic locus of interest.
[0130] In some embodiments, the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3) and the complementary sequence is 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C.
[0131] In some embodiments, the ligation in step (d)(ii) of the methods disclosed above is carried out by T4 DNA ligase.
[0132] An additional embodiment of the present disclosure is a kit for detecting a genetic modification, comprising a specially designed AcuI-tagging primer and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
[0133] Another embodiment of the present disclosure is a method for quantifying a genomic variant in a biological system, comprising the steps of: (a) obtaining a sample from the biological system; (b) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged genomic amplicon; (c) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the AcuI-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (f) quantifying the genomic variant and determining its relative abundance.
[0134] In some embodiments, the genomic variant is generated by precision genome editing. In some embodiments, the precision genome editing is CRISPER-dependent homology-directed repair, base editing or prime editing.
[0135] In some embodiments, the biological system is a mammalian cell line, an organoid, or a tissue.
[0136] In some embodiments, the quantification in step (f) of the methods disclosed above is carried out by quantitative PCR (qPCR).
[0137] Still another embodiment of the present disclosure is a method for identifying and quantifying an oncogenic mutation of interest in a biological sample, comprising the steps of: (a) obtaining a biological sample; (b) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged genomic amplicon; (c) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the AcuI-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; (f) amplifying the ligated product to identify the presence of the oncogenic mutation of interest; and (g) quantifying the oncogenic mutation of interest, if present, and determining its frequency.
[0138] In some embodiments, the biological sample is obtained from a cancer animal model, a patient-derived xenograft (PDX), or a human cancer patient sample.
[0139] In some embodiments, the quantification in step (g) of the methods disclosed above is carried out by quantitative PCR (qPCR).
[0140] A further embodiment of the present disclosure is a process for marker-free detection of a precision genome editing event comprising carrying out Dinucleotide signaTurE CapTure (DTECT) on a nucleic acid sequence of interest.
[0141] DTECT can also be used to detect genetic signatures in any organism, for example, a virus. Thus, still another embodiment of the present disclosure is a method for detecting a virus variant of interest, comprising the steps of: (a) obtaining a nucleic acid of the virus variant of interest from a biological sample; and (b) if the nucleic acid is DNA, carrying out Dinucleotide signaTurE CapTure (DTECT) to detect the variant of interest; or (c) if the nucleic acid is RNA, coverting it to DNA by reverse transcription PCR (RT-PCR) and then carrying out DTECT to detect the variant of interest. This detection method is applicable to any type of virus including but not limited to a DNA virus, an RNA virus, a retrovirus, etc. In some embodiments, the virus is an RNA virus. In some embodiments, the virus is SARS-CoV-2.
[0142] The following examples are provided to further illustrate the methods of the present disclosure. These examples are illustrative only and are not intended to limit the scope of the disclosure in any way.
EXAMPLES
Example 1
Methods and Materials
[0143] Material Availability
[0144] Plasmids for DTECT quantification and expression of base editing sgRNAs targeting BRCA1, BRCA2 and FANCD2 have been deposited to Addgene (#139321-139333, and 139511).
[0145] Cell Line Generation and Single Clone Isolation
[0146] HEK293T and DLD1 cell lines were obtained from ATCC. Cells were cultured in DMEM (ThermoFisher Scientific) supplemented with 10% Fetalgro bovine growth serum (BGS, RMBIO) and 1% penicillin-streptomycin (ThermoFisher Scientific). Cells were grown at 37° C. with 5% CO.sub.2 and tested regularly for mycoplasma. NIH/3T3 were maintained in DMEM supplemented with 10% bovine calf serum. Organoids were isolated and cultured as previously described (Zafra et al., 2018). To generate cells constitutively expressing FNLS-BE3-P2A-BlastR, HEK293T cells were infected with a lentivirus expressing the above construct. Viruses were produced in HEK293T in 6-well plates by transfecting 2 μg of FNLS-BE3-P2A-BlastR, 0.2 μg of Tat, 0.2 μg of Gag/Pol, 0.2 μg of Rev, 0.4 μg of VSV-G expressing plasm ids in 250 μl of DMEM without serum. 9 μl of TranslT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. The DNA transfection reagent mix was added dropwise to the cells and incubated at 37° C. with 5% CO.sub.2. The next day the cell medium was replaced and cells were incubated for 48 hours. The medium containing lentiviruses was then collected and utilized to infect new HEK293T cells. 48 hours after infection, blasticidin was added to the medium until the uninfected control cells were killed. FNLS-BE3 expression was determined by western blot and the base editing activity of the construct was tested using previously validated sgRNAs. Single HEK293T clones were selected for high base editing efficiency. Clones were isolated by trypsinization of the initial cell population into individual cells. Cell density was evaluated by counting the cells with a hemocytometer and cells were diluted to approximately 0.13 cells/μl, equivalent to 20 cells per 150 μl. Serial dilutions were prepared and 150 μl of the diluted cell mixture were seeded into 96-well plates. Single clones were expanded and further examined for FNLS-BE3 expression and activity.
[0147] Editing of Cell Lines, Organoids and Mice
[0148] To induce CRISPR-mediated HDR editing, HEK293T cells were seeded at 50%-70% confluency into 24-well plates and reverse transfected with 0.25 μg of sgRNA and 0.25 μg of Cas9 expressing plasmid (Addgene #42230) with or without 0.5 μl of ssODN (40 μM) into 100 μl of DMEM without Fetalgro BGS and antibiotics. 3 μl of TranslT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. Experiments involving i53 were done by adding 0.25 μg of i53 (Addgene #77939) to the transfection mixture. The gDNAs of cell populations and individual clones were recovered by resuspending the cell pellets in the Quick Extract DNA Extraction Solution (Epicentre), followed by incubation at 65° C. for 10 min and 95° C. for 5 min. The isolated gDNAs were diluted in H.sub.2O, quantified using Nanodrop and stored at −20° C. or directly used in PCR reactions. In base editing experiments, we used cells constitutively expressing FNLS-BE3 or transfected with pCMV-BE3 (Addgene #73021) and sgRNAs, as described above. Empty plasmids (Addgene #100708) with no sgRNAs were used as controls. To determine the accuracy of the quantification of variant frequency by DTECT (
[0149] In order to introduce multiple variants into the BRCA1 and BRCA2 genes, HEK293T cells expressing FNLS-BE3 were seeded at 50%-70% confluency into 24-well plates and reverse transfected with 1 μg of sgRNA into 100 μl of DMEM without Fetalgro BGS and antibiotics. 3 μl of TranslT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. The DNA transfection mix was added dropwise to the cells and incubated at 37° C. with 5% CO.sub.2 for 4 days. Single clones were generated and the gDNAs of cell populations and individual clones were recovered as describe above. Genomic loci were Sanger sequenced by Eton Bioscience or Genewiz. Sanger sequencing data were analyzed using Serial cloner and viewed by Snapgene Viewer. The sequencing profiles shown in this manuscript were generated by SnapGene Viewer. Quantitative detection of the editing level using the AcuI-tagged amplicon was done blindly.
[0150] In vivo mouse editing was performed as previously described (Zafra et al., 2018). Briefly, eight week-old C57BL/6N mice (Charles River) were injected with 0.9% sterile sodium chloride solution containing 20 μg of pLenti-FNLS-P2A-Puro and 10 μg of sgRNA vector. The total injection volume corresponded to 20% of the individual mouse body weight and was injected into the lateral tail vein in 5-7 seconds. All animal experiments were authorized by the regional board of Karlsruhe, Germany.
[0151] Mouse Genotyping and Bone Marrow Transplantation
[0152] The generation of genetically engineered mice harboring the Brca1 S1598F and Bard1 S563F alleles was previously described (Billing et al., 2018; Shakya et al., 2011). Mouse genotyping was performed using DTECT on genomic DNA extracted from mouse tails. AcuI-tagging of the targeted loci was performed using 50 ng of gDNA (see DTECT protocol above). All primer sequences are listed in Table S1. Genotyping experiments were conducted blindly.
[0153] Competitive transplantation experiments were performed to assess chimerism of Jak2 V617F mutant cells in relation to wild-type support. Specifically, Mx1-Cre.sup.+; CD45.2 Jak2.sup.V617F/+ and Mx1Cre.sup.+; CD45.1 wild-type mice were dosed with polyinosine-polycytosine (PIPC) 8 weeks prior to sacrifice to induce MPN in mutant mice. On day of sacrifice, dissected femurs and tibias were isolated and bone marrow flushed with a syringe into PBS. Red blood cells (RBCs) were lysed in ammonium chloride-potassium bicarbonate lysis buffer for 10 min on ice. 1.5×10.sup.6 filtered whole donor Mx1-Cre.sup.+; Jak.sup.2V617F/+ bone marrow cells (CD45.2) were then mixed with wild-type 1.5×10.sup.6 competitor bone marrow cells (CD45.1) and transplanted via tail vein injection into lethally irradiated (2×550 Rad) CD45.1 host mice. Mice were then monitored serially for the development of MPN based on blood counts and donor chimerism by retroorbital bleed draws using heparinized microhematocrit capillary tubes (ThermoFisher Scientific). After 3 consecutive hematocrits of >65%, mice were then sacrificed for peripheral blood fluorescence-activated cell sorting (FACS) analysis and DNA extraction. All animal procedures were conducted in accordance with the Guidelines for the Care and Use of Laboratory Animals and were approved by the Institutional Animal Care and Use Committees at Memorial Sloan Kettering Cancer Center. The conditional Mx1-Cre.sup.+; Jak.sup.2V617F/+ mice are all C57BL/6 background and have been previously described (Mullally et al., 2010). Automated peripheral blood counts were obtained using a ProCyte Dx (IDEXX Laboratories) according to the manufacturer's protocol. For surface flow cytometry of mouse peripheral blood, bone marrow, and spleen, RBCs were lysed and stained with monoclonal antibodies in PBS plus 1% BSA for 1 hour on ice. For flow cytometry of erythroid lineage, bone marrow or splenic cells were stained without RBC lysis. DAPI was used for live/dead cell analysis. Cell populations were analyzed using an LSR Fortessa (Becton Dickinson), and data were analyzed with FlowJo software (Tree Star). DNA extraction was performed using the QIAamp DNA Micro Kit (Qiagen) per manufacturer's protocol.
[0154] Analysis of ALL Patient Samples and PDXs
[0155] DNA samples from leukemic ALL blasts obtained at diagnosis and after relapse were provided by multiple institutions, as previously described (Oshima et al., 2016). Informed consent was obtained at study entry and samples were collected under the supervision of local Institutional Review Boards for participating institutions and analyzed under the supervision of the Columbia University Irving Medical Center Institutional Review Board. Research was conducted in compliance with ethical regulations. ALL patients received standard combination chemotherapy at diagnosis. Diagnosis and relapse samples were harvested from bone marrow. High molecular weight genomic DNA from matched diagnosis and relapse samples of ALL patients was extracted from patient leukemic blasts or from xenografts using the DNeasy Blood & Tissue Kit (Qiagen) or the AllPrep DNA/RNA Mini Kit (Qiagen). Primary human xenograft ALL cells were passaged and harvested from the spleens of NRG (NOD.Cg-ag1tm1MomII2rgtm1WjI/SzJ, The Jackson Laboratory) mice. Whole exome sequencing was performed and analyzed as previously described (Oshima et al., 2016).
[0156] Vector Construction and Cloning
[0157] sgRNAs were synthesized as complementary oligonucleotides (IDT) compatible with BbsI restriction sites located into the B52 plasmid (Addgene #100708). Oligonucleotides were designed as previously described (Billon et al., 2017). Cloned sgRNAs were verified by Sanger sequencing. Sequences of the sgRNAs are available in Table S1. ssODNs used in HDR experiments were synthesized as ultramer oligos (IDT) and their sequences are available in Table S1. To generate the FNLS-BE3-P2A-BlastR plasmid, the pLenti-FNLS-P2A-Puro plasmid (Addgene #110841) (Zafra et al., 2018) was modified by replacing the puromycin resistance gene with the blasticidin resistance gene. Briefly, the blasticidin resistance gene coding sequence was amplified by PCR and recombined using Gibson assembly into FNLS-BE3-P2A. The FNLS-BE3-P2A-BlastR sequence was verified by Sanger sequencing.
[0158] AcuI-Tagging Primer Design
[0159] The AcuI-tagging oligonucleotide enables the insertion of an AcuI motif (5′-CTGAAG-3′) 14 bp away from a targeted dinucleotide. This motif is inserted as a hairpin in the middle of a sequence complementary to the targeted genomic locus. The AcuI-tagging oligonucleotide is 60 bp-long and contains a non-complementary handle sequence of 20-25 bp. Common handle sequences used are PB547 (5′-GATCCTCTAGAGTCGACCTG-3′) (SEQ ID NO: 1) or PB1072 (5′-GCAATTCCTCACGAGACCCGTCCTG-3′) (SEQ ID NO: 3) (Table S1). The oligonucleotide sequence complementary to the targeted genomic locus plus the AcuI motif has the following sequence: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C bases complementary to the targeted locus. Reverse primers used in AcuI-tagging reactions were designed by Primer 3 (http://bioinfo.ut.ee/primer3-0.4.0/) using the default parameters with the following changes: Mispriming library=“HUMAN” for amplifying from human genomic DNA or Mispriming library=“RODENT” for amplifying from mouse genomic DNA, Primer size “min=25, Opt=27, Max=30”, Primer Tm “Min=57.0° C., Opt=60.0° C., Max=63.0° C.”. Reverse primers are located >100 bp away from the targeted dinucleotides. All sequences of the primers used in this study are available in Table S1.
[0160] Adaptor Library Generation and Characterization
[0161] A set of 17 individual oligonucleotides constitutes the full adaptor library. This library contains: a) One constant oligonucleotide with the following sequence: 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTcttctaagaattcgagctcggtacccg-3′ (SEQ ID NO: 230). The lowercase nucleotide sequence located at the 3′-end of the constant oligonucleotide (5′-cttctaagaattcgagctcggtacccg-3′) (SEQ ID NO: 319) corresponds to the handle sequence used to detect the ligated products with either PB548 (5′-cgggtaccgagctcgaattc-3′) (SEQ ID NO: 2) or PB1073 (5′-cgggtaccgagctcgaattcttagaag-3′) (SEQ ID NO: 4); b) 16 variable oligonucleotides that contain a sequence complementary to the constant oligonucleotide plus one of 16 different dinucleotides at their 3′-end. The variable oligonucleotides have the following sequence: 5′-cgggtaccgagctcgaattcttagaagAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′. NN, with N=A, C, G or T (SEQ ID NOs: 231-246), corresponds to the dinucleotide that is different for each of the 16 oligos. The adaptor sequences are available in Table S1. The constant oligonucleotide and each variable oligonucleotide were resuspended at a concentration of 100 μM in H.sub.2O. 2.5 μl of constant oligonucleotide and 2.5 μl of each variable oligonucleotide were mixed with 1× ligase buffer (ThermoFisher Scientific) and water in a 20 μl reaction. The reactions were placed in a thermocycler and oligonucleotides were annealed by incubating them for 5 min at 95° C., followed by a gradual temperature decrease from 95° C. to 15° C. After annealing was completed, 100 μl of water were added to dilute the adaptors in a 120 μl final volume. Adaptors were frozen and stored at −20° C.
[0162] The adaptor library was tested at two independent loci, as shown in
[0163] The measurement of the dinucleotide capture efficiency of each adaptor (
[0164] The assay performed to measure the efficiency of DNA ligation (
[0165] To calculate the frequency of non-specific dinucleotide capture shown in
[0166] DTECT Protocol
[0167] The DTECT protocol consists of 6 steps (I-VI,
[0168] Next-Generation Sequencing
[0169] Samples for NGS were prepared by amplifying the edited regions of interest by PCR. Samples were sequenced by the Genome Sciences Facility at The Pennsylvania State College of Medicine or by Genewiz and the results were analyzed by Genewiz, or by using an R-based script of the Ciccia laboratory or CRISPResso2 (Clement et al., 2019). To ensure that no biases were introduced during DTECT assays, the AcuI-tagging amplicons for the BRCA1 and BRCA2 mutant samples were sequenced by NGS and analyzed using an R-based script. In this analysis, 7 sequences with >6000 reads were filtered out from the analysis due to incorrect sequence. The editing frequency from the NGS results were determined using the formula: ((Number of reads for the edited dinucleotide)/(total number of reads))×100. Oligonucleotides used for PCR amplifications, Illumina sequencing adaptors and indexes are listed in Table S1.
[0170] Quantification and Statistical Analysis
[0171] Technical duplicates of each sample were performed in each qPCR reaction. A standard curve to determine the concentration of the captured material was generated using predefined concentrations of a DTECT ligation product (
[0172] Data and Code Availability
[0173] R-based scripts of the Ciccia laboratory for analysis of NGS reads and ClinVar datasets are available upon request. Raw NGS reads of edited DLD1 and NIH/3T3 cells, organoids and liver samples are available under accession SRP151111 in the Sequence Read Archive. NGS reads have been deposited into the NCBI database and are and are accessible as BioProject #PRJNA603357. All uncropped gels, raw qPCR data and Sanger sequencing reads are available in Mendeley (https://data.mendeley.com/datasets/gtkk6sthtw/draft?a=ca72630e-56eb-4e29-bcdb-158b2c7d4123).
TABLE-US-00001 KEY RESOURCES TABLE REAGENT or RESOURCE SOURCE IDENTIFIER Bacterial and Virus Strains Subcloning Efficiency DH5α ThermoFisher 1 8265-017 Scientific Chemicals, Peptides, and Recombinant Proteins Q5 High-Fidelity DNA polymerase NEB M0491L T4 DNA ligase ThermoFisher 15224017 Scientific Acul NEB R0641L rSAP NEB M0371L SybrGold (for gel staining) ThermoFisher S-11494 Scientific SybrGold (for qPCR) ThermoFisher 4367659 Scientific BamHI-HF NEB R3136S dNTPs NEB N0447L T4 Polynucleotide Kinase NEB M0201S Critical Commercial Assays Agencourt AMPure XP magnetic beads Beckman Coulter A63881 Zymoclean gel DNA recovery kit Zymo Research D4008 Quick Extract DNA Extraction Solution Epicentre QE09050 Zero BLUNT II TOPO PCR Cloning kit ThermoFisher 450245 Scientific Deposited Data Unprocessed images of gels This disclosure, Raw gel images Mendeley Data Raw Sanger sequencing files This disclosure, Sequences of Mendeley Data BRCA1-2 edited cells; Repeated sequences Raw NGS sequencing files This disclosure, BioProject # NCBI PRJNA603357 Raw and processed qPCR data This disclosure, Raw and Mendeley Data processed qPCR data Raw and processed DTECT, ICE, EditR and This disclosure, Quantification of NGS data Mendeley Data BRCA1-2 variants by DTECT, ICE, EditR and NGS Experimental Models: Cell Lines Human: HEK293T ATCC CRL-11268 Human: DLD1 ATCC CCL-221 Mouse: NIH/3T3 ATCC CRL-1658 Experimental Models: Organisms/Strains Mouse: C57BL/6N Charles River C57BL/6NCrl Mouse: Brca1.sup.S1598F/+ Shakya et al, 2011 N/A Mouse: Bard1.sup.S563F/+ Billing et al, 2018 N/A Mouse: Mx1Cre.sup.+; CD45.1 Mullally et al, 2010 N/A Mouse: Mx1-Cre.sup.+; CD45.2 Jak2.sup.V617F/+ Mullally et al, 2010 N/A Mouse: NRG The Jackson 007799 Laboratory Oligonucleotides Primers for PCR This disclosure Table S1 Oligonucleotides for sgRNA cloning This disclosure Table S1 ssODNs (for HDR) This disclosure Table S1 Oligonucleotides for adaptors This disclosure Table S1 Recombinant DNA Plasmid: B52 (containing 2 empty sgRNAs- Addgene 100708 expressing cassettes) pCMV-PE2 Addgene 132775 pCMV-BE3 Addgene 73021 DTECT - Plasmid for standard curve This disclosure, 139333 Addgene pTOPO-SPRTN WT This disclosure N/A pTOPO-SPRTN STOP This disclosure N/A pTOPO-SMARCAL1 WT This disclosure N/A pTOPO-SMARCAL1 STOP This disclosure N/A pTOPO-PIK3R1 WT This disclosure N/A pTOPO-PIK3R1 STOP This disclosure N/A pX330-U6-Chimeric_BB-CBh-hSpCas9 Addgene 42230 pCDNA3-Flag::UbvG08 I44A, deltaGG Addgene 74939 pU6-Sp-pegRNA-HEK3-CTT_ins Addgene 132778 Plasmids expressing sgRNAs for base editing This disclosure, 139321-139332, of FANCD2, BRCA1 and BRCA2 Addgene and 139511 Software and Algorithms R Studio Desktop IDE 1.0.143 RStudio https://www.rstudio.com Bioconductor R packages Bioconductor https://www.bioconductor.org R 3.4.1 The R project for https://www.r-project.org statistical computing Other ClinVar database NCBI https://www.ncbi. nlm.nih.gov/clinvar/ Li-COR Odyssey N/A https://www.licor. com/bio/products/ imaging_systems/ odyssey q-PCR QuantStudio 3 Applied N/A Biosystems
Example 2
Design of DTECT, a Detection Method Based on the Capture of Dinucleotide Signatures
[0174] In our detection method, we take advantage of the property of type IIS restriction enzymes to generate single-stranded DNA overhangs at a specific distance from their recognition motif. Based on the above property, we hypothesized that single-stranded DNA overhangs generated by digestion of genomic DNA sequences with type IIS restriction enzymes could be captured and identified using DNA adaptors containing overhangs complementary to the exposed DNA signatures (
[0175] In our approach, the genomic locus of interest is PCR-amplified using a locus-specific DNA primer (red) and a DNA oligonucleotide (AcuI-tagging primer) containing two regions of complementarity to the genomic locus (purple) interrupted by an AcuI recognition site (AcuI hairpin, green) positioned 14 bp upstream of a dinucleotide of interest (
Example 3
DTECT Efficiently Captures Dinucleotide Signatures Generated by AcuI-Mediated Digestion
[0176] To demonstrate the feasibility of DTECT, we designed two AcuI-tagging DNA primers flanking four adjacent bases (5′-TTGG-3′) on opposite DNA strands (TT and CC signatures, blue) (
Example 4
DTECT Enables Specific and Sensitive Quantification of DNA Variants
[0177] Next, we examined whether DTECT can determine the relative abundance of DNA variants with distinct DNA signatures, including low abundance DNA variants. To this end, we transfected HEK293T cells with sgRNAs that introduce nonsense mutations into the SPRTN, PIK3R1 and SMARCAL1 genes using iSTOP, a CRISPR-mediated base editing approach that creates STOP codons within genes of interest (Billon et al., 2017) (
Example 5
DTECT Accurately Identifies Genomic Changes Introduced by CRISPR-Dependent HDR, Base Editing and Prime Editing in Mammalian Cells
[0178] To examine the ability of DTECT to identify precise genomic changes introduced into mammalian cell populations, we utilized CRISPR-mediated HDR for generating various types of disease-related mutations using single-stranded oligodeoxynucleotides (ssODNs), including a cancer-associated frameshift mutation in TP53 (i.e., R209fs*6), a missense mutation in HBB (i.e., G6V) that causes sickle cell anemia, a small tandem duplication in BRCA2 (dupAGAAGAT) identified in breast cancer, and small insertions into JAK2 and EMX1 (Paulsen et al., 2017), two genes associated with myeloproliferative disorders and Kallmann syndrome, respectively. Three days after co-transfection of Cas9 with site-specific sgRNAs and ssODNs into HEK293T cells, we harvested the cellular genomic DNA and utilized DTECT to determine by analytical and quantitative PCR whether the desired changes were incorporated into the targeted chromosomal loci (
[0179] To further confirm the accuracy of DTECT in quantifying precision genome editing, we compared the frequency of editing events determined by either DTECT or NGS across 62 samples derived from human cells, mouse cells and intestinal organoids, which were modified using CRISPR-mediated HDR or base editing (Zafra et al., 2018). As shown in
[0180] Recent studies led to the development of Sanger sequencing-based methods, such as ICE (Synthego; https://ice.synthego.com/#/) or EditR (Kluesner et al., 2018), that enable the detection of genomic variants based on the deconvolution of chromatogram peaks. To compare DTECT with the above methods, we subjected to Sanger sequencing the genomic amplicons of 23 samples edited by precision genome editing. In these experiments, we used two primers annealing to opposite DNA strands to obtain independent sequencing duplicates of the same amplicons, and analyzed the Sanger sequencing reads using either ICE or EditR. Notably, ˜10% of the sequencing reactions failed to generate high quality reads required for ICE or EditR, despite using high quality amplicons for sequencing (Mendeley dataset, Data availability section). Independent repeats using new genomic amplicons did not improve the sequencing outcome (Mendeley dataset, Data availability section). In addition, we noted that technical duplicates of Sanger sequencing reactions analyzed by ICE or EditR displayed lower levels of consistency relative to technical replicates of DTECT assays (
Example 6
DTECT Enables the Identification of Precision Genome Editing Events In Vivo
[0181] The modeling and correction of pathogenic mutations in adult mice is critical for the development of novel approaches to therapeutic intervention against cancer and other diseases (Chadwick et al., 2017; Gao et al., 2018; Levy et al., 2020; Ryu et al., 2018; Song et al., 2020; Villiger et al., 2018; Yin et al., 2016; Yin et al., 2014). To determine whether DTECT can determine editing levels in adult mouse tissue, we hydrodynamically delivered into the mouse liver (Tschaharganeh et al., 2014) a cytidine base editor and an sgRNA introducing the oncogenic Pik3ca E545K mutation (Zafra et al., 2018) (
Example 7
DTECT is Capable of Identifying Multiple Genome Editing Events Occurring within a Single Locus or Distinct Loci
[0182] The above studies indicate that DTECT can determine the identity of individual genomic changes. To examine whether DTECT can also identify complex sets of mutations, we employed CRISPR-dependent base editing to target two adjacent cytosines in the EMX1 locus that had previously been converted into four distinct dinucleotide combinations (i.e., CC, CT, TC or TT) by base editing (Komor et al., 2016) (
Example 8
DTECT Expedites the Derivation of Marker-Free Cell Lines Carrying Clinically Relevant Mutations and Facilitates the Genotyping of Cellular and Animal Disease Models
[0183] Precision genome editing allows the modeling of clinically relevant gene variants. Given that DTECT enables the identification of newly created DNA signatures without requiring the insertion of markers or elaborate experimental design specific for each edited site, we tested whether DTECT could facilitate the generation of multiple cell lines harboring clinically relevant mutations. In particular, we focused our attention on mutations in the BRCA1 and BRCA2 genes, which in heterozygosity can predispose women to the development of breast and/or ovarian cancer (Apostolou and Fostira, 2013), whereas in homozygosity can cause Fanconi anemia (Ceccaldi et al., 2016). More than 7,000 clinically associated SNVs have been identified in BRCA1/2, according to the ClinVar database, but efforts to characterize their functional impact and pathogenic potential have been limited in part due to the challenge of generating cell lines that carry such a large number of individual homozygous and heterozygous variants. To determine whether DTECT can facilitate the production of cell lines harboring clinically relevant BRCA1/2 SNVs, we expressed a cytidine base editor in HEK293T cells along with individual sgRNAs to generate 23 different BRCA1/2 mutations identified in patients with ovarian and breast cancers, as reported in ClinVar (
[0184] Given the ability of DTECT to correctly determine the genotype of cellular clones, we then tested whether DTECT could also be applied to mouse genotyping. To this end, we obtained tail DNA samples from genetically engineered mice carrying knock-in mutations in Brca1 (S1598F) and its partner protein Bard1 (S563F) (Billing et al., 2018). As shown in
Example 9
DTECT Identifies the Presence of Oncogenic Mutations in Cancer Mouse Models and Human Cancer Patient Samples
[0185] Precise and rapid detection of pathogenic variants in patients is critical for accurate diagnosis and personalized therapy. Given the ability of DTECT to identify genetic variants rapidly and accurately, we tested whether DTECT could be utilized to expedite the identification of pathogenic variants in pre-clinical and clinical settings. In particular, we examined whether DTECT could identify the presence of oncogenic variants in various biological systems. In our studies we focused our attention on the JAK2 V617F variant, which is present in the majority of patients with myeloproliferative neoplasm (MPN) (Levine et al., 2005). Mice transplanted with Jak2 V617F mutant bone marrow cells develop MPN and recapitulate the human disease (Mullally et al., 2010). Therefore, we analyzed the Jak2 V617F variant in the peripheral blood of mice transplanted with a mixture of bone marrow cells that do or do not carry an inducible Jak2 V617F variant (Bhagwat et al., 2014) (
[0186] We next examined whether DTECT can identify the presence of specific oncogenic mutations in human samples from patients diagnosed with acute lymphoblastic leukemia (ALL), the most common form of childhood cancer (Inaba et al., 2013). Although most ALL patients respond to chemotherapy, ˜20% suffer a relapse as a result of resistance to chemotherapy (Bhojwani and Pui, 2013). Moreover, secondary genetic alterations that promote chemoresistance, including mutations in the NT5C2 gene (Tzoneva et al., 2018; Tzoneva et al., 2013), are found in a large fraction of ALL relapse cases (Dieck and Ferrando, 2019; Oshima et al., 2016). To test whether DTECT can identify these relapse-specific oncogenic signatures, we obtained matched DNA samples from the bone marrow of ALL patients at diagnosis and relapse and analyzed them for the presence of three common NT5C2 mutations (R238W, K359Q and R367Q) (
Example 10
Discussion
[0187] In this study, we established DTECT as a sensitive method for the identification of genomic DNA signatures. In particular, we show that DTECT readily identifies precision genome editing events induced by CRISPR-dependent HDR, base editing and prime editing, including low abundance and complex genomic changes. In addition, we show that DTECT can be employed to identify pathogenic lesions of interest, such as oncogenic mutations, in cancer mouse models, PDXs, and cancer patient specimens. DTECT is a rapid (˜4-5 hours) and easy-to-perform detection method that relies on standard molecular biology techniques (PCR, DNA digestion and ligation) and common laboratory reagents. This methodology is also not labor-intensive, given that it entails short periods (5-10 min) of sample processing followed by hands-free incubations. Importantly, DTECT assays utilize a unique and common set of adaptors that includes positive and negative controls to ensure specificity and accuracy. The ease, speed and cost efficiency by which DTECT identifies genetic variants in a wide variety of cellular and animal systems (e.g., cell lines, organoids, animal models, patient samples) should facilitate the generation and study of biological models of human diseases and expedite the detection of pathogenic variants for both pre-clinical and clinical applications.
[0188] Although highly robust, DTECT has three potential limitations. First, AcuI-induced dinucleotide byproducts can be generated if a genomic AcuI restriction site located in close proximity to the targeted dinucleotide is incorporated into the amplicon of the targeted locus. However, an analysis of the ClinVar database revealed that genomic AcuI sites occur relatively infrequently and 95% of clinically relevant variants (404,393 variants) are compatible with DTECT (
[0189] In addition to its ease of use, speed and cost efficiency, DTECT has several advantages compared to other detection methods. A major benefit of DTECT is its versatility, which allows the detection and quantification of nucleotide substitutions, precise base insertions and deletions using the same small set of 16 predefined adaptors (
[0190] The ability to model clinically relevant mutations in a marker-free manner is critical for assessing their potential pathogenicity, especially in the case of genes, such as BRCA1 and BRCA2, which have thousands of clinically-associated SNVs. Recent studies have led to the development of high-throughput saturation genome editing (SGE) to examine en masse the pathogenicity of BRCA1 variants (Findlay et al., 2018). Although highly useful for classifying BRCA1 SNVs, SGE requires the use of haploid cells and is therefore not compatible with the study of the functional impact of BRCA1 mutations in heterozygosity, as observed in BRCA1 mutation carriers (Apostolou and Fostira, 2013). BRCA1/2 heterozygous mutations have been recently shown to cause genome instability induced by DNA replication stress (Billing et al., 2018; Pathania et al., 2014; Tan et al., 2017). By facilitating the derivation of both heterozygous and homozygous BRCA1/2 mutant cells and animal models (
[0191] In addition to facilitating precision genome editing, we showed that DTECT can also be used to detect pathogenic variants in pre-clinical and clinical settings. In particular, DTECT can rapidly identify the presence of oncogenic variants in cancer mouse models (
[0192] Collectively, our work established DTECT as a facile, rapid and cost-effective method for identifying genomic variants in various biological systems, such as mammalian cell lines, organoids, mouse tissues, PDX models and human patient samples. Given the growing number of genetic variants identified in the human population (Lek et al., 2016) and in human genetic disorders (McClellan and King, 2010), this versatile method for the detection of genomic signatures should facilitate the study of human genetic variation and expedite the diagnosis and treatment of human disease.
TABLE-US-00002 TABLE S1 Primers, ssODNs, adaptors and other oligos used in this disclosure. Detection primers Sequence (5′- -> 3′) Notes PB547 gatcctctagagtcgacctg (SEQ ID NO: 1) Oligos for detection (step VI) PB548 cgggtaccgagctcgaattc (SEQ ID NO: 2) Oligos for detection (step VI) PB1072 gcaattcctcacgagacccgtcctg (SEQ ID NO: 3) Oligos for detection (step VI) - Only these oligos were used for qPCR PB1073 cgggtaccgagctcgaattcttagaag (SEQ ID NO: 4) Oligos for detection (step VI) - Only these oligos were used for qPCR AcuI- tagging Sequence (5′- -> 3′): Handle for primers detection-gDNA-AcuI hairpin-gDNA Notes PB1021 gatcctctagagtcgacctgGGAGTCCCTGTCGCTAGTGG AcuI for signature CTGAAGACGCGTCGTGGGAG (SEQ ID NO: 5) TT PB1022 gatcctctagagtcgacctgACAAACAGTGCCTGCAAGTCC AcuI for signature TGAAGCGGTGTGGGGTCCA (SEQ ID NO: 6) CC PB1071 GCAATTCCTCACGAGACCCGTCCTGATTTCAGGG AcuI for PIK3R1- AAGAAGCTGAAGTGAATGAAAAACTT (SEQ ID NO: STOP 7) PB1153 GCAATTCCTCACGAGACCCGTCCTGTGTAGTTTTA AcuI for JAK2 CTTACCTGAAGTCTCGTCTCCACAG (SEQ ID NO: (HDR) 8) PB1151 GCAATTCCTCACGAGACCCGTCCTGAGGACATCG AcuI for EMX1 ATGTCACTGAAGCCTCCAATGACTAG (SEQ ID NO: (HDR) 9) PB1019 gatcctctagagtcgacctgAAACGGCAGAAGCTGGAGGA AcuI for EMX1 CTGAAGGGAAGGGCCTGAGT (SEQ ID NO: 10) (Base editing) PB1080 GCAATTCCTCACGAGACCCGTCCTGGTTCAGTTTA AcuI for SPRTN- ACGACCTGAAGCAATTCTTCTGGGG (SEQ ID NO: STOP 11) PB1149 GCAATTCCTCACGAGACCCGTCCTGTGTGTTCACT AcuI for HBB AGCAACTGAAGCCTCAAACAGACAC (SEQ ID NO: (HDR) 12) PB1211 GCAATTCCTCACGAGACCCGTCCTGGAGGAGGAG AcuI for TCOF1 GCCCCTCTGAAGGCAGGGACACGAAG (SEQ ID (Base editing) NO: 13) oligo plate GAT CCT CTA GAG TCG ACC TGC CAA ATT ATA BRCA1 C64Y TAC CTT TTG GCT GAA GTT ATA TCA TTC TTA AcuI (SEQ ID NO: 14) oligo plate GAT CCT CTA GAG TCG ACC TGT CTT CAC TGC BRCA1 E638K TAG AAC AAC TCT GAA GAT CAA TTT GCA ATT AcuI (SEQ ID NO: 15) oligo plate GAT CCT CTA GAG TCG ACC TGA TAT TGC TTG BRCA1 E1033K AGC TGG CTT CCT GAA GTT TAA AAA CAT TTT AcuI (SEQ ID NO: 16) oligo plate GAT CCT CTA GAG TCG ACC TGG GTT CAG CTT BRCA1 E575K TCG TTT TGA ACT GAA GAG CAG ATT CTT TTT AcuI (SEQ ID NO: 17) oligo plate GAT CCT CTA GAG TCG ACC TGT CCT CTA GCA BRCA1 V990I GAT TTT TCT TCT GAA GAC ATT TAG TTT TAA AcuI (SEQ ID NO: 18) oligo plate GAT CCT CTA GAG TCG ACC TGG GAA AGA ATG BRCA1 T922I AGT CTA ATA TCT GAA GCA AGC CTG TAC AGA AcuI (SEQ ID NO: 19) oligo plate GAT CCT CTA GAG TCG ACC TGC ATC ATT ACC BRCA1 D67N AAA TTA TAT ACT GAA GCC TTT TGG TTA TAT AcuI (SEQ ID NO: 20) oligo plate GAT CCT CTA GAG TCG ACC TGG AGG GAG GGA BRCA1 E1754K GCT TTA CCT TCT GAA GTC TGT CCT GGG ATT AcuI (SEQ ID NO: 21) oligo plate GAT CCT CTA GAG TCG ACC TGG AAG AAA ATA BRCA1 S1363L ATC AAG AAG ACT GAA GGC AAA GCA TGG ATT AcuI (SEQ ID NO: 22) oligo plate GAT CCT CTA GAG TCG ACC TGG CAG TGA TTT BRCA1 Q1779* TAC ATC TAA ACT GAA GTG TCC ATT TTA GAT AcuI (SEQ ID NO: 23) oligo plate GAT CCT CTA GAG TCG ACC TGG ATG GAG AAG BRCA2 R2842C ACA TCA TCT GCT GAA GGA TTA TAC ATA TTT AcuI (SEQ ID NO: 24) oligo plate GAT CCT CTA GAG TCG ACC TGT GAA TCT TTT BRCA2 R2973H TCT TTT TTT GCT GAA GAA TAG CTT ACA ATA AcuI (SEQ ID NO: 25) oligo plate GAT CCT CTA GAG TCG ACC TGC TGA GTA TTT BRCA2 S2998F GGC GTC CAT CCT GAA GAT CAG ATT TAT ATT AcuI (SEQ ID NO: 26) oligo plate GAT CCT CTA GAG TCG ACC TGC AAA TTT TTA BRCA2 S3070F GAT CCA GAC TCT GAA GTC AGC CAT CTT GTT AcuI (SEQ ID NO: 27) oligo plate GAT CCT CTA GAG TCG ACC TGA GTG CAA ATT BRCA2 E2772K AAT TTA CCT TCT GAA GTA ACA TAA GAG ATT AcuI (SEQ ID NO: 28) oligo plate GAT CCT CTA GAG TCG ACC TGG GAA TAT TTG BRCA2 T1707I ATG GTC AAC CCT GAA GAG AAA GAA TAA ATA AcuI (SEQ ID NO: 29) oligo plate GAT CCT CTA GAG TCG ACC TGA TCT TGT TCT BRCA2 V3079I GAG GTG GAC CCT GAA GTA ATA GGA TTT GTC AcuI (SEQ ID NO: 30) oligo plate GAT CCT CTA GAG TCG ACC TGT AGG AAG GCC BRCA2 Q2960* ATG GAA TCT GCT GAA GCT GAA CAA AAG GAA AcuI (SEQ ID NO: 31) oligo plate GAT CCT CTA GAG TCG ACC TGA ACT GAA GCC BRCA2 T544I TCT GAA AGT GCT GAA GAC TGG AAA TAC ATA AcuI (SEQ ID NO: 32) oligo plate GAT CCT CTA GAG TCG ACC TGT TTA CCA TCA BRCA2 R2896C CGT GCA CTA ACT GAA GCA AGA CAG CAA GTT AcuI (SEQ ID NO: 33) oligo plate GAT CCT CTA GAG TCG ACC TGT GGA AGC TGG BRCA2 V572I CCA GCC ACC ACT GAA GCC ACA CAG AAT TCT AcuI (SEQ ID NO: 34) oligo plate GAT CCT CTA GAG TCG ACC TGT TGC CTC TAG BRCA2 V778I AAA TCA TGA CCT GAA GTA GGT TTG ACA GAA AcuI (SEQ ID NO: 35) oligo plate GAT CCT CTA GAG TCG ACC TGT TTC TCT TAT BRCA2 V2102I CAA CAC GAG GCT GAA GAA GTA TTT TTG ATA AcuI (SEQ ID NO: 36) AA1 GAT CCT CTA GAG TCG ACC TGC AAA CGA CGA For adaptor library GCG TGA CAC CCT GAA GAC GAT GCC TGT AGC testing (SEQ ID NO: 37) AA2 GAT CCT CTA GAG TCG ACC TGT CGT TGG GAA For adaptor library CCG GAG CTG ACT GAA GAT GAA GCC ATA CCA testing (SEQ ID NO: 38) AC1 GAT CCT CTA GAG TCG ACC TGG AGC TGA ATG For adaptor library AAG CCA TAC CCT GAA GAA ACG ACG AGC GTG testing (SEQ ID NO: 39) AC2 GAT CCT CTA GAG TCG ACC TGG CTG AAT GAA For adaptor library GCC ATA CCA ACT GAA GAC GAC GAG CGT GAC testing (SEQ ID NO: 40) AG1 GAT CCT CTA GAG TCG ACC TGG AAC CGG AGC For adaptor library TGA ATG AAG CCT GAA GCA TAC CAA ACG ACG testing (SEQ ID NO: 41) AG2 GAT CCT CTA GAG TCG ACC TGT ACC AAA CGA For adaptor library CGA GCG TGA CCT GAA GAC CAC GAT GCC TGT testing (SEQ ID NO: 42) AT1 GAT CCT CTA GAG TCG ACC TGT GAA GCC ATA For adaptor library CCA AAC GAC GCT GAA GAG CGT GAC ACC ACG testing (SEQ ID NO: 43) AT2 GAT CCT CTA GAG TCG ACC TGA AAC GAC GAG For adaptor library CGT GAC ACC ACT GAA GCG ATG CCT GTA GCA testing (SEQ ID NO: 44) CA1 GAT CCT CTA GAG TCG ACC TGG ATC GTT GGG For adaptor library AAC CGG AGC TCT GAA GGA ATG AAG CCA TAC testing (SEQ ID NO: 45) CA2 GAT CCT CTA GAG TCG ACC TGA GCT GAA TGA For adaptor library AGC CAT ACC ACT GAA GAA CGA CGA GCG TGA testing (SEQ ID NO: 46) CC1 GAT CCT CTA GAG TCG ACC TGC TGA ATG AAG For adaptor library CCA TAC CAA ACT GAA GCG ACG AGC GTG ACA testing (SEQ ID NO: 47) CC2 GAT CCT CTA GAG TCG ACC TGA GCC ATA CCA For adaptor library AAC GAC GAG CCT GAA GGT GAC ACC ACG ATG testing (SEQ ID NO: 48) CG1 GAT CCT CTA GAG TCG ACC TGA CCG GAG CTG For adaptor library AAT GAA GCC ACT GAA GTA CCA AAC GAC GAG testing (SEQ ID NO: 49) CG2 GAT CCT CTA GAG TCG ACC TGA ATG AAG CCA For adaptor library TAC CAA ACG ACT GAA GCG AGC GTG ACA CCA testing (SEQ ID NO: 50) CT1 GAT CCT CTA GAG TCG ACC TGG CCA TAC CAA For adaptor library ACG ACG AGC GCT GAA GTG ACA CCA CGA TGC testing (SEQ ID NO: 51) CT2 GAT CCT CTA GAG TCG ACC TGT CAT GTA ACT For adaptor library CGC CTT GAT CCT GAA GGT TGG GAA CCG GAG testing (SEQ ID NO: 52) GA1 GAT CCT CTA GAG TCG ACC TGG GAG CTG AAT For adaptor library GAA GCC ATA CCT GAA GCA AAC GAC GAG CGT testing (SEQ ID NO: 53) GA2 GAT CCT CTA GAG TCG ACC TGG GAA CCG GAG For adaptor library CTG AAT GAA GCT GAA GCC ATA CCA AAC GAC testing (SEQ ID NO: 54) GC1 GAT CCT CTA GAG TCG ACC TGA ACC GGA GCT For adaptor library GAA TGA AGC CCT GAA GAT ACC AAA CGA CGA testing (SEQ ID NO: 55) GC2 GAT CCT CTA GAG TCG ACC TGA AGC CAT ACC For adaptor library AAA CGA CGA GCT GAA GCG TGA CAC CAC GAT testing (SEQ ID NO: 56) GG1 GAT CCT CTA GAG TCG ACC TGA CGA CGA GCG For adaptor library TGA CAC CAC GCT GAA GAT GCC TGT AGC AAT testing (SEQ ID NO: 57) GG2 GAT CCT CTA GAG TCG ACC TGA GCA ATG GCA For adaptor library ACA ACG TTG CCT GAA GGC AAA CTA TTA ACT testing (SEQ ID NO: 58) GT1 GAT CCT CTA GAG TCG ACC TGC CGG AGC TGA For adaptor library ATG AAG CCA TCT GAA GAC CAA ACG ACG AGC testing (SEQ ID NO: 59) GT2 GAT CCT CTA GAG TCG ACC TGC ATA CCA AAC For adaptor library GAC GAG CGT GCT GAA GAC ACC ACG ATG CCT testing (SEQ ID NO: 60) TA1 GAT CCT CTA GAG TCG ACC TGC TTG ATC GTT For adaptor library GGG AAC CGG ACT GAA GGC TGA ATG AAG CCA testing (SEQ ID NO: 61) TA2 GAT CCT CTA GAG TCG ACC TGA TAC CAA ACG For adaptor library ACG AGC GTG ACT GAA GCA CCA CGA TGC CTG testing (SEQ ID NO: 62) TC1 GAT CCT CTA GAG TCG ACC TGc cgc ttt ttt For adaptor library (PB1040) gca caa cat gCT GAA Ggg gga tca tgt aac testing (SEQ ID NO: 63) TC2 GAT CCT CTA GAG TCG ACC TGC GTT GCG CAA For adaptor library ACT ATT AAC TCT GAA GGG CGA ACT ACT TAC testing (SEQ ID NO: 64) TG1 GAT CCT CTA GAG TCG ACC TGC GGA GCT GAA For adaptor library TGA AGC CAT ACT GAA GCC AAA CGA CGA GCG testing (SEQ ID NO: 65) TG2 gat cct cta gag tcg acc tgc cat acc aaa For adaptor library (PB1070) cga cga gcg tCT GAA Gga cac cac gat gcc testing (SEQ ID NO: 66) TT1 GAT CCT CTA GAG TCG ACC TGT GAC ACC ACG For adaptor library ATG CCT GTA GCT GAA GCA ATG GCA ACA ACG testing (SEQ ID NO: 67) TT2 GAT CCT CTA GAG TCG ACC TGG CCT GTA GCA For adaptor library ATG GCA ACA ACT GAA GCG TTG CGC AAA CTA testing (SEQ ID NO: 68) PB1477 GCAATTCCTCACGAGACCCGTCCTGACCTGAGTT FANCD2 AcuI CTTTCCCTGAAGCCACATCAGCGTGC (SEQ ID NO: 69) PB1257 GATCCTCTAGAGTCGACCTGCCGCAGAGCTGAGA SMARCAL1 AcuI AGTTATCTGAAGTGGCAGAACAGCAT (SEQ ID NO: 70) PB1264 gatcctctagagtcgacctgGTTTTCATTTCAGGGAAGAAC PIK3R1 TGAAGGTGAATGAAAAACT (SEQ ID NO: 71) signatures PB1265 gatcctctagagtcgacctgTCTCGTACCAAAAAGGTCCCC PIK3R1 TGAAGGTCTGCTGTATCTC (SEQ ID NO: 72) signatures PB1266 gatcctctagagtcgacctgATCTCGTACCAAAAAGGTCCC PIK3R1 TGAAGCGTCTGCTGTATCT (SEQ ID NO: 73) signatures PB1010 gatcctctagagtcgacctgTTTTCATTTCAGGGAAGAAGC PIK3R1 TGAAGTGAATGAAAAACTT (SEQ ID NO: 74) signatures PB1433 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC AA-Oligo to test TGAAGaactggcgaactacAA (SEQ ID NO: 75) dinucleotide capture efficiency (DTECT) PB1434 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC AC-Oligo to test TGAAGaactggcgaactacAC (SEQ ID NO: 76) dinucleotide capture efficiency (DTECT) PB1435 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC AG-Oligo to test TGAAGaactggcgaactacAG (SEQ ID NO: 77) dinucleotide capture efficiency (DTECT) PB1436 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC AT-Oligo to test TGAAGaactggcgaactacAT (SEQ ID NO: 78) dinucleotide capture efficiency (DTECT) PB1437 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC CA-Oligo to test TGAAGaactggcgaactacCA (SEQ ID NO: 79) dinucleotide capture efficiency (DTECT) PB1438 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC CC-Oligo to test TGAAGaactggcgaactacCC (SEQ ID NO: 80) dinucleotide capture efficiency (DTECT) PB1439 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC CG-Oligo to test TGAAGaactggcgaactacCG (SEQ ID NO: 81) dinucleotide capture efficiency (DTECT) PB1440 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC CT-Oligo to test TGAAGaactggcgaactacCT (SEQ ID NO: 82) dinucleotide capture efficiency (DTECT) PB1441 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC GA-Oligo to test TGAAGaactggcgaactacGA (SEQ ID NO: 83) dinucleotide capture efficiency (DTECT) PB1442 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC GC-Oligo to test TGAAGaactggcgaactacGC (SEQ ID NO: 84) dinucleotide capture efficiency (DTECT) PB1443 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC GG-Oligo to test TGAAGaactggcgaactacGG (SEQ ID NO: 85) dinucleotide capture efficiency (DTECT) PB1444 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC GT-Oligo to test TGAAGaactggcgaactacGT (SEQ ID NO: 86) dinucleotide capture efficiency (DTECT) PB1445 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC TA-Oligo to test TGAAGaactggcgaactacTA (SEQ ID NO: 87) dinucleotide capture efficiency (DTECT) PB1446 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC TC-Oligo to test TGAAGaactggcgaactacTC (SEQ ID NO: 88) dinucleotide capture efficiency (DTECT) PB1447 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC TG-Oligo to test TGAAGaactggcgaactacTG (SEQ ID NO: 89) dinucleotide capture efficiency (DTECT) PB1448 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattC TT-Oligo to test TGAAGaactggcgaactacTT (SEQ ID NO: 90) dinucleotide capture efficiency (DTECT) PB1449 gtagttcgccagttCTTCAGaatagtttgcgcaCAGGACGGGT Complementary CTCGTGAGGAATTGC (SEQ ID NO: 91) 5′-phosphorylated oligo PB1321 GCAATTCCTCACGAGACCCGTCCTGGTGGCTCCA mouse Pik3ca TAGGAACTGAAGGTCTTTCTCTTGTT (SEQ ID NO: (545) AcuI 92) PB1380 GCAATTCCTCACGAGACCCGTCCTGTTATATACCT BRCA1 Cys64Tyr TTTGGCTGAAGTTATATCATTCTTA (SEQ ID NO: AcuI 93) PB1381 GCAATTCCTCACGAGACCCGTCCTGACTGCTAGA BRCA1 ACAACTCTGAAGATCAATTTGCAATT (SEQ ID NO: Glu638Lys AcuI 94) PB1382 GCAATTCCTCACGAGACCCGTCCTGGCTTGAGCT BRCA1 GGCTTCCTGAAGTTTAAAAACATTTT (SEQ ID NO: Glu1033Lys AcuI 95) PB1383 GCAATTCCTCACGAGACCCGTCCTGAGCTTTCGTT BRCA1 TTGAACTGAAGAGCAGATTCTTTTT (SEQ ID NO: Glu575Lys AcuI 96) PB1386 GCAATTCCTCACGAGACCCGTCCTGTAGCAGATTT BRCA1 Va1990Ile TTCTTCTGAAGACATTTAGTTTTAA (SEQ ID NO: AcuI 97) PB1388 GCAATTCCTCACGAGACCCGTCCTGGAATGAGTC BRCA1 Thr922Ile TAATATCTGAAGCAAGCCTGTACAGA (SEQ ID NO: AcuI 98) PB1389 GCAATTCCTCACGAGACCCGTCCTGTTACCAAATT BRCA1 Asp67Asn ATATACTGAAGCCTTTTGGTTATAT (SEQ ID NO: AcuI 99) PB1390 GCAATTCCTCACGAGACCCGTCCTGAGGGAGCTT BRCA1 TACCTTCTGAAGTCTGTCCTGGGATT (SEQ ID NO: Glu1754Lys AcuI 100) PB1393 GCAATTCCTCACGAGACCCGTCCTGAAATAATCAA BRCA1 GAAGACTGAAGGCAAAGCATGGATT (SEQ ID NO: Ser1363Leu AcuI 101) PB1394 GCAATTCCTCACGAGACCCGTCCTGGATTTTACAT BRCA1 CTAAACTGAAGTGTCCATTTTAGAT (SEQ ID NO: Gln1779Ter AcuI 102) PB1396 GCAATTCCTCACGAGACCCGTCCTGAGAAGACAT BRCA2 CATCTGCTGAAGGATTATACATATTT (SEQ ID NO: Arg2842Cys AcuI 103) PB1397 GCAATTCCTCACGAGACCCGTCCTGCTTTTTCTTT BRCA2 TTTTGCTGAAGAATAGCTTACAATA (SEQ ID NO: Arg2973His AcuI 104) PB1398 GCAATTCCTCACGAGACCCGTCCTGTATTTGGCG BRCA2 TCCATCCTGAAGATCAGATTTATATT (SEQ ID NO: Ser2998Phe AcuI 105) PB1399 GCAATTCCTCACGAGACCCGTCCTGTTTTAGATCC BRCA2 AGACTCTGAAGTCAGCCATCTTGTT (SEQ ID NO: Ser3070Phe AcuI 106) PB1400 GCAATTCCTCACGAGACCCGTCCTGAAATTAATTT BRCA2 ACCTTCTGAAGTAACATAAGAGATT (SEQ ID NO: Glu2772Lys AcuI 107) PB1401 GCAATTCCTCACGAGACCCGTCCTGATTTGATGGT BRCA2 CAACCCTGAAGAGAAAGAATAAATA (SEQ ID NO: Thr17071Ie AcuI 108) PB1402 GCAATTCCTCACGAGACCCGTCCTGGTTCTGAGG BRCA2 TGGACCCTGAAGTAATAGGATTTGTC (SEQ ID NO: Va13079Ile AcuI 109) PB1403 GCAATTCCTCACGAGACCCGTCCTGAGGCCATGG BRCA2 AATCTGCTGAAGCTGAACAAAAGGAA (SEQ ID NO: Gln2960Ter AcuI 110) PB1405 GCAATTCCTCACGAGACCCGTCCTGAAGCCTCTG BRCA2 Thr544Ile AAAGTGCTGAAGACTGGAAATACATA (SEQ ID NO: AcuI 111) PB1406 GCAATTCCTCACGAGACCCGTCCTGCTTATCAACA BRCA2 CGAGGCTGAAGAAGTATTTTTGATA (SEQ ID NO: Va12102Ile AcuI 112) PB1407 GCAATTCCTCACGAGACCCGTCCTGCATCACGTG BRCA2 CACTAACTGAAGCAAGACAGCAAGTT (SEQ ID NO: Arg2896Cys AcuI 113) PB1408 GCAATTCCTCACGAGACCCGTCCTGGCTGGCCAG BRCA2 Val572Ile CCACCACTGAAGCCACACAGAATTCT (SEQ ID NO: AcuI 114) PB1409 GCAATTCCTCACGAGACCCGTCCTGTCTAGAAATC BRCA2 Val778Ile ATGACCTGAAGTAGGTTTGACAGAA (SEQ ID NO: AcuI 115) PB1509 GCAATTCCTCACGAGACCCGTCCTGGCATTTTCTG Bard1 S563F AcuI CTGCTCTGAAGGTGAAGAAAGCCCA (SEQ ID NO: 116) PB1513 GCAATTCCTCACGAGACCCGTCCTGgagcggatagag Brca1 S1598F acaCTGAAGtatccatggtggtg (SEQ ID NO: 117) AcuI PB1483 GCAATTCCTCACGAGACCCGTCCTGTGTGCGAGT NT5C2 R367Q TCAGGACTGAAGATCACCAAAAAAGT (SEQ ID NO: AcuI 118) PB1486 GCAATTCCTCACGAGACCCGTCCTGTTGGAGATC NT5C2 K359Q ACATTTCTGAAGTTGGGGACATTTTA (SEQ ID NO: AcuI 119) PB1493 GCAATTCCTCACGAGACCCGTCCTGTTTCAGGGA NT5C2 R238W AAACTGCTGAAGCCTTTGCTTCTGAG (SEQ ID NO: AcuI 120) PB1296 GCAATTCCTCACGAGACCCGTCCTGTGATACTGA BRCA2 AATTGACTGAAGTAGAAGCAGAAGAT (SEQ ID NO: dupAGAAGAT 121) AcuI PB1473 GCAATTCCTCACGAGACCCGTCCTGGCCAGCGAG TIMELESS AcuI AGATGGCTGAAGCAGAAAAGAAGACT (SEQ ID NO: 122) PB1476 GCAATTCCTCACGAGACCCGTCCTGGGGCAGCG SLX4 AcuI GGTGCCGCTGAAGGCGAGGACGCTGAC (SEQ ID NO: 123) PB1472 GCAATTCCTCACGAGACCCGTCCTGACGTTTACG FANCM AcuI GCCAGTCTGAAGTCTACCCATTCGTT (SEQ ID NO: 124) PB1427 GCAATTCCTCACGAGACCCGTCCTGGAAGCTCGG FANCF AcuI AAAAGCCTGAAGGATCCAGGTGCTGC (SEQ ID NO: 125) PB1430 GCAATTCCTCACGAGACCCGTCCTGATGTAGAATT AcuI Apc. 1529 AAGAACTGAAGTCATGCCTCCAGTT (SEQ ID NO: 126) PB1431 GCAATTCCTCACGAGACCCGTCCTGCCCGGGGCA AcuI Apc. 492 TTTCATCTGAAGCCCAGGAGCTAGGT (SEQ ID NO: 127) PB1318 GCAATTCCTCACGAGACCCGTCCTGTTGAGAGTC AcuI Apc. 1405 GCTCCACTGAAGTTGCCAGCTCTGTT (SEQ ID NO: 128) PB1332 GCAATTCCTCACGAGACCCGTCCTGAGCATTTGG AcuI Jak2 #1 TTTTGACTGAAGATTATGGTGTCTGT (SEQ ID NO: 129) PB1333 GCAATTCCTCACGAGACCCGTCCTGCTGGCTTTA AcuI Jak2 #2 CTTACTCTGAAGCTCCTCTCCACAGA (SEQ ID NO: 130) PB1460 GCAATTCCTCACGAGACCCGTCCTGAAGCATTTG AcuI Jak2 #3 GTTTTGCTGAAGAATTATGGTGTCTG (SEQ ID NO: 131) PB1461 GCAATTCCTCACGAGACCCGTCCTGGCTGGCTTT AcuI Jak2 #4 ACTTACCTGAAGTCTCCTCTCCACAG (SEQ ID NO: 132) PB1545 GCAATTCCTCACGAGACCCGTCCTGGAAGCAGGG AcuI HEK3 CTTCCTCTGAAGTTCCTCTGCCATCA (SEQ ID NO: 133) PB1301 GCAATTCCTCACGAGACCCGTCCTGGAAATTTGC AcuI TP53 R209fs GTGTGGCTGAAGAGTATTTGGATGAC (SEQ ID NO: delGA 134) PB1535 GCAATTCCTCACGAGACCCGTCCTGAACCAGACC AcuI TP53 delAG TCAGGCCTGAAGGGCTCATAGGGCAC (SEQ ID (PAM) NO: 135) Standard PCR primers Sequence (5′- ->3′) Notes Ampicillin CCA ATG CTT AAT CAG TGA GG (SEQ ID For adaptor library reverse NO: 136) testing AcuI-tagging AAT CGC TTG ATC ACA GAT GTA TGT A PCR BRCA1 C64Y oligo reverse (SEQ ID NO: 137) and BRCA1 D67N AcuI-tagging GAA GAC AAA ATA TTT GGG AAA ACC T PCR BRCA1 E638K oligo reverse (SEQ ID NO: 138) and BRCA1 E575K AcuI-tagging TCT CGT TAC TGG AAG TTA GCA CTC T PCR BRCA1 oligo reverse (SEQ ID NO: 139) E1033K and BRCA1 AcuI-tagging ATT TCA CCA TCA TCT AAC AGG TCA T V990I oligo reverse (SEQ ID NO: 140) PCR BRCA1 T922I AcuI-tagging CAC CTC CTG CAT TCA AAA GAT TC (SEQ PCR BRCA1 oligo reverse ID NO: 141) E1754K AcuI-tagging GCT GCT TCA CCT TAA ATA ACA AAA A PCR BRCA1 oligo reverse (SEQ ID NO: 142) S1363L AcuI-tagging AGG GAC ATA TGG GAA AAA GAG TTA G PCR BRCA1 oligo reverse (SEQ ID NO: 143) Q1779* AcuI-tagging TTA GAC CTG ATA TTT CTG TCC CTT G PCR BRCA2 oligo reverse (SEQ ID NO: 144) R2842C AcuI-tagging ACC TCT ACT ACC TAT GTG GCT TGT G PCR BRCA2 oligo reverse (SEQ ID NO: 145) R2973H AcuI-tagging GGT TTG TAC CGG TAG TTG TTG ATA C PCR BRCA2 oligo reverse (SEQ ID NO: 146) S2998F and BRCA2 Q2960* AcuI-tagging AAA TAG CCC TGT ACA ATG AAA AGT AGA PCR BRCA2 oligo reverse (SEQ ID NO: 147) S3070F and BRCA2 V30791 AcuI-tagging TCA TAT ACG GCA GTA TGG TTA AGG T PCR BRCA2 oligo reverse (SEQ ID NO: 148) E2772K AcuI-tagging GTG GCC CTA CCT CAA AAT TAT TAC T PCR BRCA2 T17071 oligo reverse (SEQ ID NO: 149) AcuI-tagging TAT CTA CCA TGT TTG AGT GAC CTG A PCR BRCA T544I oligo reverse (SEQ ID NO: 150) and BRCA2 V572I AcuI-tagging CTT CAT AAG TCA GTC TCA TCT GCA A PCR BRCA2 oligo reverse (SEQ ID NO: 151) V2102I AcuI-tagging GTA CAG GAG GGA CAA AAA TAA AAC A PCR BRCA2 oligo reverse (SEQ ID NO: 152) R2896C AcuI-tagging CCT TAA CTA GCT CTT TTG GGA CAA T PCR BRCA2 V778I oligo reverse (SEQ ID NO: 153) PB1150 GAAAATAGACCAATAGGCAGAGAGAGTC HBB PCR rev (SEQ ID NO: 154) PB1152 TGTCATTAAGAGAGAGACTTTTATTATTCC EMX1 PCR rev (SEQ ID NO: 155) PB1154 ATCCATCTACCTCAGTTTCCTATATCTATC JAK2 PCR rev (SEQ ID NO: 156) PB783 CCCTTTCCTGTAAAAACAATATAAAAA (SEQ PIK3R1 PCR rev ID NO: 157) PB764 TTCTGGAAAATGGATCTAAAGCTAATA (SEQ TCOF1 PCR RFLP ID NO: 158) for PB765 TCACAATTCGTAGTCCTACTTCTACCT (SEQ TCOF1 PCR RFLP ID NO: 159) rev TP226 ACGTTGATGGCAGTTGCAGGTC (SEQ ID JAK2 (HDR) for NO: 160) TP227 CTGACAGAGTTGCTAGACACTGGGTTG JAK2 (HDR) rev (SEQ ID NO: 161) PB969 AACGATCTTCAATATGCTTACCAAG (SEQ ID HBB PCR RFLP for NO: 162) PB970 CTTAACCATAGAAAAGAAGGGGAAA (SEQ ID HBB PCR RFLP rev NO: 163) PB327 GCCATCCCCTTCTGTGAATGTTAGAC (SEQ EMX1 PCR for ID NO: 164) PB328 GGAGATTGGAGACACGGAGAGCAG (SEQ ID EMX1 PCR rev NO: 165) PB1302 AACTGTGCAATAGTTAAACCCATTTAC (SEQ PCR TP53 (HDR) ID NO: 166) PB862 GTAGGTGTTCGGTAAATGTTAATGG (SEQ ID PCR FANCD2 NO: 167) PB863 AAGTCAAATCCCATACCCTACTCAT (SEQ ID PCR FANCD2 NO: 168) PB1334 TACTTGCTTTCAGTGTTGTGTTATAGG (SEQ PCR Jak2 (mouse) ID NO: 169) PB1335 ATTTGTTTACTGTAATCCTCATCCATC (SEQ PCR Jak2 (mouse) ID NO: 170) PB1319 GGAAAAGTTTATAGGTGTCCCTTCTAC (SEQ PCR Apc. 1405 ID NO: 171) PB1320 AGCAGGTGTACTTCTGTCAGCTC (SEQ ID PCR Apc. 1405 NO: 172) PB1432 AATATTCTGCAGACTGATATTCTGGTT (SEQ PCR Apc. 492 ID NO: 173) PB1428 CGTTACTTAATTTTGAAAAACCTCAAC (SEQ PCR FANCF ID NO: 174) PB1429 AGATTTGGGTTCTCTCTATAGCCATT (SEQ PCR FANCF ID NO: 175) PB745 GACTCCAGTCAAAAATTCTCCTAGTTA (SEQ PCR FANCM ID NO: 176) PB858 ATGTCTGCAGCTATAGTTAGGAAGC (SEQ ID PCR SLX4 NO: 177) PB859 ATCTCTCCCTGAGTTGATGAGAAG (SEQ ID PCR SLX4 NO: 178) PB764 TTCTGGAAAATGGATCTAAAGCTAATA (SEQ PCR TCOF1 ID NO: 179) PB765 TCACAATTCGTAGTCCTACTTCTACCT (SEQ PCR TCOF1 ID NO: 180) PB746 CTGTTTGTCCTAAACAAGATGTGAAT (SEQ PCR TIMELESS ID NO: 181) PB747 CATTGGAGCAAGTTAAAACTACAAAAT (SEQ PCR TIMELESS ID NO: 182) PB1297 CCTTAACCTCTTGATGTATGAGAAGAA (SEQ PCR BRCA2 ID NO: 183) dupAGAAGAT PB1298 AGTACATCTAAGAAATTGAGCATCCTT (SEQ PCR BRCA2 ID NO: 184) dupAGAAGAT PB590 GTGTGTGTGCAATTATAAAAGAAACTT (SEQ PCR SMARCAL1 ID NO: 185) PB591 GTCAGCATTAGATGAGCTACTGAGATT (SEQ PCR SMARCAL1 ID NO: 186) PB1322 CTGTTCTACTTGTTGGTGGTGATAATA (SEQ PCR mouse Pik3ca ID NO: 187) (545) PB1323 ATGGTAAGAAATATGGTTAACACCAAG (SEQ PCR mouse Pik3ca ID NO: 188) (545) PB1510 CTATTTTAGGTTACTGGGAACAGAATG (SEQ Oligos for Bard1 ID NO: 189) S563F genotyping PB1511 AAACTACATAACTACAACCCAATGCTT (SEQ Oligos for Bard1 ID NO: 190) S563F genotyping PB1514 GAACCCCATACCTGGGATCT (SEQ ID NO: Oligos for Brca1 191) S1598F genotyping PB1515 tcatacctcacaaggtgccta (SEQ ID NO: 192) Oligos for Brca1 S1598F genotyping PB1548 TTATCAGTTTTGGAGGATGTACATAAA (SEQ PCR HEK3 rev ID NO: 193) PB780 CTCCTTCCTCTTCCTACAGTACTCC (SEQ ID TP53 gDNA for NO: 194) (PAM) Illumina primers (NGS) Sequence (5′- ->3′) Notes Primers for amplifying AcuI-tagged amplicons SAM175 ACACTCTTTCCCTACACGACGCTCTTCCGATC Adaptor constant TTTCCTCACGAGACCCGTCCTG (SEQ ID NO: forward - Forward 195) primer used with all amplicons - binds AcuI-tagging primer sequence SAM176 AGACGTGTGCTCTTCCGATCTCTTGATCACAG NGS BRCA1 C64Y ATGTATGTA (SEQ ID NO: 196) AcuI SAM177 AGACGTGTGCTCTTCCGATCTCAAAATATTTG NGS BRCA1 E638K GGAAAACCT (SEQ ID NO: 197) AcuI SAM178 AGACGTGTGCTCTTCCGATCTTTACTGGAAGT NGS BRCA1 TAGCACTCT (SEQ ID NO: 198) E1033K AcuI SAM179 AGACGTGTGCTCTTCCGATCTCAAAATATTTG NGS BRCA1 E575K GGAAAACCT (SEQ ID NO: 199) AcuI SAM182 AGACGTGTGCTCTTCCGATCTTTACTGGAAGT NGS BRCA1 V990I TAGCACTCT (SEQ ID NO: 200) AcuI SAM184 AGACGTGTGCTCTTCCGATCTACCATCATCTA NGS BRCA1 T922I ACAGGTCAT (SEQ ID NO: 201) AcuI SAM185 AGACGTGTGCTCTTCCGATCTCTTGATCACAG NGS BRCA1 D67N ATGTATGTA (SEQ ID NO: 202) AcuI SAM186 AGACGTGTGCTCTTCCGATCTCTCCTGCATTC NGS BRCA1 AAAAGATTC (SEQ ID NO: 203) E1754K AcuI SAM189 AGACGTGTGCTCTTCCGATCTTTCACCTTAAA NGS BRCA1 TAACAAAAA (SEQ ID NO: 204) S1363L AcuI SAM190 AGACGTGTGCTCTTCCGATCTCATATGGGAAA NGS BRCA1 AAGAGTTAG (SEQ ID NO: 205) Q1779* AcuI SAM192 AGACGTGTGCTCTTCCGATCTCCTGATATTTC NGS BRCA2 TGTCCCTTG (SEQ ID NO: 206) R2842C AcuI SAM193 AGACGTGTGCTCTTCCGATCTTACTACCTATG NGS BRCA2 TGGCTTGTG (SEQ ID NO: 207) R2973H AcuI SAM194 AGACGTGTGCTCTTCCGATCTGTACCGGTAGT NGS BRCA2 TGTTGATAC (SEQ ID NO: 208) S2998F AcuI SAM195 AGACGTGTGCTCTTCCGATCTCCTGTACAATG NGS BRCA2 AAAAGTAGA (SEQ ID NO: 209) S3070F AcuI SAM196 AGACGTGTGCTCTTCCGATCTTACGGCAGTAT NGS BRCA2 GGTTAAGGT (SEQ ID NO: 210) E2772K AcuI SAM197 AGACGTGTGCTCTTCCGATCTCCTACCTCAAA NGS BRCA2 T17071 ATTATTACT (SEQ ID NO: 211) AcuI SAM198 AGACGTGTGCTCTTCCGATCTCCTGTACAATG NGS BRCA2 V30791 AAAAGTAGA (SEQ ID NO: 212) AcuI SAM199 AGACGTGTGCTCTTCCGATCTGTACCGGTAGT NGS BRCA2 TGTTGATAC (SEQ ID NO: 213) Q2960* AcuI SAM201 AGACGTGTGCTCTTCCGATCTACCATGTTTGA NGS BRCA2 T544I GTGACCTGA (SEQ ID NO: 214) AcuI SAM202 AGACGTGTGCTCTTCCGATCTTAAGTCAGTCT NGS BRCA2 V2102I CATCTGCAA (SEQ ID NO: 215) AcuI SAM203 AGACGTGTGCTCTTCCGATCTGGAGGGACAA NGS BRCA2 AAATAAAACA (SEQ ID NO: 216) R2896C AcuI SAM204 AGACGTGTGCTCTTCCGATCTACCATGTTTGA NGS BRCA2 V572I GTGACCTGA (SEQ ID NO: 217) AcuI SAM205 AGACGTGTGCTCTTCCGATCTACTAGCTCTTT NGS BRCA2 V778I TGGGACAAT (SEQ ID NO: 218) AcuI Primers for indexing the above amplicons SAM113 caagcagaagacggcatacgagatTGCCTCTTgtgactgga N711 gttcagacgtgtgctcttccgatct (SEQ ID NO: 219) SAM64 aatgatacggcgaccaccgagatctacacACTGCATAacact S506 ctttccctacacgacg (SEQ ID NO: 220) TP370 acactctttccctacacgacgctcttccgatctGTTTAAACAGT BRCA2_NGS_F GGAATTCTAGAGTCA (SEQ ID NO: 221) TP371 agacgtgtgctcttccgatctTTTTTGCAGCTGTGTCATC BRCA2_NGS_R C (SEQ ID NO: 222) TP372 acactctttccctacacgacgctcttccgatctGCCCCTCCTC TP53_NGS_F AGCATCTTAT (SEQ ID NO: 223) TP373 agacgtgtgctcttccgatctCTTAACCCCTCCTCCCAG TP53_NGS_R AG (SEQ ID NO: 224) ssODNs: Targeted Sequence (5′- ->3′) gene TTCCTTAGTCTTTCTTTGAAGCAGCAAGTATGATGAGCAAGCTTTCTCA JAK2 CAAGCATTTGGTTTTAAATTATGGAGTATGTGTgtttaaacCTGTGGAGACG AGAGTAAGTAAAACTACAGGCTTTCTAATGCCTTTCTCAGAGCATCTGT TTTTGTTTATATAGAAAATTCAGTTTCAGGATCA (SEQ ID NO: 225) AAGAAGGGCTCCCATCACATCAACCGGTGGCGCATTGCCACGAAGCA EMX1 GGCCAATGGGGAGGACATCGATGTCACCTCCAATGACTAgtttaaacGGG TGGGCAACCACAAACCCACGAGGGCAGAGTGCTGCTTGCTGCTGGCC AGGCCCCTGCGTGGGCCCAAGCTGGACTCTGGCCACTCCC (SEQ ID NO: 226) TACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTCAAACAGACA HBB CAATGGTGCATCTGACTCCTGTCGAGAAGTCTGCCGTTACTGCCCTGT GGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCTGGG (SEQ ID NO: 227) TCTTAGGTCTGGCCCCTCCTCAGCATCTTATCCGAGTGGAAGGAAATT TP53 TGCGTGTGGAGTATTTGGATGACAAACACTTTTCGTCATAGTGTGGTTG R209fs*6 TGCCCTATGAGCCGCCTGAGGTCTGGTTTGCAACTGGGGTCTCTGGG AGGAGGGGTTAAGGGTGGTTGT (SEQ ID NO: 228) TTGTTTAAACAGTGGAATTCTAGAGTCACACTTCCTAAAATATGCATTTT BRCA2 TGTTTTCACTTTTAGATATGATACTGAAATTGATAGAAGCAGAAGATAG dupAGA AAGATCGGCTATAAAAAAGATAATGGAAAGGGATGACACAGCTGCAAA AGAT AACACTTGTTCTCTGTGTTTCTGACATAAT (SEQ ID NO: 229) Libray of adaptors: Oligo Sequence (5′- ->3′) Notes PB984 CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCT Oligo corresponds to TCTAAgaattcgagctcggtacccg (SEQ ID NO: 230) the constant strand of the adaptor PB985 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGGG (SEQ ID NO: the variable strand of 231) the adaptor. It contains a 3′ GG, expected to ligate to CC PB986 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGAG (SEQ ID NO: 232) the variable strand of the adaptor. It contains a 3′ AG, expected to ligate to CT PB987 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGAA (SEQ ID NO: 233) the variable strand of the adaptor. It contains a 3′ AA, expected to ligate to TT PB988 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGTG (SEQ ID NO: 234) the variable strand of the adaptor. It contains a 3′ TG, expected to ligate to CA PB989 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGTA (SEQ ID NO: 235) the variable strand of the adaptor. It contains a 3′ TA, expected to ligate to TA PB990 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGCG (SEQ ID NO: 236) the variable strand of the adaptor. It contains a 3′ CG, expected to ligate to CG PB991 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGCA (SEQ ID NO: 237) the variable strand of the adaptor. It contains a 3′ CA, expected to ligate to TG PB992 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG O1igo corresponds to CTTCTTACCCGTGCCCCAGCT (SEQ ID NO: 238) the variable strand of the adaptor. It contains a 3′ CT, expected to ligate to AG PB993 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGGA (SEQ ID NO: 239) the variable strand of the adaptor. It contains a 3′ GA, expected to ligate to TC PB1000 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGAC (SEQ ID NO: 240) the variable strand of the adaptor. It contains a 3′ AC, expected to ligate to GT PB1001 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGAT (SEQ ID NO: 241) the variable strand of the adaptor. It contains a 3′ AT, expected to ligate to AT PB1002 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGCC (SEQ ID NO: 242) the variable strand of the adaptor. It contains a 3′ CC, expected to ligate to GG PB1003 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGGC (SEQ ID NO: 243) the variable strand of the adaptor. It contains a 3′ GC, expected to ligate to GC PB1004 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGGT (SEQ ID NO: 244) the variable strand of the adaptor. It contains a 3′ GT, expected to ligate to AC PB1005 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGTC (SEQ ID NO: 245) the variable strand of the adaptor. It contains a 3′ TC, expected to ligate to GA PB1006 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG Oligo corresponds to CTTCTTACCCGTGCCCCAGTT (SEQ ID NO: 246) the variable strand of the adaptor. It contains a 3′ TT, expected to ligate to AA Oligos (sgRNAs cloning): Oligo Sequence (5′- ->3′) Target/Notes oligo plate CAC CGT ACA TAA AGG ACA CTG TGA BRCA1 C64Y for (SEQ ID NO: 247) oligo plate CAC CGC AAT TCA GTA CAA TTA GGT BRCA1 E638K for (SEQ ID NO: 248) oligo plate CAC CGA TTT TCT CTA ATG TTA TTA BRCA1 E1033K for (SEQ ID NO: 249) oligo plate CAC CGT TTT TCG AGT GAT TCT ATT BRCA1 E575K for (SEQ ID NO: 250) oligo plate CAC CGT TTT AAC AAA TGA CTT GAT BRCA1 V990I for (SEQ ID NO: 251) oligo plate CAC CGA GAC AGT TAA TAT CAC TGC BRCA1 T922I for (SEQ ID NO: 252) oligo plate CAC CGT TAT ATC ATT CTT ACA TAA BRCA1 D67N for (SEQ ID NO: 253) oligo plate CAC CGG GGA TTC TCT TGC TCG CTT BRCA1 E1754K for (SEQ ID NO: 254) oligo plate CAC CGT GGA TTC AAA CTT AGG TAT BRCA1 51363L for (SEQ ID NO: 255) oligo plate CAC CGT TAG ATC AAC TGG AAT GGA BRCA1 Q1779* for (SEQ ID NO: 256) oligo plate CAC CGA TAT TTC GCA ATG AAA GAG BRCA2 R2842C for (SEQ ID NO: 257) oligo plate CAC CGA CAA TAC GCA ACT TCC ACA BRCA2 R2973H for (SEQ ID NO: 258) oligo plate CAC CGT ATA TTC TCT GTT AAC AGA BRCA2 S2998F for (SEQ ID NO: 259) oligo plate CAC CGG TTC TGA GGT GGA CCT AAT BRCA2 S3070F for (SEQ ID NO: 260) oligo plate CAC CGG AGA TTC TGG GGC TTC AAG BRCA2 E2772K for (SEQ ID NO: 261) oligo plate CAC CGT AAA TAC TGC AGA TTA TGT BRCA2 T1707I for (SEQ ID NO: 262) oligo plate CAC CGA GAA ACG ACA AAT CCT ATT BRCA2 V3079I for (SEQ ID NO: 263) oligo plate CAC CGA AGG AAC AAG GTT TAT CAA BRCA2 Q2960* for (SEQ ID NO: 264) oligo plate CAC CGC ATA CTG TTT GCT CAC AGA BRCA2 T544I for (SEQ ID NO: 265) oligo plate CAC CGG CTA CAG AAT TCT GTG TGG BRCA2 V572I for (SEQ ID NO: 266) oligo plate CAC CGA CAG AAC ATC CTT GGA AGT BRCA2 V778I for (SEQ ID NO: 267) oligo plate AAA CTC ACA GTG TCC TTT ATG TAC BRCA1 C64Y rev (SEQ ID NO: 268) oligo plate AAA CAC CTA ATT GTA CTG AAT TGC BRCA1 E638K rev (SEQ ID NO: 269) oligo plate AAA CTA ATA ACA TTA GAG AAA ATC BRCA1 E1033K rev (SEQ ID NO: 270) oligo plate AAA CAA TAG AAT CAC TCG AAA AAC BRCA1 E575K rev (SEQ ID NO: 271) oligo plate AAA CAT CAA GTC ATT TGT TAA AAC BRCA1 V990I rev (SEQ ID NO: 272) oligo plate AAA CGC AGT GAT ATT AAC TGT CTC BRCA1 T922I rev (SEQ ID NO: 273) oligo plate AAA CTT ATG TAA GAA TGA TAT AAC BRCA1 D67N rev (SEQ ID NO: 274) oligo plate AAA CAA GCG AGC AAG AGA ATC CCC BRCA1 E1754K rev (SEQ ID NO: 275) oligo plate AAA CAT ACC TAA GTT TGA ATC CAC BRCA1 S1363L rev (SEQ ID NO: 276) oligo plate AAA CTC CAT TCC AGT TGA TCT AAC BRCA1 Q1779* rev (SEQ ID NO: 277) oligo plate AAA CCT CTT TCA TTG CGA AAT ATC BRCA2 R2842C rev (SEQ ID NO: 278) oligo plate AAA CTG TGG AAG TTG CGT ATT GTC BRCA2 R2973H rev (SEQ ID NO: 279) oligo plate AAA CTC TGT TAA CAG AGA ATA TAC BRCA2 S2998F rev (SEQ ID NO: 280) oligo plate AAA CAT TAG GTC CAC CTC AGA ACC BRCA2 S3070F rev (SEQ ID NO: 281) oligo plate AAA CCT TGA AGC CCC AGA ATC TCC BRCA2 E2772K rev (SEQ ID NO: 282) oligo plate AAA CAC ATA ATC TGC AGT ATT TAC BRCA2 T17071 rev (SEQ ID NO: 283) oligo plate AAA CAA TAG GAT TTG TCG TTT CTC BRCA2 V30791 rev (SEQ ID NO: 284) oligo plate AAA CTT GAT AAA CCT TGT TCC TTC BRCA2 Q2960* rev (SEQ ID NO: 285) oligo plate AAA CTC TGT GAG CAA ACA GTA TGC BRCA2 T544I rev (SEQ ID NO: 286) oligo plate AAA CCC ACA CAG AAT TCT GTA GCC BRCA2 V572I rev (SEQ ID NO: 287) oligo plate AAA CAC TTC CAA GGA TGT TCT GTC BRCA2 V778I rev (SEQ ID NO: 288) PB776 CACCGAACTTcGAGATACAGCAGAC (SEQ PIK3R1 R348* for ID NO: 289) PB777 AAACGTCTGCTGTATCTCgAAGTTC (SEQ PIK3R1 R348* rev ID NO: 290) PB551 CACCGGGCCAGCTGGAGGCCGTCG SPRTN Q60* for (SEQ ID NO: 291) PB552 AAACCGACGGCCTCCAGCTGGCCC (SEQ SPRTN Q60* rev ID NO: 292) PB756 CACCGAGCcAGGTGAGGCCTGGAGG TCOF1 Q290* for (SEQ ID NO: 293) PB757 AAACCCTCCAGGCCTCACCTgGCTC (SEQ TCOF1 Q290* rev ID NO: 294) TP212 CACCGAATTATGGAGTATGTGTCTG (SEQ JAK2 HDR for ID NO: 295) TP213 AAACCAGACACATACTCCATAATTC (SEQ JAK2 HDR rev ID NO: 296) PB963 CACCGATGGTGCATCTGACTCCTG (SEQ HBB E6V HDR for ID NO: 297) PB964 AAACCAGGAGTCAGATGCACCATC (SEQ HBB E6V HDR rev ID NO: 298) PB1017 CACCGAGTCCGAGCAGAAGAAGAA (SEQ EMX1 Base editing for ID NO: 299) PB1018 AAACTTCTTCTTCTGCTCGGACTC (SEQ EMX1 Base editing rev ID NO: 300) PB325 CACCGGTCACCTCCAATGACTAGGG EMX1 HDR for (SEQ ID NO: 301) PB326 AAACCCCTAGTCATTGGAGGTGACC EMX1 HDR rev (SEQ ID NO: 302) PB1299 CACCGCACTTTTCGACATAGTGTGG (SEQ TP53 R209fs*6 ID NO: 303) PB1300 AAACCCACACTATGTCGAAAAGTGC (SEQ TP53 R209fs*6 ID NO: 304) PB580 CACCGCAGCATCAGAGGACTAGCTC SMARCAL1 Q34* (SEQ ID NO: 305) PB581 AAACGAGCTAGTCCTCTGATGCTGC SMARCAL1 Q34* (SEQ ID NO: 306) PB838 CACCGATTCCcAGCACGCTGATGTG (SEQ FANCD2 Q223* for ID NO: 307) PB839 AAACCACATCAGCGTGCTgGGAATC (SEQ FANCD2 Q223* rev ID NO: 308) E12 CAC CGA TAC ATT TTG TCT AGA CGT BRCA2 V2102I for (SEQ ID NO: 309) H06 AAA CAC GTC TAG ACA AAA TGT ATC BRCA2 V2102I rev (SEQ ID NO: 310) PB1294 CACCGTTTCACTTTTAGATATGATA (SEQ BRCA2 dupAGAAGAT for ID NO: 311) PB1295 AAACTATCATATCTAAAAGTGAAAC (SEQ BRCA2 dupAGAAGAT rev ID NO: 312) PB738 CACCGAAGACTCGAGCCCTCCAGCG TIMELESS R267* for (SEQ ID NO: 313) PB739 AAACCGCTGGAGGGCTCGAGTCTTC TIMELESS R267* rev (SEQ ID NO: 314) PB834 CACCGCAGCcAGTCAGCGTCCTCGC SLX4 W879* for (SEQ ID NO: 315) PB835 AAACGCGAGGACGCTGACTgGCTGC SLX4 W879* rev (SEQ ID NO: 316) PB736 CACCGGTACAACGAATGGGTAGAAC FANCM Q572* for (SEQ ID NO: 317) PB737 AAACGTTCTACCCATTCGTTGTACC (SEQ FANCM Q572* rev ID NO: 318)
DOCUMENTS CITED
[0193] Allen, F., Crepaldi, L., Alsinet, C., Strong, A. J., Kleshchevnikov, V., De Angeli, P., Palenikova, P., Khodak, A., Kiselev, V., Kosicki, M., et al. (2018). Predicting the mutations generated by repair of Cas9-induced double-strand breaks. Nature biotechnology. [0194] Anzalone, A. V., Randolph, P. B., Davis, J. R., Sousa, A. A., Koblan, L. W., Levy, J. M., Chen, P. J., Wilson, C., Newby, G. A., Raguram, A., et al. (2019). Search-and-replace genome editing without double-strand breaks or donor DNA. Nature. [0195] Apostolou, P., and Fostira, F. (2013). Hereditary breast cancer: the era of new susceptibility genes. Biomed Res Int 2013, 747318. [0196] Barbieri, E. M., Muir, P., Akhuetie-Oni, B. O., Yellman, C. M., and Isaacs, F. J. (2017). Precise Editing at DNA Replication Forks Enables Multiplex Genome Engineering in Eukaryotes. Cell 171, 1453-1467 e1413. [0197] Bath, A. J., Milsom, S. E., Gormley, N. A., and Halford, S. E. (2002). Many type IIs restriction endonucleases interact with two recognition sites before cleaving DNA. J Biol Chem 277, 4024-4033. [0198] Bhagwat, N., Koppikar, P., Keller, M., Marubayashi, S., Shank, K., Rampal, R., Qi, J., Kleppe, M., Patel, H. J., Shah, S. K., et al. (2014). Improved targeting of JAK2 leads to increased therapeutic efficacy in myeloproliferative neoplasms. Blood 123, 2075-2083. [0199] Bhojwani, D., and Pui, C. H. (2013). Relapsed childhood acute lymphoblastic leukaemia. Lancet Oncol 14, e205-217. [0200] Billing, D., Horiguchi, M., Wu-Baer, F., Taglialatela, A., Leuzzi, G., Nanez, S. A., Jiang, W., Zha, S., Szabolcs, M., Lin, C. S., et al. (2018). The BRCT Domains of the BRCA1 and BARD1 Tumor Suppressors Differentially Regulate Homology-Directed Repair and Stalled Fork Protection. Mol Cell 72, 127-139 e128. [0201] Billon, P., Bryant, E. E., Joseph, S. A., Nambiar, T. S., Hayward, S. B., Rothstein, R., and Ciccia, A. (2017). CRISPR-Mediated Base Editing Enables Efficient Disruption of Eukaryotic Genes through Induction of STOP Codons. Mol Cell 67, 1068-1079 e1064. [0202] Brinkman, E. K., Chen, T., Amendola, M., and van Steensel, B. (2014). Easy quantitative assessment of genome editing by sequence trace decomposition. Nucleic Acids Res 42, e168. [0203] Brinkman, E. K., Kousholt, A. N., Harmsen, T., Leemans, C., Chen, T., Jonkers, J., and van Steensel, B. (2018). Easy quantification of template-directed CRISPR/Cas9 editing. Nucleic Acids Res 46, e58. [0204] Canny, M. D., Moatti, N., Wan, L. C. K., Fradet-Turcotte, A., Krasner, D., Mateos-Gomez, P. A., Zimmermann, M., Orthwein, A., Juang, Y. C., Zhang, W., et al. (2018). Inhibition of 53BP1 favors homology-dependent DNA repair and increases CRISPR-Cas9 genome-editing efficiency. Nat Biotechnol 36, 95-102. [0205] Ceccaldi, R., Sarangi, P., and D'Andrea, A. D. (2016). The Fanconi anaemia pathway: new players and new functions. Nat Rev Mol Cell Biol 17, 337-349. [0206] Chadwick, A. C., Wang, X., and Musunuru, K. (2017). In Vivo Base Editing of PCSK9 (Proprotein Convertase Subtilisin/Kexin Type 9) as a Therapeutic Alternative to Genome Editing. Arterioscler Thromb Vasc Biol 37, 1741-1747. [0207] Clement, K., Rees, H., Canver, M. C., Gehrke, J. M., Farouni, R., Hsu, J. Y., Cole, M. A., Liu, D. R., Joung, J. K., Bauer, D. E., et al. (2019). CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nature biotechnology 37, 224-226. [0208] Cong, L., Ran, F. A., Cox, D., Lin, S., Barretto, R., Habib, N., Hsu, P. D., Wu, X., Jiang, W., Marraffini, L. A., et al. (2013). Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819-823. [0209] Dieck, C. L., and Ferrando, A. A. (2019). Genetics and mechanisms of NT5C2-driven chemotherapy resistance in relapsed ALL. Blood. [0210] Dow, L. E. (2015). Modeling Disease In Vivo With CRISPR/Cas9. Trends Mol Med 21, 609-621. [0211] Fahim Farzadfard, T. K. L. (2018). Emerging applications for DNA writers and molecular recorders. Science 361, 870-875. [0212] Findlay, G. M., Daza, R. M., Martin, B., Zhang, M. D., Leith, A. P., Gasperini, M., Janizek, J. D., Huang, X., Starita, L. M., and Shendure, J. (2018). Accurate classification of BRCA1 variants with saturation genome editing. Nature. [0213] Gao, X., Tao, Y., Lamas, V., Huang, M., Yeh, W. H., Pan, B., Hu, Y. J., Hu, J. H., Thompson, D. B., Shu, Y., et al. (2018). Treatment of autosomal dominant hearing loss by in vivo delivery of genome editing agents. Nature 553, 217-221. [0214] Gaudelli, N. M., Komor, A. C., Rees, H. A., Packer, M. S., Badran, A. H., Bryson, D. I., and Liu, D. R. (2017). Programmable base editing of A*T to G*C in genomic DNA without DNA cleavage. Nature. [0215] Germini, D., Tsfasman, T., Zakharova, V. V., Sjakste, N., Lipinski, M., and Vassetzky, Y. (2018). A Comparison of Techniques to Evaluate the Effectiveness of Genome Editing. Trends Biotechnol 36, 147-159. [0216] Guo, X., Chavez, A., Tung, A., Chan, Y., Kaas, C., Yin, Y., Cecchi, R., Garnier, S. L., Kelsic, E. D., Schubert, M., et al. (2018). High-throughput creation and functional profiling of DNA sequence variant libraries using CRISPR-Cas9 in yeast. Nat Biotechnol 36, 540-546. [0217] Inaba, H., Greaves, M., and Mullighan, C. G. (2013). Acute lymphoblastic leukaemia. Lancet 381, 1943-1955. [0218] Jasin, M., and Haber, J. E. (2016). The democratization of gene editing: Insights from site-specific cleavage and double-strand break repair. DNA Repair (Amst) 44, 6-16. [0219] Kluesner, M. G., Nedveck, D. A., Lahr, W. S., Garbe, J. R., Abrahante, J. E., Webber, B. R., and Moriarity, B. S. (2018). EditR: A Method to Quantify Base Editing from Sanger Sequencing. CRISPR J 1, 239-250. [0220] Komor, A. C., Kim, Y. B., Packer, M. S., Zuris, J. A., and Liu, D. R. (2016). Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 420-424. [0221] Komor, A. C., Zhao, K. T., Packer, M. S., Gaudelli, N. M., Waterbury, A. L., Koblan, L. W., Kim, Y. B., Badran, A. H., and Liu, D. R. (2017). Improved base excision repair inhibition and bacteriophage Mu Gam protein yields C:G-to-T:A base editors with higher efficiency and product purity. Sci Adv 3, eaao4774. [0222] Leenay, R. T., Aghazadeh, A., Hiatt, J., Tse, D., Roth, T. L., Apathy, R., Shifrut, E., Hultquist, J. F., Krogan, N., Wu, Z., et al. (2019). Large dataset enables prediction of repair after CRISPR-Cas9 editing in primary T cells. Nature biotechnology 37, 1034-1037. [0223] Lek, M., Karczewski, K. J., Minikel, E. V., Samocha, K. E., Banks, E., Fennell, T., O'Donnell-Luria, A. H., Ware, J. S., Hill, A. J., Cummings, B. B., et al. (2016). Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285-291. [0224] Levine, R. L., Wadleigh, M., Cools, J., Ebert, B. L., Wernig, G., Huntly, B. J., Boggon, T. J., Wlodarska, I., Clark, J. J., Moore, S., et al. (2005). Activating mutation in the tyrosine kinase JAK2 in polycythemia vera, essential thrombocythemia, and myeloid metaplasia with myelofibrosis. Cancer Cell 7, 387-397. [0225] Levy, J. M., Yeh, W. H., Pendse, N., Davis, J. R., Hennessey, E., Butcher, R., Koblan, L. W., Comander, J., Liu, Q., and Liu, D. R. (2020). Cytosine and adenine base editing of the brain, liver, retina, heart and skeletal muscle of mice via adeno-associated viruses. Nat Biomed Eng 4, 97-110. [0226] Lindsay, H., Burger, A., Biyong, B., Felker, A., Hess, C., Zaugg, J., Chiavacci, E., Anders, C., Jinek, M., Mosimann, C., et al. (2016). CrispRVariants charts the mutation spectrum of genome engineering experiments. Nat Biotechnol 34, 701-702. [0227] Liu, Z., Lu, Z., Yang, G., Huang, S., Li, G., Feng, S., Liu, Y., Li, J., Yu, W., Zhang, Y., et al. (2018). Efficient generation of mouse models of human diseases via ABE- and BE-mediated base editing. Nat Commun 9, 2338. [0228] Lundin, S., Jemt, A., Terje-Hegge, F., Foam, N., Pettersson, E., Kaller, M., Wirta, V., Lexow, P., and Lundeberg, J. (2015). Endonuclease specificity and sequence dependence of type IIS restriction enzymes. PLoS One 10, e0117059. [0229] Mali, P., Yang, L., Esvelt, K. M., Aach, J., Guell, M., DiCarlo, J. E., Norville, J. E., and Church, G. M. (2013). RNA-guided human genome engineering via Cas9. Science 339, 823-826. [0230] Mashal, R. D., Koontz, J., and Sklar, J. (1995). Detection of mutations by cleavage of DNA heteroduplexes with bacteriophage resolvases. Nat Genet 9, 177-183. [0231] McClellan, J., and King, M. C. (2010). Genetic heterogeneity in human disease. Cell 141, 210-217. [0232] Mullally, A., Lane, S. W., Ball, B., Megerdichian, C., Okabe, R., Al-Shahrour, F., Paktinat, M., Haydu, J. E., Housman, E., Lord, A. M., et al. (2010). Physiological Jak2V617F expression causes a lethal myeloproliferative neoplasm with differential effects on hematopoietic stem and progenitor cells. Cancer Cell 17, 584-596. [0233] Oshima, K., Khiabanian, H., da Silva-Almeida, A. C., Tzoneva, G., Abate, F., Ambesi-Impiombato, A., Sanchez-Martin, M., Carpenter, Z., Penson, A., Perez-Garcia, A., et al. (2016). Mutational landscape, clonal evolution patterns, and role of RAS mutations in relapsed acute lymphoblastic leukemia. Proc Natl Acad Sci USA 113, 11306-11311. [0234] Pathania, S., Bade, S., Le Guillou, M., Burke, K., Reed, R., Bowman-Colin, C., Su, Y., Ting, D. T., Polyak, K., Richardson, A. L., et al. (2014). BRCA1 haploinsufficiency for replication stress suppression in primary cells. Nature communications 5, 5496. [0235] Paulsen, B. S., Mandal, P. K., Frock, R. L., Boyraz, B., Yadav, R., Upadhyayula, S., Gutierrez-Martinez, P., Ebina, W., Fasth, A., Kirchhausen, T., et al. (2017). Ectopic expression of RAD52 and dn53BP1 improves homology-directed repair during CRISPR-Cas9 genome editing. Nat Biomed Eng 1, 878-888. [0236] Pinello, L., Canver, M. C., Hoban, M. D., Orkin, S. H., Kohn, D. B., Bauer, D. E., and Yuan, G. C. (2016). Analyzing CRISPR genome-editing experiments with CRISPResso. Nature biotechnology 34, 695-697. [0237] Qiu, P., Shandilya, H., D'Alessio, J. M., O'Connor, K., Durocher, J., and Gerard, G. F. (2004). Mutation detection using Surveyor nuclease. Biotechniques 36, 702-707. [0238] Ran, F. A., Hsu, P. D., Wright, J., Agarwala, V., Scott, D. A., and Zhang, F. (2013). Genome engineering using the CRISPR-Cas9 system. Nature protocols 8, 2281-2308. [0239] Rees, H. A., and Liu, D. R. (2018). Base editing: precision chemistry on the genome and transcriptome of living cells. Nat Rev Genet 19, 770-788. [0240] Roy, K. R., Smith, J. D., Vonesch, S. C., Lin, G., Tu, C. S., Lederer, A. R., Chu, A., Suresh, S., Nguyen, M., Horecka, J., et al. (2018). Multiplexed precision genome editing with trackable genomic barcodes in yeast. Nat Biotechnol 36, 512-520. [0241] Ryu, S. M., Koo, T., Kim, K., Lim, K., Baek, G., Kim, S. T., Kim, H. S., Kim, D. E., Lee, H., Chung, E., et al. (2018). Adenine base editing in mouse embryos and an adult mouse model of Duchenne muscular dystrophy. Nature biotechnology 36, 536-539. [0242] Shakya, R., Reid, L. J., Reczek, C. R., Cole, F., Egli, D., Lin, C. S., deRooij, D. G., Hirsch, S., Ravi, K., Hicks, J. B., et al. (2011). BRCA1 tumor suppression depends on BRCT phosphoprotein binding, but not its E3 ligase activity. Science 334, 525-528. [0243] Shen, M. W., Arbab, M., Hsu, J. Y., Worstell, D., Culbertson, S. J., Krabbe, O., Cassa, C. A., Liu, D. R., Gifford, D. K., and Sherwood, R. I. (2018). Predictable and precise template-free CRISPR editing of pathogenic variants. Nature 563, 646-651. [0244] Song, C. Q., Jiang, T., Richter, M., Rhym, L. H., Koblan, L. W., Zafra, M. P., Schatoff, E. M., Doman, J. L., Cao, Y., Dow, L. E., et al. (2020). Adenine base editing in an adult mouse model of tyrosinaemia. Nat Biomed Eng 4, 125-130. [0245] Tan, S. L. W., Chadha, S., Liu, Y., Gabasova, E., Perera, D., Ahmed, K., Constantinou, S., Renaudin, X., Lee, M., Aebersold, R., et al. (2017). A Class of Environmental and Endogenous Toxins Induces BRCA2 Haploinsufficiency and Genome Instability. Cell 169, 1105-1118 e1115. [0246] Tschaharganeh, D. F., Xue, W., Calvisi, D. F., Evert, M., Michurina, T. V., Dow, L. E., Banito, A., Katz, S. F., Kastenhuber, E. R., Weissmueller, S., et al. (2014). p53-dependent Nestin regulation links tumor suppression to cellular plasticity in liver cancer. Cell 158, 579-592. [0247] Tzoneva, G., Dieck, C. L., Oshima, K., Ambesi-Impiombato, A., Sanchez-Martin, M., Madubata, C. J., Khiabanian, H., Yu, J., Waanders, E., lacobucci, I., et al. (2018). Clonal evolution mechanisms in NT5C2 mutant-relapsed acute lymphoblastic leukaemia. Nature 553, 511-514. [0248] Tzoneva, G., Perez-Garcia, A., Carpenter, Z., Khiabanian, H., Tosello, V., Allegretta, M., Paietta, E., Racevskis, J., Rowe, J. M., Tallman, M. S., et al. (2013). Activating mutations in the NT5C2 nucleotidase gene drive chemotherapy resistance in relapsed ALL. Nat Med 19, 368-371. [0249] van Overbeek, M., Capurso, D., Carter, M. M., Thompson, M. S., Frias, E., Russ, C., Reece-Hoyes, J. S., Nye, C., Gradia, S., Vidal, B., et al. (2016). DNA Repair Profiling Reveals Nonrandom Outcomes at Cas9-Mediated Breaks. Mol Cell 63, 633-646. [0250] Villiger, L., Grisch-Chan, H. M., Lindsay, H., Ringnalda, F., Pogliano, C. B., Allegri, G., Fingerhut, R., Haberle, J., Matos, J., Robinson, M. D., et al. (2018). Treatment of a metabolic liver disease by in vivo genome base editing in adult mice. Nat Med 24, 1519-1525. [0251] Wang, L., Xue, W., Yan, L., Li, X., Wei, J., Chen, M., Wu, J., Yang, B., Yang, L., and Chen, J. (2017). Enhanced base editing by co-expression of free uracil DNA glycosylase inhibitor. Cell Res 27, 1289-1292. [0252] Yeh, W. H., Chiang, H., Rees, H. A., Edge, A. S. B., and Liu, D. R. (2018). In vivo base editing of post-mitotic sensory cells. Nature communications 9, 2184. [0253] Yin, H., Song, C. Q., Dorkin, J. R., Zhu, L. J., Li, Y., Wu, Q., Park, A., Yang, J., Suresh, S., Bizhanova, A., et al. (2016). Therapeutic genome editing by combined viral and non-viral delivery of CRISPR system components in vivo. Nature biotechnology 34, 328-333. [0254] Yin, H., Xue, W., Chen, S., Bogorad, R. L., Benedetti, E., Grompe, M., Koteliansky, V., Sharp, P. A., Jacks, T., and Anderson, D. G. (2014). Genome editing with Cas9 in adult mice corrects a disease mutation and phenotype. Nature biotechnology 32, 551-553. [0255] Zafra, M. P., Schatoff, E. M., Katti, A., Foronda, M., Breinig, M., Schweitzer, A. Y., Simon, A., Han, T., Goswami, S., Montgomery, E., et al. (2018). Optimized base editors enable efficient editing in cells, organoids and mice. Nature biotechnology 36, 888-893. [0256] Zhang, J., Li, J., Saucier, J. B., Feng, Y., Jiang, Y., Sinson, J., McCombs, A. K., Schmitt, E. S., Peacock, S., Chen, S., et al. (2019). Non-invasive prenatal sequencing for multiple Mendelian monogenic disorders using circulating cell-free fetal DNA. Nat Med 25, 439-447.
[0257] All documents cited in this application are hereby incorporated by reference as if recited in full herein.
[0258] Although illustrative embodiments of the present disclosure have been described herein, it should be understood that the disclosure is not limited to those described, and that various other changes or modifications may be made by one skilled in the art without departing from the scope or spirit of the disclosure.