Fragmentation of DNA
20220380840 · 2022-12-01
Assignee
Inventors
- Lynne Apone (Waltham, MA)
- Brittany S. Sexton (Amesbury, MA, US)
- Margaret Heider (Ipswich, MA, US)
- Louise JS Williams (Reading, MA, US)
- Eileen T. Dimalanta (Wakefield, MA, US)
Cpc classification
C12N9/22
CHEMISTRY; METALLURGY
C12P19/34
CHEMISTRY; METALLURGY
C40B50/06
CHEMISTRY; METALLURGY
C12N15/1065
CHEMISTRY; METALLURGY
C12Q1/6806
CHEMISTRY; METALLURGY
C40B40/06
CHEMISTRY; METALLURGY
International classification
Abstract
Provided herein is a polymerase-free enzyme mix (FRAG) for fragmenting double-stranded DNA. In some embodiments the enzyme mix may comprise a double-stranded DNA nickase and at least one of a DNA ligase capable of sealing a nick within a DNA, and a single-strand specific DNA nuclease. Methods for fragmenting double-stranded DNA are also provided.
Claims
1. A polymerase-free enzyme mix for fragmenting double-stranded DNA (FRAG), comprising: (a) a double-strand random DNA nickase; (b) a single-strand specific DNA nuclease; and optionally (c) a DNA ligase capable of sealing a nick within a DNA; and wherein FRAG does not contain a DNA polymerase.
2. The FRAG according to claim 1, wherein the double-strand random DNA nickase, the DNA ligase and the single-strand DNA nuclease are combined in the polymerase free enzyme mix.
3. The FRAG according to claim 1, further comprising a target DNA.
4. The FRAG according to claim 1, further comprising DNA fragments that are the product of fragmentation of the target DNA.
5. The FRAG according to claim 1, further comprising: (d) a polynucleotide kinase (PNK).
6. The FRAG according to claim 2, wherein the ligase is an NAD.sup.+ ligase.
7. The FRAG according to claim 1, wherein the single-strand nuclease is zinc dependent.
8. The FRAG according to claim 1, wherein the single-strand nuclease is a P1 nuclease.
9. The FRAG according to claim 1, wherein the double-strand random nickase is DNase I or variant thereof.
10. The FRAG according to claim 2, wherein one or more of the enzymes are preserved by lyophilization or air drying.
11. A method comprising: (a) combining a sample comprising double-stranded DNA with the FRAG according to claim 1 to produce a reaction mix; and (b) incubating the reaction mix to provide fragments of the double-stranded DNA.
12. The method according to claim 11, further comprising: (a) randomly nicking the duplex DNA in the sample with a double-strand random nickase to produce DNA fragments; (b) repairing unresolved nicks in the DNA fragments from (a) with a ligase, wherein the unresolved nicks occur on one strand of the DNA and not proximally on the opposite strand; and (c) removing any single-strand DNA at termini of the DNA fragments with a single-strand specific nuclease, wherein (a)-(c) are performed in a single container in a reaction buffer, in the absence of a DNA polymerase.
13. The method according to claim 11, wherein the sample comprises genomic DNA.
14. The method according to claim 11, wherein the sample comprises DNA isolated from a formalin-fixed paraffin-embedded (FFPE) sample.
15. The method according to claim 11, wherein the sample comprises DNA having modified bases.
16. The method according to claim 11, wherein the median length of the fragmented DNA is selected from a length that is greater than 50 bp, 500 bp, 1 kb or 10 kb.
17. The method according to claim 11, further comprising ligating adaptors to the fragments.
18. The method according to claim 12, further comprising in a second container, end repairing the fragments, ligating adapters to the fragment ends, amplifying the fragments and/or sequencing the fragments.
19. A kit, comprising: (a) a double-stranded random DNA nickase and a single-strand specific DNA nuclease in a single container; (b) a DNA ligase in the same or different container for combining with (a); and (c) instructions for use for fragmenting DNA in the absence of a DNA polymerase.
20. The kit according to claim 19, wherein an amount of the double-stranded random DNA nickase, the DNA ligase, and the single-strand specific DNA nuclease are combined in the reaction mixture at 0.001 ng/μl-0.25 ng/μl for the double-strand random nickase, 0.0025 μg/μl-1.4 μg/μl for the single-strand nuclease and 0.03 ng/μl-8 ng/μl for the ligase.
21. The kit according to claim 19, wherein any of (a) or (b) are lyophilized, air dried or in an aqueous buffer.
22. The kit according to claim 19, wherein any of (a)-(b) are immobilized on a substrate.
23. The kit according to claim 19, further comprising: one or more of a polynucleotide kinase (PNK), a nicking agent, and a DNA repair enzyme.
24. The kit according to claim 23, wherein the ligase is an NAD.sup.+ ligase.
25. The kit according to claim 24, wherein the NAD.sup.+ ligase is selected from the group consisting of Taq DNA ligase, E. coli DNA ligase and an archaeal DNA Ligase.
26. The kit according to claim 19, wherein the single-strand nuclease is zinc dependent.
27. The kit according to claim 19, wherein the single-strand nuclease is selected from the group consisting of mung bean nuclease, RecJ, a nuclease T, and a member of the S1 or P1 nuclease family of nucleases.
28. The kit according to claim 19, wherein the single-strand nuclease is a P1 nuclease.
29. The kit according to claim 19, wherein the double-strand random nickase is selected from the group consisting of a DNase or mutants thereof, Vvn nuclease, and micrococcal nuclease.
30. The kit according to claim 29, wherein the nickase is DNase I.
Description
DETAILED DESCRIPTION OF EMBODIMENTS
[0030] FRAG is used for fragmenting large DNAs into suitable sizes for amplification and sequencing and enables high throughput processing capability for large numbers of samples, and preservation of the composition and base modifications of the input DNA in the absence of a DNA polymerase in the enzyme mix.
[0031] When combined with a sample DNA, FRAG provides randomly fragmented DNA of a size determined by choice of enzyme reagents and selected reaction conditions. In one embodiment, FRAG is a mixture of enzymes in a fragmentation buffer, where the mixture FRAG includes at least two enzymes selected from a nickase, a ligase and a single-strand specific nuclease but importantly no polymerase is included in FRAG. The nickase is preferably a double-stranded DNA nickase where the “double-stranded nickase” refers to an enzyme that can nick double-stranded and single-stranded DNA, chromatin and/or RNA-DNA hybrids by creating a break on one strand of the duplex preferably at a random site and preferably produces a 5′ phosphate on one side of the break and a 3′OH on the other side of the break. A double-stranded break is achieved when the position of the nick on one strand of the duplex is randomly positioned proximate to a random nick on the second strand of the duplex. A double-strand DNA fragment is formed from two double-strand breaks along the DNA duplex.
[0032] FRAG avoids introducing base damage or artifacts associated with traditional methods of DNA fragmentation including physical shearing (Covaris, Bioruptor® (Diagenode, Inc., Denville, N.J.), etc.) and alternative enzymatic fragmentation methods that utilize DNA polymerases such as a strand displacing polymerase (e.g., Bst polymerase) or a non-strand displacing polymerase (e.g., T4 DNA polymerase).
[0033] A sample DNA of any size can be fragmented to a desired size range using FRAG. The desired size range of fragments can be achieved by varying the reaction conditions. Examples of different reaction conditions that can be varied include the following: (a) choice of double-strand random DNA nickase; (b) varying the concentration of the selected nickase; (c) selecting a double-strand random DNA nickase such as a DNase variant for FRAG where the nickase has altered properties such as reduced nickase activity; (d) altering the nickase to ligase ratio; (e) modifying the salt concentration of the buffer; and (f) altering the incubation time and/or temperature of the sample DNA with FRAG. A commonly selected median size range for fragments is 20 bp-500 bp although fragments with a larger median size range can be generated by varying the reaction conditions as described above. Whereas FRAG is a mixture of enzymes, the enzymes can be stored separately and combined in the reaction mix containing the DNA by adding sequentially or at the same time to the reaction mix. Alternatively, two or more enzymes in FRAG may be combined in a storage buffer preferably containing at least 10% glycerol and then added to the reaction mix. For example, the nickase and single-strand nuclease may be combined in the storage buffer and the ligase provided separately in the reaction mix. Alternatively, the nickase and ligase can be combined in a storage buffer with the single-strand nuclease stored separately and added to the reaction tube after or with the addition of the nickase and ligase enzyme mixture. In certain contexts, FRAG preferably contains the nickase, nuclease and ligase in the same mixture. Any of the enzymes in FRAG may be stored separately in a lyophilized state or any of the two enzyme combinations described herein may be lyophilized together and the third enzyme lyophilized separately. 
Alternatively, the third enzyme may be lyophilized and a mix containing two specified enzymes may be in a buffer solution. Any of the above mixes may further include a PNK either separately or in the mix. The PNK may be in solution or may be lyophilized. One or more of the enzymes in FRAG can be immobilized on a substrate such as a bead. This can enhance the efficiency of the reaction and permit reuse of the reagents after the reaction has been completed and the fragmented DNA is delivered to a reaction mix for amplification and/or sequencing.
[0034] In certain embodiments, one, two or more different nickases may be used in FRAG with one, two or more different ligases and one, two or more different single-strand nucleases in the absence of a DNA polymerase either within the mixture or added separately. Whereas an NAD+ ligase and/or an ATP ligase may be used in FRAG, preferably at least one ligase is an NAD+ ligase.
[0035] Without wishing to be bound to any specific theory, it is thought that the nickase in FRAG nicks the double-stranded DNA. Nicks that are opposite one another or a few bases away will cause the double-stranded DNA to break into fragments that have blunt ends or single-strand overhangs. The ligase seals any additional nicks that are internal to the double-strand fragments resulting in a continuous double-stranded DNA. Single-stranded DNA can be removed by the nuclease. The length of the fragments produced by the method can be tuned by, e.g., altering the nickase to ligase ratio and other reaction conditions.
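The proposed mechanism can be illustrated with a toy simulation. This is a minimal sketch and not a model taken from the disclosure: the function name `simulate_fragmentation`, the per-base nick probabilities and the `max_offset` parameter (how close two opposing nicks must be to break the duplex) are illustrative assumptions. Ligase action is modeled simply by ignoring nicks that have no nearby partner on the opposite strand, and the single-strand nuclease step is not modeled.

```python
import random

def simulate_fragmentation(length, nick_prob, max_offset=3, seed=0):
    """Toy model: nick each strand at random and return fragment lengths.

    A double-strand break is called wherever a top-strand nick has a
    bottom-strand nick within `max_offset` positions. Unpaired nicks are
    treated as sealed by the ligase, so they do not break the duplex.
    All parameter values are illustrative assumptions.
    """
    rng = random.Random(seed)
    top = [i for i in range(1, length) if rng.random() < nick_prob]
    bottom = {i for i in range(1, length) if rng.random() < nick_prob}

    breaks = []
    for pos in top:
        if any((pos + d) in bottom for d in range(-max_offset, max_offset + 1)):
            breaks.append(pos)

    # Convert the sorted break positions into fragment lengths.
    edges = [0] + breaks + [length]
    return [b - a for a, b in zip(edges, edges[1:])]

# Lower versus higher nicking activity on a 100 kb duplex.
low = simulate_fragmentation(100_000, nick_prob=0.005)
high = simulate_fragmentation(100_000, nick_prob=0.02)
print(len(low), len(high))
```

In this sketch, raising `nick_prob` stands in for a more active or more concentrated nickase, and it yields more, shorter fragments, which is consistent with the tuning behavior described in the text.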
[0036] Advantages of FRAG include one or more of the following: ability to sequence sample DNA fragments containing base modifications; increased availability of sequenceable material from a sample through reduction in artefacts that might otherwise occur to limit the availability of the sample for sequencing; a relative improvement in sequence data from DNA fragments regardless of the extent of damage to the bases in sample DNA; ability to control and modulate fragment size; reduction in sequence bias in fragment formation; no requirement for expensive equipment; preservation of nucleotide modification marks; relatively rapid; and suitability for high throughput library preparations providing a streamlined method that is relatively easy to execute resulting in reduced opportunities for error.
[0037] Improvements such as those described above can be observed when the starting material is high quality DNA or damaged DNA such as that obtained from FFPE samples. Using high quality, high molecular weight DNA purified by any suitable nucleic acid extraction method known in the art, such as the Monarch® High Molecular Weight DNA Extraction Kit (New England Biolabs, Ipswich, Mass.) (also see US 2021/0054363), the examples below show that FRAG preserves base modifications such as methylation marks in the DNA and does not introduce artifacts or base damage. Furthermore, FRAG was shown to improve sequencing metrics of libraries generated from genomic DNA. The improved sequencing metrics are described in the examples for methylated and/or FFPE DNA. The improved sequencing metrics permit analysis of reduced amounts of DNA that may be available from fixed cells, body fluid samples and, more generally, environmental samples.
DNA Samples
[0038] The DNA sample that is fragmented using embodiments of the present method may be any high molecular weight DNA including any of plasmids, viral DNA, amplicons, and genomic DNA from bacteria and eukaryotic cells. The DNA for fragmentation may be purified or obtained directly or indirectly from biological samples obtained from environmental sources such as air, water, soil, metagenome repositories such as the ocean metagenome; from organisms such as bacteria, algae, viruses, parasites, invertebrates, vertebrates, or plant material; from body fluid from a vertebrate such as blood, lymph, urine, sputum, saliva, spinal fluid, mucous, feces, or tears; or from laboratory sources.
[0039] In some embodiments, the DNA may be isolated from a laboratory or clinical sample, e.g., a tissue biopsy, cultured cells or a cell lysate. Where the DNA is purified, it may be stored in a laboratory setting at low temperatures in buffers. Alternatively, the DNA may be derived from fresh cells or from stored cell samples such as FFPE or frozen samples, or from natural sources such as ancient bone or teeth samples. These samples may have been subjected to deamination, oxidative damage or actual nicking by the environmental conditions. Damaged bases in DNA may optionally be repaired before or during enzymatic fragmentation using enzyme repair mixes such as USER® (New England Biolabs, Ipswich, Mass.) containing a glycosylase and/or glycosylase lyase (such as FPG, Endo IV or Endo VIII) and a cleaving enzyme such as UDG (see for example, U.S. Pat. No. 7,435,572). Alternatively, a repair mix such as PreCR® (New England Biolabs, Ipswich, Mass.; see U.S. Pat. Nos. 7,700,283 and 8,158,388) may be used prior to fragmentation. The samples may be repaired and then fragmented or vice versa in a single reaction vessel in one step or in sequential steps.
[0040] In alternative embodiments, DNA may be immobilized on a matrix and fragmented in situ. For example, high molecular weight DNA purified on beads (see for example WO 2021/034750) may be combined with the fragmentation step by adding FRAG to the bead bound DNA. In one embodiment, the beads containing the DNA are placed into a tube containing FRAG.
Enzyme Mixes
[0041] FRAG includes a plurality of enzymes in a mix. These include: a double-strand random DNA nickase capable of randomly nicking double-stranded DNA combined with at least one enzyme selected from a ligase capable of repairing internal nicks in a DNA and a single-strand specific nuclease capable of cleaving single-strand overhangs. In one embodiment, FRAG is a double-strand random DNA nickase combined with a single-strand nuclease in one container, and the ligase is provided in a separate container for adding to the sample after treatment with the nickase and nuclease. In another embodiment, FRAG is a mix of the nickase, the nuclease and the ligase. Significantly, FRAG does not contain a DNA polymerase.
[0042] In certain embodiments, a single nickase or a plurality of nickases, a single ligase or a plurality of ligases and/or a single nuclease or a plurality of nucleases may be included in FRAG. In certain embodiments, variants or mutants of any of wild-type enzymes having the described functions of nickase, ligase, and/or single-strand DNA nuclease may be included in FRAG. In certain embodiments, any or all of the enzymes in FRAG may be fusion proteins. In certain examples, any of the enzymes in FRAG may be fused to a moiety such as a DNA binding protein that is sequence specific such as the transcriptional activator moieties described in U.S. Pat. No. 9,963,687 or a non-sequence-specific protein, for example Sso7 (see for example WO 2001/092501 and U.S. Pat. No. 7,666,645). In another embodiment, any of the enzymes in FRAG may be fused to an affinity binding moiety for immobilizing the one or more enzymes to a matrix, where the affinity binding moiety may be AGT or ACT capable of binding benzyl guanine or analogs thereof (SNAP-tag® or CLIP-tag™ (New England Biolabs, Ipswich, Mass.)). Any of the enzymes may be fused to other moieties such as His-tags, chitin binding domains, antibodies or antibody fragments, protein A and maltose binding domain. Suitable matrices for enzyme immobilization include beads that are routinely or commercially available including magnetic beads. Where the enzymes in FRAG are immobilized on the same or different matrix, fragmented DNA can be removed in an eluant for combining with additional reagents.
[0043] Any or all the reagents in FRAG may be lyophilized prior to use for storage and rehydrated at the time of use by means of a reaction buffer and/or by addition of an environmental sample. FRAG may be stored in a standard storage buffer routinely used for DNA enzymes containing a buffer such as Tris/EDTA and a detergent and optionally a reducing agent such as DTT and glycerol.
[0044] In one embodiment, FRAG preferably contains a nickase that cleaves double-stranded DNA randomly for example: a DNase, Vvn nuclease and Micrococcal nuclease. This type of nuclease is referred to herein as a random double-strand DNA nickase. In some embodiments, a nickase may additionally be included in the mixture that is a sequence specific nickase. Examples include: T7 EndoI, Nt.CviPII, Nt.BstNBI, Nb.BtsI, Nb.BsrDI or other modified restriction endonuclease preferably that have a three base recognition sequence. A nickase may also include one or more glycosylases and/or glycosylase lyases. Preferably, at least one double-strand random DNA nickase is included in FRAG for fragmentation of DNA.
[0045] FRAG is polymerase free. Any type of polymerase, strand displacing or otherwise is omitted from FRAG and polymerases are specifically excluded throughout the fragmentation reaction that converts large DNA into fragments.
[0046] In one embodiment, the preferred double-strand random nickase is a DNase or an engineered variant thereof. Examples of a naturally occurring DNase include DNase I (e.g., from bovine (SEQ ID NO: 1)), TURBO™ DNase (Thermo Fisher Scientific, Waltham, Mass.) or other commercially available DNases. Examples of engineered variants of a DNase (derived from Bovine DNase SEQ ID NO: 1) are shown in
[0047] Fragmentation of DNA libraries may in different instances require a desired DNA fragment size. The desired sized fragments may fall into selected ranges such as 1000 bp-5000 bp or 150 bp-1000 bp. Where small DNA fragments (for example in the range of 150 bp-1000 bp) are desired, it may be preferred to utilize a more active DNase that will nick DNA rapidly and efficiently into the desired fragments. A more active DNase may be selected and/or a higher concentration of the DNase to obtain small fragments. Where large DNA fragments (for example 1000 bp-5000 bp or larger) are desired, it may be preferred to utilize a less active DNase that will nick DNA less rapidly and less efficiently into the desired fragment sizes. Alternatively, and/or in addition, lower concentrations of the DNase may be used.
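The qualitative relationship between nickase activity and fragment size can be made concrete with a back-of-the-envelope estimate. The following is a minimal sketch under a simplifying assumption that is not part of the disclosure: nicks fall independently on each strand with per-base probability `nick_prob`, and the duplex breaks wherever a top-strand nick has a bottom-strand nick within `max_offset` bases. Both names and the default value are hypothetical.

```python
def expected_mean_fragment_bp(nick_prob, max_offset=3):
    """Approximate mean fragment length under an independent-nick toy model."""
    if not 0 < nick_prob < 1:
        raise ValueError("nick_prob must be between 0 and 1")
    window = 2 * max_offset + 1
    # Chance that a position has a top-strand nick AND at least one
    # bottom-strand nick somewhere inside the surrounding window.
    break_prob = nick_prob * (1 - (1 - nick_prob) ** window)
    # For a long molecule, mean spacing between breaks is about 1 / break_prob.
    return 1 / break_prob

for p in (0.005, 0.01, 0.02):
    print(p, round(expected_mean_fragment_bp(p)))
```

Because the break probability scales roughly with the square of the nick density in this model, a modest change in DNase activity or concentration would be expected to shift the median fragment size substantially, which is one way to read the preference for less active DNase variants when large fragments are desired.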
[0048] The above description of parameters affecting the use of DNases in nicking DNA is not intended to preclude other temperatures of incubation and time for the nickase described or other equivalent enzymes.
TABLE 1: Wild-type Bovine DNase I (SEQ ID NO: 1)
LKIAAFNIRTFGETKMSNATLASYIVRIVRRYDIVL
IQEVRDSHLVAVGKLLDYLNQDDPNTYHYVVSEPL
GRNSYKERYLFLFRPNKVSVLDTYQYDDGCESCGN
DSFSREPAVVKFSSHSTKVKEFAIVALHSAPSDAV
AEINSLYDVYLDVQQKWHLNDVMLMGDFNADCSYV
TSSQWSSIRLRTSSTFQWLIPDSADTTATSTNCAY
DRIVVAGSLLQSSVVPGSAAPFDFQAAYGLSNEMA
LAISDHYPVEVTLT
[0049] While Thermus aquaticus (Taq) DNA ligase is an example of an NAD.sup.+ DNA ligase used in the examples, other NAD.sup.+ DNA ligases may be substituted in the fragmentation mixture. Examples of alternate ligases include: E. coli NAD.sup.+ DNA ligase (LigA), and archaeal NAD.sup.+ DNA ligases. The bacterial NAD.sup.+ ligases are highly conserved so it would be expected that NAD.sup.+ ligases from bacterial sources other than Taq and E. coli could be substituted.
[0050] Ligation may occur at the same time as nicking where the ratios of enzyme concentrations and kinetics determine the optimal fragmentation conditions. Ligation may also be performed after nicking by adding the ligase to the reaction tube after nicking has occurred. Ligation may occur after nicking despite the enzyme mix containing both nickase and ligase, where the two enzymes have different optimal temperatures. For example, the ligase may be activated by raising the temperature after nicking has occurred. In Example 1, nicking using DNase I predominantly occurred at 37° C. while ligation using Taq NAD.sup.+ ligase favored a temperature of 65° C. This example is not intended to preclude other temperatures of incubation and time for the enzymes described or other equivalent enzymes.
[0051] Mutant ligases may be used such as those that are stable at temperatures at which the nickase and nuclease are inactivated. For example, HiFi Taq Ligase (New England Biolabs, Ipswich, Mass.) is stable at temperatures as high as 65° C. and may be preferred where it is desirable that ligation occur after nicking and single-strand cleavage in an enzyme mixture contained in a single reaction vessel. FRAG may include one or more single-strand specific, randomly cleaving nucleases or engineered variants thereof. Examples of wild-type nucleases include mung bean nuclease, nuclease T, RecJ, ExoVII or a nuclease member of the S1 and P1 nuclease family. Members of the S1/P1 nuclease family are found in both eukaryotes and prokaryotes and have single-stranded nucleic acid as their primary substrate (see for example, Desai, et al. (2003) FEMS Microbiology Reviews, 26, 457-91). Well-known members include S1 nuclease, found in Aspergillus oryzae and Neurospora, and Nuclease P1, found in Penicillium citrinum.
Formulation of Enzyme Mixes Range
[0052] The enzyme mixes described herein contain active enzymes or enzymes capable of being active when the mixture is added to a reaction mixture. A “1×” FRAG refers to the concentration of enzymes in the fragmentation mixture after combination with DNA in a reaction mixture. In some embodiments, a “1×” FRAG enzyme mix contains one or more nickases at an effective concentration for producing the desired fragment size within a desired incubation time.
[0053] An example of an effective 1× concentration of DNase I is in the range of 0.001 ng/μl-0.25 ng/μl. The FRAG reagent or the separate component enzymes may be stored in liquid or lyophilized form in 2×-20× concentrations.
[0054] In one embodiment, one or more NAD.sup.+ ligases are provided in the 1×FRAG or separately at an effective concentration in the range of 0.03 ng/μl-8 ng/μl (1×) where the concentration range may be further modified beyond the range specified depending on the amount of nickase in the mix.
[0055] In one embodiment, one or more single-strand nucleases are provided in the 1×FRAG or separately at an effective concentration in the range of 0.0025 μg/μl-1.4 μg/μl (1×) where the concentration depends on the amount of ligase and nickase and the predicted extent of damage in the DNA to be fragmented and may accordingly be modified beyond the specified range. FRAG may contain a PNK, such as a kinase from T4 phage. T4 PNK concentration in the 1×FRAG or provided separately may be selected from the range 0.05 ng/μl-10 ng/μl for a 1× mixture.
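The 1× ranges and the 2×-20× stock concentrations above imply a simple dilution calculation. The following is a minimal sketch of that arithmetic; the dictionary, the function names and the choice to express every range in ng/μl are illustrative assumptions, and the range check is only a convenience since the text notes that the ranges may be modified.

```python
# 1x working ranges quoted in the text, all expressed in ng/ul
# (the nuclease range is converted from 0.0025-1.4 ug/ul).
RANGES_NG_PER_UL = {
    "DNase I": (0.001, 0.25),
    "NAD+ ligase": (0.03, 8.0),
    "single-strand nuclease": (2.5, 1400.0),
    "T4 PNK": (0.05, 10.0),
}

def stock_volume_ul(final_volume_ul, stock_fold):
    """Volume of an N-fold stock that gives 1x in the final reaction volume."""
    if not 2 <= stock_fold <= 20:
        raise ValueError("the text describes 2x-20x stocks")
    return final_volume_ul / stock_fold

def in_stated_range(enzyme, conc_ng_per_ul):
    """True if a 1x concentration lies inside the range quoted in the text."""
    low, high = RANGES_NG_PER_UL[enzyme]
    return low <= conc_ng_per_ul <= high

# Example: a hypothetical 10x stock added to a 40 ul reaction.
print(stock_volume_ul(40, 10))
print(in_stated_range("DNase I", 0.1))
```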
[0056] The enzymes used in FRAG are preferably cloned in a suitable strain for manufacture of recombinant proteins for example, E. coli.
Conditions for Fragmenting DNA Using FRAG
[0057] There are advantages to fragmentation of DNA to a particular size in a time frame that is as short as possible. Where large DNA for fragmentation is specified throughout, this can be any source of DNA that is preferably but not limited to high quality DNA. As shown in
[0058] Proteinase K or Thermolabile Proteinase K (see for example, U.S. Pat. No. 10,633,644) may be used in the preparation of high quality DNA prior to enzymatic fragmentation. Raising the temperature results in inactivation of the Proteinase K prior to the addition of fragmentation enzymes. This provides a streamlined process of purifying DNA and potentially fragmenting the DNA in a single tube without the need to change buffer to remove Proteinase K. The Monarch high molecular weight bead purification of DNA (New England Biolabs, Ipswich, Mass.) also may be streamlined for use with the enzyme fragmentation mix for a single tube extraction and fragmentation protocol. This streamlined workflow may be combined with steps for end repair, adapter ligation and sequencing libraries where the number of steps involving sample transfer are minimized preferably to a single reaction tube.
[0059] The use of the at least two enzymes in a mix as described above, provides the user with a plurality of choices. The user may select the time and temperature of incubation of the substrate with FRAG suitable for creating fragment sizes suitable for different sequencing platforms including short read sequencing such as Illumina® sequencing (Illumina, San Diego, Calif.) and long read sequencing such as Pacific Biosciences® instruments (Pacific Biosciences, Menlo Park, Calif.) or Oxford Nanopore sequencers (Oxford Nanopore Technologies, Oxford, UK).
[0060] The novel mix of enzymes in FRAG allows it to be used to fragment various DNA input for many types of NGS library preparation. Significantly, FRAG does not include a DNA polymerase. DNA polymerases remove DNA modifications from DNA and are therefore contraindicated. In one embodiment, 0.5 pg-3 μg high molecular weight DNA can be fragmented by incubating the DNA with FRAG for 1 minute-60 minutes, for example 5 minutes-30 minutes, at 1° C.-100° C., for example 24° C.-45° C., for example, 37° C. The fragmented DNA can then be made into a DNA library and amplified and/or sequenced using NEBNext® EM-seq™ (New England Biolabs, Ipswich, Mass.) or other standard techniques such as bisulfite sequencing, ChIP-seq, NicE-seq, ChIA-PET, etc.
End Product
[0061] The yield of intact fragments of target DNA using FRAG is determined by the reduced loss of sample during fragmentation compared to other methods.
[0062] Retention of modifications on the DNA after fragmentation was determined to be greater than 90%, more specifically greater than 93%, more specifically greater than 95% and as much as 97%. The fragment length can be tailored as necessary. In some embodiments, the median length of the fragments produced by the method may be in the range of 100 bp to 1 kb.
Processing of Fragments and Sequencing
[0063] If desired, the fragments can be A-tailed, ligated to adapters, and sequenced, for example. In some embodiments, the fragments (or adapter-ligated fragments) may be directly sequenced using, for example, nanopore sequencing methods such as that commercialized by Oxford Nanopore Technologies or single-molecule fluorescence-based methods such as that commercialized by Pacific Biosciences.
[0064] These technologies are capable of detecting modified nucleotides and, as such, the present fragmentation method may be used to produce samples to be sequenced by those technologies.
General Considerations
[0065] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Still, certain terms are defined herein with respect to embodiments of the disclosure and for the sake of clarity and ease of reference.
[0066] Sources of commonly understood terms and symbols may include: standard treatises and texts such as Kornberg and Baker, DNA Replication, Second Edition (W.H. Freeman, New York, 1992); Lehninger, Biochemistry, Second Edition (Worth Publishers, New York, 1975); Strachan and Read, Human Molecular Genetics, Second Edition (Wiley-Liss, New York, 1999); Eckstein, editor, Oligonucleotides and Analogs: A Practical Approach (Oxford University Press, New York, 1991); Gait, editor, Oligonucleotide Synthesis: A Practical Approach (IRL Press, Oxford, 1984); Singleton, et al., Dictionary of Microbiology and Molecular biology, 2d ed., John Wiley and Sons, New York (1994), and Hale & Markham, the Harper Collins Dictionary of Biology, Harper Perennial, N.Y. (1991) and the like.
[0067] As used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise. For example, the term “a protein” refers to one or more proteins, i.e., a single protein and multiple proteins. The claims can be drafted to exclude any optional element when exclusive terminology such as “solely” or “only” is used in connection with the recitation of claim elements or when a negative limitation is specified.
[0068] Aspects of the present disclosure can be further understood in light of the embodiments, section headings, figures, descriptions and examples, none of which should be construed as limiting the entire scope of the present disclosure in any way. Accordingly, the claims set forth below should be construed in view of the full breadth and spirit of the disclosure.
[0069] Each of the individual embodiments described and illustrated herein has discrete components and features which may be readily separated from or combined with the features of any of the other several embodiments without departing from the scope or spirit of the present teachings. Any recited method can be carried out in the order of events recited or in any other order which is logically possible. Numeric ranges are inclusive of the numbers defining the range. All numbers should be understood to encompass the midpoint of the integer above and below the integer i.e., the number 2 encompasses 1.5-2.5. The number 2.5 encompasses 2.45-2.55 etc. When sample numerical values are provided, each alone may represent an intermediate value in a range of values and together may represent the extremes of a range unless specified.
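The rounding convention stated above (2 encompasses 1.5-2.5, and 2.5 encompasses 2.45-2.55) amounts to half a unit in the last written decimal place on either side. A minimal sketch of that rule follows; the function name `encompassed_range` is hypothetical, and the number is passed as text so that the written precision is preserved.

```python
from decimal import Decimal

def encompassed_range(number_text):
    """Return (low, high) covered by a number as written, e.g. "2.5"."""
    value = Decimal(number_text)
    # One unit in the last written decimal place, e.g. 1 for "2", 0.1 for "2.5".
    unit = Decimal(10) ** value.as_tuple().exponent
    half_step = unit / 2
    return value - half_step, value + half_step

print(encompassed_range("2"))
print(encompassed_range("2.5"))
```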
[0070] In the context of the present disclosure, “non-naturally occurring” refers to a polynucleotide, polypeptide, carbohydrate, lipid, or composition that does not exist in nature. Such a polynucleotide, polypeptide, carbohydrate, lipid, or composition may differ from naturally occurring polynucleotides, polypeptides, carbohydrates, lipids, or compositions in one or more respects. For example, a polymer (e.g., a polynucleotide, polypeptide, or carbohydrate) may differ in the kind and arrangement of the component building blocks (e.g., nucleotide sequence, amino acid sequence, or sugar molecules). A polymer may differ from a naturally occurring polymer with respect to the molecule(s) to which it is linked. For example, a “non-naturally occurring” protein may differ from naturally occurring proteins in its secondary, tertiary, or quaternary structure, by having a chemical bond (e.g., a covalent bond including a peptide bond, a phosphate bond, a disulfide bond, an ester bond, an ether bond, and others) to a polypeptide (e.g., a fusion protein), a lipid, a carbohydrate, or any other molecule. Similarly, a “non-naturally occurring” polynucleotide or nucleic acid may contain one or more other modifications (e.g., an added label or other moiety) to the 5′-end, the 3′ end, and/or between the 5′- and 3′-ends (e.g., methylation) of the nucleic acid. A “non-naturally occurring” composition may differ from naturally occurring compositions in one or more of the following respects: (a) having components that are not combined in nature, (b) having components in concentrations not found in nature, (c) omitting one or more components otherwise found in naturally occurring compositions, (d) having a form not found in nature, e.g., dried, freeze dried, crystalline, aqueous, and (e) having one or more additional components beyond those found in nature (e.g., buffering agents, a detergent, a dye, a solvent or a preservative).
[0071] All publications, patents, and patent applications mentioned in this specification including U.S. Provisional 63/193,667, filed May 27, 2021, are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.
EXAMPLES
[0072] In order to further illustrate some embodiments of the present invention, the following specific examples are given with the understanding that they are being offered to illustrate examples of the present invention and should not be construed in any way as limiting its scope.
Example 1: Fragmentation of Genomic DNA for NEBNext EM-seq
[0073] (a) FRAG
[0074] 50 ng of NA12878 DNA (human European female genome obtained from the International Genome Sample Resource) was spiked with 0.1 ng CpG methylated pUC19 and 2 ng unmethylated lambda (NEBNext EM-seq controls) in a total volume of 26 μl in water. This was combined with FRAG (fragmentation enzyme mix containing 0.001-0.25 ng/μl NEB stock DNase I solution, 0.03-8 ng/μl NEB stock Taq ligase, 0.0025-1.4 μg/μl NEB stock P1 nuclease and 0.05-10 ng/μl NEB stock PNK solution) and a volume of buffer to a total of 14 μl, and incubated for 20 minutes at 37° C. followed by 30 minutes at 65° C. in PCR strip tubes.
[0075] (b) Mechanical Shearing
[0076] 0.1 ng CpG methylated pUC19 and 2 ng unmethylated lambda (NEBNext EM-seq controls) were spiked into 50 ng NA12878 DNA (human European female genome obtained from the International Genome Sample Resource) in a total volume of 50 μl of 0.1×TE in a Covaris 8 microTUBE-50 AFA Fiber H Slit Strip V2, and the sample was mechanically sheared using a Covaris ME220 instrument set at 350 bp. The 50 μl of mechanically sheared DNA was then pipetted into a PCR strip tube.
[0077] Following fragmentation (FRAG and mechanical shearing), the NEBNext EM-seq workflow was followed according to manufacturer's instructions (EM-seq Manual) with six PCR cycles for both fragmentation methods. The libraries were quantified on an Agilent D5000 HS TapeStation® (Agilent, Santa Clara, Calif.). The libraries were then sequenced on an Illumina NextSeq® (Illumina, San Diego, Calif.) 2×76 sequencing run.
Example 2: Investigation of Bias for NEBNext EM-Seq Libraries Fragmented by FRAG Versus Mechanical Shearing
[0078] The NEBNext EM-seq libraries were prepared as described in Example 1. The results are shown for human DNA methylation in
[0079] The overall aggregated methylation for all three DNA inputs (human, pUC19 and lambda) was comparable between FRAG and mechanical shearing.
[0080] The expected and observed human CpG methylation was ~50%, with CHG and CHH <1%, as presented in
[0081] The expected and observed pUC19 CpG methylation was ~97%, with CHG and CHH <1%.
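For orientation, the per-context percentages reported in this example are of the kind obtained by dividing methylated cytosine calls by all cytosine calls in that context (CpG, CHG or CHH). The following is a minimal sketch of that arithmetic. The function name and the counts are hypothetical placeholders chosen only to make the calculation concrete; they are not data from this disclosure and do not represent the analysis pipeline actually used.

```python
def percent_methylation(methylated: int, unmethylated: int) -> float:
    """Percent of cytosine calls in one context that were read as methylated."""
    total = methylated + unmethylated
    if total == 0:
        raise ValueError("no cytosine calls in this context")
    return 100.0 * methylated / total


# Placeholder counts (methylated, unmethylated) per context.
# These numbers are invented for illustration and are NOT measured values.
example_counts = {
    "CpG": (970, 30),
    "CHG": (5, 995),
    "CHH": (8, 1992),
}

for context, (meth, unmeth) in example_counts.items():
    print(f"{context}: {percent_methylation(meth, unmeth):.1f}%")
```

Comparing fragmentation methods then amounts to computing these percentages separately for each library and each control.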
[0082] The expected and observed lambda methylation was <1% for CpG, CHG and CHH. The results are shown in
Example 3: Investigation of Yield for NEBNext EM-Seq Libraries Fragmented by FRAG Versus Mechanical Shearing
[0083] The EM-seq libraries were prepared as described in Example 1. The overall NEBNext EM-seq library yields were higher with FRAG than with mechanical shearing (same number of PCR cycles), as determined by an Agilent D5000 HS TapeStation. All libraries are shown as technical duplicates.
[0084] Enzyme Fragmentation with FRAG:
[0085] 50 ng of FFPE Liver DNA in a total volume of 26 μl in water was combined with 4 μl of FRAG and 14 μl of FRAG buffer (final total volume of 44 μl) and incubated for 20 minutes at 37° C. followed by 30 minutes at 65° C. in PCR strip tubes.
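The reaction assembly described above can be checked with simple arithmetic. The following sketch sums the stated component volumes, derives the resulting DNA concentration in the assembled reaction, and totals the two incubation steps. The variable names are illustrative assumptions; only the volumes, the DNA mass and the incubation times and temperatures are taken from the text.

```python
# Values stated in the example: 50 ng DNA in 26 ul water, 4 ul FRAG, 14 ul FRAG buffer.
DNA_NG = 50.0
components_ul = {
    "DNA in water": 26.0,
    "FRAG enzyme mix": 4.0,
    "FRAG buffer": 14.0,
}

# Total reaction volume and DNA concentration in the assembled reaction.
total_ul = sum(components_ul.values())
dna_ng_per_ul = DNA_NG / total_ul

# Incubation program as (temperature in deg C, minutes).
program = [(37, 20), (65, 30)]
total_minutes = sum(minutes for _, minutes in program)

print(f"Total volume: {total_ul:.0f} ul")
print(f"DNA concentration: {dna_ng_per_ul:.2f} ng/ul")
print(f"Total incubation: {total_minutes} minutes")
```

The summed volume agrees with the 44 μl final volume stated in the example.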
[0086] Mechanical Shearing:
[0087] 50 ng of FFPE Liver DNA, in a total volume of 50 μl of 0.1×TE in a Covaris 8 microTUBE-50 AFA Fiber H Slit Strip V2, was mechanically sheared using a Covaris ME220 instrument set at 350 bp. The 50 μl of mechanically sheared DNA was then pipetted into a PCR strip tube.
[0088] Following FRAG or mechanical fragmentation, the NEBNext Ultra II DNA workflow was followed according to the manual (NEBNext Ultra II DNA Manual) with eight PCR cycles for both fragmentation methods.
[0089] The libraries were quantified on an Agilent® D1000 HS TapeStation® (Agilent Technologies, Santa Clara, Calif.). The libraries were then sequenced on an Illumina NextSeq® (Illumina, San Diego, Calif.) 2×76 sequencing run.
[0090] The results are shown in
Example 4: Use of FRAG for FFPE Input Results in Better Sequencing Metrics than Other Fragmentation Methods
[0091] The FFPE libraries were prepared as described in Example 1. FFPE Liver DNA was fragmented using Enzymatic Fragmentation (white) and mechanical shearing (striped). The sequencing metrics measured for FFPE inputs were improved with FRAG compared to Covaris mechanical shearing, including higher mapping rates and more properly paired reads, and lower percentages of chimeras and fold-back reads (inverted repeats in the sequencing data caused by DNA polymerases switching strands). All libraries are shown as technical duplicates, and an equal number of reads was used across libraries. The results are shown in
Example 5: Fragmentation for FFPE DNA Results in Lower Artificial Mutation Frequency than Mechanical Shearing
[0092] The FFPE libraries were prepared as described in Example 1. FFPE Liver DNA was fragmented using Enzymatic Fragmentation (white) and mechanical shearing (striped). FFPE DNA is often highly damaged, for example by cytosine deamination, which results in artifactually higher rates of C to T and G to A transitions. The mutation frequencies for both C to T and G to A were lower for FFPE inputs with FRAG compared to mechanical shearing. All libraries are shown as technical duplicates, and an equal number of reads was used. The results are shown in
Example 6: Fragmentation Time Course for FRAG with High-Quality DNA
[0093] 50 ng of human DNA (NA12878) was prepared in a total volume of 26 μl in water for the FRAG reaction. 4 μl of FRAG enzyme mix and 14 μl of FRAG buffer (total volume of 44 μl) were added to this DNA, and the reaction was incubated for 5-30 minutes at 37° C. followed by 30 minutes at 65° C. in PCR strip tubes. Fragmentation occurs during the 37° C. incubation step of FRAG. Following FRAG, the NEBNext Ultra II DNA workflow was followed according to the manufacturer's instructions (NEBNext Ultra II DNA Manual) with four PCR cycles. Table 1 provides an example of average library size and fragmentation pattern (Agilent TapeStation D5000 HS) based on fragmentation time. Incubation time can be optimized for individual samples (see
TABLE 1
Average Library Size (bp)    Incubation at 37° C.
400                          30 minutes
500                          25 minutes
600                          20 minutes
700                          15 minutes
900                          10 minutes
1,200                        5 minutes
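As an aid to choosing a starting incubation time, the values in Table 1 can be interpolated. The following is a minimal sketch, assuming that average library size changes linearly between adjacent rows of Table 1. That assumption is not stated in the disclosure, and the function name `estimate_minutes` is hypothetical. Because incubation time can be optimized for individual samples, the output should be treated only as a starting estimate.

```python
# (minutes at 37 deg C, average library size in bp), taken from Table 1.
TABLE_1 = [(5, 1200), (10, 900), (15, 700), (20, 600), (25, 500), (30, 400)]


def estimate_minutes(target_bp: float) -> float:
    """Estimate the 37 deg C incubation time for a target average library size.

    Assumption: linear change in size between adjacent Table 1 rows.
    Targets outside the tabulated range are rejected rather than extrapolated.
    """
    sizes = [bp for _, bp in TABLE_1]
    if not min(sizes) <= target_bp <= max(sizes):
        raise ValueError("target size is outside the range covered by Table 1")
    # Walk adjacent rows; size decreases as incubation time increases.
    for (t0, s0), (t1, s1) in zip(TABLE_1, TABLE_1[1:]):
        if s1 <= target_bp <= s0:
            fraction = (s0 - target_bp) / (s0 - s1)
            return t0 + fraction * (t1 - t0)
    raise AssertionError("unreachable: target lies within the table range")


for target in (1200, 800, 450):
    print(f"{target} bp: about {estimate_minutes(target)} minutes")
```

Rejecting out-of-range targets is a deliberate choice in this sketch, since Table 1 gives no information about behavior below 5 minutes or above 30 minutes.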
[0094] Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it is readily apparent to those of ordinary skill in the art in light of the teachings of this invention that certain changes and modifications may be made thereto without departing from the spirit or scope of the appended claims.