Library preparation
11365408 · 2022-06-21
Assignee
Inventors
- Bin Li (Palo Alto, CA)
- Benjamin G. Schroeder (San Mateo, CA)
- Manqing Hong (Belmont, CA, US)
- Maureen Peterson (Oakland, CA, US)
Cpc classification
C40B50/06
CHEMISTRY; METALLURGY
C12N15/1065
CHEMISTRY; METALLURGY
C12N15/1093
CHEMISTRY; METALLURGY
C12Q1/6806
CHEMISTRY; METALLURGY
C40B40/06
CHEMISTRY; METALLURGY
International classification
C12N15/10
CHEMISTRY; METALLURGY
C40B40/06
CHEMISTRY; METALLURGY
Abstract
The disclosure provides DNA library preparation methods that do not require a purification step between adaptor ligation and PCR amplification. Adaptors are added to DNA fragments to form oligonucleotide extension products, and the oligonucleotide extension products are amplified without stopping or interruption for a cleanup step. Excess materials from the adaptor addition step, such as enzymes, adaptors, or co-factors, do not interfere with the amplification step, and the amplification step proceeds without regard to the presence of reagents from the ligation step. In preferred embodiments, the ligation and amplification steps make use of a common priming sequence, e.g., in the form of one of the adaptor oligos.
Claims
1. A method of preparing a sequencing library, the method comprising: obtaining a plurality of DNA fragments from a sample; introducing at least partially double-stranded adaptors to the plurality of DNA fragments; ligating a first strand of the adaptors to 5′ ends of the DNA fragments to form adaptor-ligated fragments; and amplifying without purifying the adaptor-ligated fragments in the presence of excess adaptors not ligated to one of the DNA fragments to form a plurality of amplicons, wherein the first strand comprises a barcode sequence.
2. The method of claim 1, wherein amplifying the adaptor-ligated fragments includes adding amplification primers that compete with the first strand of the adaptors.
3. The method of claim 1, further comprising extending a free 3′ end of the DNA fragment by polymerase to copy the first strand of the adaptor.
4. The method of claim 1, wherein the first strand of the adaptor is at least a few nucleotides longer than a second strand of the adaptor.
5. The method of claim 1, wherein amplifying the adaptor-ligated fragments includes using the first strand of the adaptor as an amplification primer.
6. The method of claim 1, further comprising purifying the amplicons to remove excess material.
7. The method of claim 1, further comprising attaching the amplicons to the surface of a flow cell surface to form sequencing clusters.
8. The method of claim 1, further comprising fragmenting a nucleic acid from the sample to obtain the plurality of DNA fragments.
9. The method of claim 1, wherein the amplification step occurs in the presence of excess ligation reagents that include one or more of co-factors, enzymes, and polyethylene glycol.
10. A method of preparing a sequencing library, the method comprising: obtaining a plurality of DNA fragments from nucleic acid from a sample; incubating the DNA fragments with adaptor oligos to form oligonucleotide extension products, wherein at least a first adaptor oligo is ligated to a fragment and at least a second adaptor oligo hybridizes to the fragment and is extended by a polymerase to form a sequence complementary to a target and complementary to the first adaptor oligo; and amplifying without purifying the oligonucleotide extension products in the presence of excess adaptor oligos not ligated to one of the DNA fragments to form a plurality of amplicons, wherein the first adapter oligo comprises a barcode sequence.
11. The method of claim 10, wherein copies of the second adaptor oligo function as primers during the amplification step.
12. The method of claim 10, further comprising purifying the amplicons to remove excess material.
13. The method of claim 10, further comprising attaching the amplicons to a flow cell surface to form sequencing clusters.
14. The method of claim 13, further comprising sequencing the amplicons to determine a sequence of the nucleic acid.
15. The method of claim 10, further comprising fragmenting the nucleic acid from the sample to obtain the plurality of DNA fragments.
16. The method of claim 10, wherein the amplification step occurs in the presence of excess ligation reagents that include one or more of co-factors, enzymes, and polyethylene glycol.
17. The method of claim 1, wherein the first strand comprises one or more of a sequence used in cluster formation, one or more barcode priming site, and one or more sequencer priming site.
18. The method of claim 10, wherein the first adapter oligo comprises one or more of a sequence used in cluster formation, one or more barcode priming site, and one or more sequencer priming site.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
DETAILED DESCRIPTION
(12) The disclosure relates to simplified library preparation methods for next-generation sequencing of nucleic acids.
(13) Methods of the disclosure include adding adaptors to DNA fragments (e.g., by ligating a free end of an adaptor that includes at least partially dsDNA to a free 5′ end of a DNA fragment, or insert) and amplifying the adaptor-ligated fragments without an intervening cleanup step (and optionally by using an un-ligated strand from one or more of the adaptors as an amplification primer).
(14) In certain embodiments, two adaptor sequences are ligated to the 5′ ends of an insert, then the 3′ ends of the insert are extended to copy the adaptor sequences. The copy of the adapter sequence becomes the priming site for the PCR primers, which are the same as the long, or ligation, strand of the adapter. The long strand of the adapter represents some or preferably all of the sequence used in cluster formation in addition to barcodes, barcode priming sites and sequencer priming sites. The short oligo of the adapter can be ligatable to the 3′ end (and get extended) or not ligate (in which case it only serves to enable DNA ligase, which expects dsDNA, to interact with the adapter).
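The ligation-then-extension scheme described above can be illustrated with a toy string model. This is a minimal sketch, not the disclosed implementation: the sequences are hypothetical placeholders (not real adaptor sequences), the insert is assumed to be blunt-ended, and the function names are invented for illustration. It shows why, after the 3′ ends are extended, the long strand itself can anneal to both product strands and act as a primer.

```python
# Toy string model of 5' adaptor ligation followed by 3' extension.
# All sequences below are hypothetical placeholders.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def ligate_and_extend(insert_top, long_strand):
    """Model both strands of a blunt insert after ligation and extension.

    The long adaptor strand is joined to the 5' end of each insert strand.
    The 3' end of each insert strand is then extended, copying the long
    strand that was ligated to the opposite insert strand.
    """
    insert_bottom = revcomp(insert_top)
    top = long_strand + insert_top + revcomp(long_strand)
    bottom = long_strand + insert_bottom + revcomp(long_strand)
    return top, bottom


def can_prime(primer, template):
    """True if the primer is complementary to the 3' end of the template."""
    return template.endswith(revcomp(primer))


LONG_STRAND = "ACGTTGCAAGGCTA"   # hypothetical long (ligation) strand
INSERT = "GGATCCTTAAGCGCATTA"    # hypothetical insert, top strand

top, bottom = ligate_and_extend(INSERT, LONG_STRAND)
print(can_prime(LONG_STRAND, top), can_prime(LONG_STRAND, bottom))
```

In this model the two product strands are exact reverse complements of each other, and each ends in the copied adaptor sequence, which is the priming site the paragraph refers to.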
(15) Optionally, a short oligo that does not ligate is used, so that the 3′ extension initiates at the 3′ end of the DNA insert/fragment as opposed to the 3′ end of the short oligo. Using high concentrations of adapter ensures that sufficient un-ligated oligo will be available to serve as PCR primer. If the short oligo is blocked at both ends, it can be neither ligated nor extended, which makes for a cleaner library and less concern regarding interference in PCR. The scenario where adapter oligos are present but do not interfere with PCR is similar, but in that case the residual ligated oligo of the adapter must either be diluted out with a longer oligo (to provide full sequence) or partially degraded to have a lower Tm than the oligos added for the PCR step.
(16) Here we demonstrate that DNA library prep can be accomplished with only a single bead cleanup, performed after the PCR step. We demonstrate a workflow that is compatible with both mechanical and enzymatic shearing of DNA. We show the use of adaptors that allow for both ligation and PCR amplification, without addition of distinct PCR primers.
(17) Benefits of the disclosure include methods in which post-ligation bead cleanup can be eliminated; a three step, single bead-cleanup protocol generates high quality libraries; and methods in which adaptors serve also as PCR primers.
(18) Other embodiments are within the scope of the disclosure.
(19) In some embodiments, methods of the disclosure include adding sequencing adaptors to DNA fragments (by ligation, hybridization, and extension) to form oligonucleotide extension products and amplifying the oligonucleotide extension products without any intervening purification or wash steps. When a sequencing library is prepared according to methods of the disclosure, material present after adaptor ligation—which may include excess molecular entities such as enzymes and adaptors as well as co-factors or other reagents—does not prevent a successful amplification reaction, which simplifies a library preparation workflow.
(20)
(21) Methods of the disclosure may be used to produce libraries used in next-generation sequencing starting with as little as 10 pg of double-stranded DNA. The library construction workflow is suitable for a wide range of sequencing applications including RNA-Seq, Digital Gene Expression (DGE), genomic DNA sequencing, target capture, amplicon sequencing, ChIP-Seq and more. These libraries are suitable for sequencing on Illumina sequencing platforms.
(22)
(23) The adaptor addition 125 yields adaptor-ligated fragments 213 which include an arbitrary sense strand 215 and a complementary strand 214. The adaptor-ligated fragments 213 proceed to amplification 129, which includes melting the adaptor-ligated fragments 213 and further includes hybridizing primers to the arbitrary sense strand 215 and the complementary strand 214.
(24) In an embodiment, the ligation adaptor oligos are not used as the library amplification primers. For example, the ligation adaptor oligos may not be full length. For example, the long adaptor oligo may be 30 bases from the 3′ end of Illumina adaptors. In the amplification, the PCR primers can be added, which may be longer and add the rest of the full length Illumina adaptors to the amplified library.
(25) Although there may be competition between PCR primers and the long adaptor oligos in the amplification, the full length library is still made (as shown by data in the appended Examples). Several distinct embodiments are contemplated, e.g., embodiments in which (i) an adaptor oligo is present and competes with an amplification primer; (ii) an adaptor oligo functions as an amplification primer; and/or (iii) a single-primer extension is used, in which a first adaptor is ligated and a second adaptor hybridizes to the fragment and is extended over the first adaptor. Common to all of these embodiments is the lack of any requirement of a cleanup or purification step between adaptor addition and amplification.
(26) Thus the disclosure provides a library preparation method in which adaptors are added to fragments which are then amplified without an intervening bead cleanup or purification step. Material from the adaptor addition step, including excess adaptor, may be present during the amplification and the included results show that those materials do not interfere with successful amplification to produce a library suitable for NGS sequencing.
(27) In an optional embodiment, one of the primers is provided by the long strand 205 of the adaptors 201 (which adaptors 201 had been added in excess). The long strand 205 of the adaptor 201 thus hybridizes to the complementary strand 214 of the adaptor-ligated fragments 213 and is extended, at the core of the amplification 129 steps.
(28) Illustrated were certain possible steps according to certain possible embodiments. In such embodiments, two adaptor sequences are ligated to the 5′ ends of the insert, and the 3′ ends of the insert are then extended to copy the adaptor sequences. The copy of the adapter sequence becomes the priming site for the PCR primers, which are the same as the long, or ligation, strand of the adapter. In order for this to work, the long strand of the adapter needs to represent the entire sequence used in cluster formation in addition to barcodes, barcode priming sites and sequencer priming sites. The short oligo of the adapter can be ligatable to the 3′ end (and get extended) or not ligate (in which case it only serves to enable DNA ligase, which expects dsDNA, to interact with the adapter). It may be preferable to use a short oligo that does not ligate, such that the 3′ extension initiates at the 3′ end of the DNA insert/fragment as opposed to the 3′ end of the short oligo. Using high concentrations of adapter ensures that sufficient unligated oligo will be available to serve as PCR primer. The illustrated methods of DNA library prep can be accomplished with no more than a single bead cleanup after the PCR step. The workflow is compatible with both mechanical and enzymatic shearing of DNA. The adaptors allow for both ligation and PCR amplification, without addition of distinct PCR primers.
(29) Other embodiments are within the scope of the invention.
(30)
(31) At step 125, the DNA fragments are incubated with adaptor oligos to form adaptor-ligated fragments in which at least a first adaptor oligo is ligated to a fragment and at least a second adaptor oligo hybridizes to the fragment and is extended by a polymerase to form a sequence complementary to a target and complementary to the first adaptor oligo. For details, see U.S. Pat. No. 9,650,628, incorporated by reference. Important sub-steps of forming the oligonucleotide extension products are stated as follows. The adaptor addition 125 includes (a) appending a first adaptor to a 5′ end of each DNA fragment; (b) annealing second adapter oligos to the DNA fragments, whereby the second adapter oligos have a 3′ portion that is complementary to a sequence of interest present in one or more of the fragments, and a 5′ portion comprising a second adapter sequence; and (c) extending the second adapter oligos with a polymerase thereby generating one or more oligonucleotide extension products with the first adaptor at a first end and a second adaptor sequence at a second end.
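Sub-steps (a) through (c) can be sketched as a simple string model. This is an illustrative sketch under stated assumptions, not the method of U.S. Pat. No. 9,650,628: the sequences, the exact-match annealing rule, and the function name `extend_second_adaptor` are all hypothetical. The template is one fragment strand carrying the first adaptor at its 5′ end; the second adaptor oligo has a 5′ adaptor portion and a 3′ portion complementary to a sequence of interest.

```python
# Toy model of single-primer extension over a first-adaptor-ligated strand.
# Sequences and the exact-match annealing rule are simplifying assumptions.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def extend_second_adaptor(template, second_adaptor, probe):
    """Return the extension product (5'->3'), or None if the probe finds no site.

    template       -- fragment strand, 5'->3', with the first adaptor at its 5' end
    second_adaptor -- 5' portion of the second adaptor oligo
    probe          -- 3' portion of the oligo, complementary to the sequence of interest
    """
    site = revcomp(probe)            # sequence of interest as it appears in the template
    start = template.find(site)
    if start == -1:
        return None                  # no annealing site in this fragment
    # The polymerase extends the 3' end of the oligo toward the 5' end of the
    # template, copying everything upstream of the site, first adaptor included.
    return second_adaptor + probe + revcomp(template[:start])


FIRST_ADAPTOR = "ACGTTGCAAGGCTA"      # hypothetical
SECOND_ADAPTOR = "TTGACCGGTCAA"       # hypothetical
INSERT = "GGATCCTTAAGCGCATTA"         # hypothetical, contains the site AAGCGC
PROBE = revcomp("AAGCGC")             # 3' portion targeting that site

template = FIRST_ADAPTOR + INSERT     # step (a): first adaptor on the 5' end
product = extend_second_adaptor(template, SECOND_ADAPTOR, PROBE)  # steps (b), (c)
print(product)
```

The product carries the second adaptor sequence at its 5′ end and the complement of the first adaptor at its 3′ end, matching the description of an extension product with one adaptor at each end.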
(32) The method 101 further includes amplifying 129 the oligonucleotide extension products in the presence of the adaptor oligos to form a plurality of amplicons. Copies of the second adaptor oligo function as primers during the amplification step. The entire workflow including fragmentation can be completed quickly, and yields DNA libraries ready for cluster formation and either single read or paired-end sequencing 135. Importantly, in the method 101 the steps of adaptor addition 125 and amplification 129 may be performed without an intervening purification step such as a bead wash. In fact, the second adaptor oligo of the ligation 125 step may serve as an amplification primer in the amplification step 129. Additionally, it may be found that other ligation materials (excess adaptors, co-factors such as Mg, PEG, enzymes such as ligase) simply do not interfere with amplification 129. Thus the method 101 may include (d) amplifying 129 the one or more oligonucleotide extension product using the second adaptor oligo as a primer. Methods may include steps described in U.S. Pat. No. 9,650,628, incorporated by reference for all purposes.
(33) In addition to use with genomic and other double-stranded DNA sources, methods may be used with input RNA. Importantly, for DNA sequencing applications, low abundance samples can be input directly to the library construction workflow without the need for pre-amplification. Methods of the disclosure produce DNA libraries suitable for either single read or paired-end sequencing on sequencing platforms such as Illumina platforms, without the need for gel-based size selection.
(34)
(35) The library preparation method 101 may include three stages: DNA fragments are obtained 107, and the method further includes end repair of sheared DNA 313; adaptor ligation 325; and amplification 329. It may be preferable to use a positive control DNA to allow the establishment of a baseline of performance.
(36) In general, the method will proceed according to a workflow that includes setting up and thawing the indicated reagents. Reagents and reaction tubes may be thawed, prepared, and kept on ice. After thawing and mixing buffer mixes, if any precipitate is observed, the buffers may be re-mixed/re-dissolved, gently warmed, and briefly vortexed. Generally, enzymes and primers are not warmed. Standard pipetting techniques are observed. Steps of the method 101 may be performed using a thermal cycler.
(37)
(38) In some embodiments, the method 301 includes DNA fragmentation 307. Any suitable fragmentation method may be used including mechanical, chemical, or enzymatic fragmentation.
(39) In certain embodiments, intact gDNA is diluted into 120 µL of 1× low-EDTA TE buffer, transferred into a Covaris snap-cap microtube, and fragmented to the desired insert size following Covaris recommended settings.
(40) Preferred embodiments of the method 101 include end repair 313. End repair 313 may include use of an end repair buffer mix (e.g., as sold by NuGEN Technologies, San Carlos, Calif.), end repair enzyme mix, end repair enhancer and nuclease-free water. The reagents are mixed and incubated according to manufacturer's instructions. The end-repair step 313 may proceed in a thermal-cycler programmed to run Program 1 (End Repair; see
(41) After end repair 313, the blunt-ended fragments proceed to adaptor addition 125. Adaptors and associated reagents are added to the tubes according to manufacturer's instructions. In preferred embodiments, all samples intended to share the same sequencing flow cell lane should have unique ligation adaptors. In some embodiments, the ligation reaction will proceed in a thermal cycler (Ligation; see
(42) An insight of the disclosure is that purification such as a bead wash after the adaptor addition 125 and before the amplification step 129 is not required and, in fact, an adaptor used in addition 125 can be carried over and used in amplification. After addition 125, excess adaptors (and other reagents or materials) may be present among the oligonucleotide extension products. Those excess materials may include magnesium or other metals, other co-factors, phosphate, polyethylene glycol (PEG), enzymes such as ligase, excess adapters, and blunt ended fragments.
(43) Importantly, the method 101 may proceed without a purification step.
(44) The method 101 proceeds to library amplification 129. For amplification, amplification enzymes, buffer, and primer mixes are added and mixed according to manufacturer's instructions. Amplification 129 may proceed in a pre-warmed thermal cycler programmed to run Program 3 (Library Amplification; see
(45) After amplification 129, it may be desirable to perform purification 135 of the amplicons 131. For DNA purification, one may choose a nucleic acid column-based purification system that allows small volume elution, such as the reaction cleanup kit sold under the trademark MINELUTE by Qiagen. A bead-based purification protocol provided by Agencourt is described here for convenience.
(46) Suspend beads in nuclease-free water at room temperature by inverting and tapping the tube.
(47) Introduce the bead suspension to the DNA sample in micro-centrifuge tubes and mix by pipetting.
(48) Transfer the PCR tubes containing the bead-sample mixture to the magnet and let stand 5 minutes to completely clear the solution of beads.
(49) Remove and discard binding buffer; wash with ethanol.
(50) Air dry beads on magnet.
(51) Add 1× low-EDTA TE buffer or nuclease-free water to the dried beads.
(52) Transfer tubes to magnet; remove eluate.
(53) Remove from magnet and set aside.
(54) For the bead purification step 135, follow the manufacturer's instructions. The above outline is given to aid in comprehension of the order of the steps. For precise reagents, timing, and volumes, see the manufacturer's instructions. Proceed to any QC steps; for example, one may optionally perform a quantitative and qualitative assessment of the library by running the samples on the Bioanalyzer DNA 1000 Chip.
(55) Sequences of the Barcodes in the Multiplexed Reactions. Barcode sequences for the 32- and 96-plex Adaptor Plates are given in the manufacturer's instructions. All barcode sequences are separated by an edit distance of three. For further details on the barcode design strategy, please refer to Faircloth BC, Glenn TC (2012) Not All Sequence Tags Are Created Equal: Designing and Validating Sequence Identification Tags Robust to Indels. PLoS ONE 7(8):e42543, incorporated by reference.
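The edit-distance property mentioned above can be checked computationally. The following is a minimal sketch assuming Levenshtein distance (substitutions, insertions, and deletions each cost one), which is the usual sense of "edit distance" for indel-robust tags. The barcodes listed are hypothetical placeholders, not the sequences from the Adaptor Plates, and the function names are invented for illustration.

```python
from itertools import combinations


def edit_distance(a, b):
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion from a
                            curr[j - 1] + 1,      # insertion into a
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def min_pairwise_distance(barcodes):
    """Smallest edit distance found between any two barcodes in the set."""
    return min(edit_distance(x, y) for x, y in combinations(barcodes, 2))


# Hypothetical barcode set, for illustration only.
BARCODES = ["AACCGG", "CCGGTT", "GGTTAA", "TTAACC"]

REQUIRED_DISTANCE = 3  # threshold taken from the text
print(min_pairwise_distance(BARCODES) >= REQUIRED_DISTANCE)
```

A set that passes this check tolerates a single sequencing error (including an indel) in a barcode read without that read becoming identical to another barcode in the set.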
INCORPORATION BY REFERENCE
(56) References and citations to other documents, such as patents, patent applications, patent publications, journals, books, papers, web contents, have been made throughout this disclosure. All such documents are hereby incorporated herein by reference in their entirety for all purposes.
EQUIVALENTS
(57) Various modifications of the invention and many further embodiments thereof, in addition to those shown and described herein, will become apparent to those skilled in the art from the full contents of this document, including references to the scientific and patent literature cited herein. The subject matter herein contains important information, exemplification, and guidance that can be adapted to the practice of this invention in its various embodiments and equivalents thereof.
EXAMPLES
Example 1. Traditional Library Prep Requires Post-Ligation Bead Cleanup
(58) First, the effect of ligation reaction components on PCR was investigated by real-time PCR. Real-time PCR reactions using NuGEN OVATION Universal RNA-Seq System PCR reaction components, supplemented with a final 1× EvaGreen dye, were prepared with 10-fold serial dilutions of RNA-Seq library and 2-fold serial dilutions of ligation reaction components.
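The experimental design in this example is a two-factor grid: each library dilution is tested against each dilution of ligation reaction components. A minimal sketch of how such a grid could be enumerated follows. The dilution factors (10-fold and 2-fold) come from the text; the number of dilution steps and the starting relative amounts are assumptions made only for illustration.

```python
from itertools import product


def serial_dilution(start, factor, steps):
    """Relative amounts for a serial dilution: start, start/factor, start/factor^2, ..."""
    return [start / factor ** i for i in range(steps)]


N_STEPS = 4  # assumed; the text does not state how many dilutions were run

library_inputs = serial_dilution(1.0, 10, N_STEPS)      # 10-fold series of RNA-Seq library
ligation_carryover = serial_dilution(1.0, 2, N_STEPS)   # 2-fold series of ligation components

# Every library dilution paired with every ligation-component dilution.
conditions = list(product(library_inputs, ligation_carryover))
for lib, lig in conditions:
    print(f"library {lib:g}x, ligation components {lig:g}x")
```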
(59)
(60)
(61)
Example 2. Ligation Adaptors can Also Serve as PCR Primers
(62) Adaptors supplied with the NuGEN OVATION Ultralow System V2 can serve as PCR primers. The surprising and unexpected result is that by eliminating the post-ligation bead purification and allowing the unligated adaptors to participate in PCR instead of adding the supplied PCR primers, robust amplification without adaptor artifacts could be achieved. This is demonstrated by using 100 ng or 10 ng of DNA fragmented to 300 bp by Covaris as input into the Ultralow v2 end repair and ligation reactions following the standard protocol. After ligation, which was performed in the standard 30 ul volume, 25.5 ul of Amp Buffer Mix, 2 ul of Amp Enzyme Mix, and 42.5 ul of water were added to prepare a 100 ul PCR reaction. The 100 ng and 10 ng reactions were subjected to 9 and 12 cycles of PCR, respectively, following the cycling conditions described in the Ultralow user guide; the PCR products were then bead purified and analyzed by Bioanalyzer.
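The pairing of input amount with cycle number can be put in rough numerical terms. The sketch below assumes ideal PCR in which product doubles every cycle, which real reactions do not achieve, so the figures are upper bounds in arbitrary relative units rather than predicted yields. The interpretation that the three extra cycles approximately offset the ten-fold lower input is an assumption of this sketch, not a statement from the source.

```python
def ideal_fold_amplification(cycles):
    """Upper-bound fold amplification, assuming perfect doubling every cycle."""
    return 2 ** cycles


# Input mass (ng) -> number of PCR cycles, as given in the text.
CONDITIONS = {100: 9, 10: 12}

for input_ng, cycles in CONDITIONS.items():
    fold = ideal_fold_amplification(cycles)
    relative_yield = input_ng * fold  # arbitrary units; assumes all input is amplifiable
    print(f"{input_ng} ng, {cycles} cycles: up to {fold}-fold, relative yield {relative_yield}")
```

Under these idealized assumptions the two conditions land within about 25% of each other (51200 versus 40960 relative units), since three additional cycles give an 8-fold gain against a 10-fold difference in input.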
(63) Methods of the disclosure may be used with a single-primer enrichment technology (SPET) target enrichment method as well as the UltraLow library system. In the UltraLow method, two adaptor sequences are ligated to the 5′ ends of the insert, and the 3′ ends of the insert are extended to copy the adaptor sequences. The copy of the adapter sequence becomes the priming site for the PCR primers, which are the same as the long, or ligation, strand of the adapter.
(64) In order for this to work, the long strand of the adapter preferably represents the entire sequence used in cluster formation in addition to barcodes, barcode priming sites and sequencer priming sites. The short oligo of the adapter can be ligatable to the 3′ end (and get extended) or not ligate (in which case it only serves to enable DNA ligase, which expects dsDNA, to interact with the adapter).
(65) It may be preferable to use a short oligo that does not ligate, so that the 3′ extension initiates at the 3′ end of the DNA insert/fragment as opposed to the 3′ end of the short oligo. Using high concentrations of adapter ensures that sufficient unligated oligo will be available to serve as PCR primer. If the short oligo is blocked at both ends, it can be neither ligated nor extended. This makes for a cleaner library and less concern regarding its interference in PCR. The scenario where adapter oligos are present but do not interfere with PCR is similar, but in that case the residual ligated oligo of the adapter must either be diluted out with a longer oligo (to provide full sequence) or partially degraded to have a lower Tm than the oligos added for the PCR step.
(66)
(67) Based on the data, certain observations and conclusions may be made. The field of Next Generation Sequencing DNA library preparation is dominated by Illumina. One of their most widely used library prep kits is the TruSeq® Nano DNA Library Prep. Starting with fragmented DNA, the protocol consists of End Repair, Bead cleanup, A-tailing, Y-Adaptor Ligation, Bead cleanup, PCR, and a final Bead cleanup, for a total of 3 Bead cleanups and 5.5 hours to complete the protocol. A version of the NuGEN Ovation® Ultralow Library System offers a simplified workflow requiring only End Repair, Ligation, Bead cleanup, PCR, and a final Bead cleanup, for a total of two Bead cleanups and 4 hours to complete the protocol. It has been understood in the field that purification must be performed after ligation in order to remove adaptors and other ligation reaction components such as high concentrations of magnesium and PEG which are incompatible with the subsequent PCR step. Furthermore, both protocols require distinct PCR primers in order to amplify a functional final sequencing library.
(68) Here, the data show that those long held beliefs are incorrect, and that DNA library prep can be accomplished with only a single Bead cleanup after the PCR step. The demonstrated workflow is compatible with both mechanical and enzymatic shearing of DNA. Here, NuGEN-style adaptors allow for both ligation and PCR amplification (i.e., with an adapter functioning as a PCR primer), without addition of distinct PCR primers (an approach not possible with the Illumina Y-adaptor approach).
(69) Methods of the disclosure thus provide (1) a process whereby the post-ligation bead cleanup may be eliminated; (2) a three step, single bead cleanup protocol that generates high quality libraries; and (3) adaptors that serve also as PCR primers.
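The workflow comparison drawn in this example can be tabulated directly from the step lists given in the text. In the sketch below, the step sequences and the completion times for the two commercial kits are taken from the description; the step list for the disclosed workflow is reconstructed from the "three step, single bead cleanup" wording, and its completion time is left unset because the source does not state one. The dictionary layout and names are illustrative only.

```python
WORKFLOWS = {
    "TruSeq Nano": ["End Repair", "Bead cleanup", "A-tailing", "Y-Adaptor Ligation",
                    "Bead cleanup", "PCR", "Bead cleanup"],
    "Ovation Ultralow": ["End Repair", "Ligation", "Bead cleanup", "PCR", "Bead cleanup"],
    # Reconstructed from the description of the disclosed method.
    "Disclosed": ["End Repair", "Ligation", "PCR", "Bead cleanup"],
}

# Hours to complete, as stated in the text; None where the source gives no figure.
HOURS = {"TruSeq Nano": 5.5, "Ovation Ultralow": 4.0, "Disclosed": None}


def count_cleanups(steps):
    """Number of bead cleanup steps in a workflow."""
    return steps.count("Bead cleanup")


def cleanup_between(steps, first, second):
    """True if a bead cleanup occurs between two named steps."""
    segment = steps[steps.index(first) + 1:steps.index(second)]
    return "Bead cleanup" in segment


for name, steps in WORKFLOWS.items():
    print(name, count_cleanups(steps), HOURS[name])
```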
(70) As shown in
(71) Here, we have shown that ligation adaptors can also serve as PCR primers. A surprising and unexpected result is that by eliminating the post-ligation bead purification and allowing the un-ligated adaptors to participate in PCR instead of adding the supplied PCR primers, robust amplification without adaptor artifacts could be achieved.
(72) This is demonstrated by using 100 ng or 10 ng of DNA fragmented to 300 bp by Covaris as input into the Ultralow v2 end repair and ligation reactions following the standard protocol. After ligation, which was performed in the standard 30 ul volume, 25.5 ul of Amp Buffer Mix, 2 ul of Amp Enzyme Mix, and 42.5 ul of water were added to prepare a 100 ul PCR reaction. The 100 ng and 10 ng reactions were subjected to 9 and 12 cycles of PCR, respectively, following the cycling conditions described in the Ultralow user guide; the PCR products were then bead purified and analyzed by Bioanalyzer. The resulting Bioanalyzer traces shown in
Example 3. Illumina Y-Adaptors Cannot Serve as PCR Primers
(73)
Example 4. Present Invention is Compatible with Enzymatic Fragmentation
(74) Intact genomic DNA was fragmented in a 15 ul reaction containing 2 mU HL-dsDNase (ArcticZymes), 6 U E. coli DNA Polymerase I (NEB), 1.5 U T4 DNA Polymerase, 1×NEBuffer 2 (NEB) and 0.2 mM dNTPs under the following temperature profile: 25 C for 15 min, 65 C for 15 min, 4 C hold. The NuGEN Ultralow v2 ligation and PCR components were used to perform ligation and PCR as follows. Ligation was performed by adding 3 ul of Ligation Adaptor Mix, 5 ul of Ligation Buffer Mix, and 2 ul of Ligation Enzyme Mix for a total of 25 ul. After the standard ligation incubation steps of 25 C for 30 min, 70 C for 10 min, and 4 C hold, PCR components were added directly to the ligation reaction, without bead purification. 25.5 ul of Amp Buffer Mix, 2.5 ul of Amp Primer Mix, 2 ul of Amp Enzyme Mix, and 45 ul of water were added to prepare a 100 ul PCR reaction. After 9 cycles of PCR following the cycling conditions described in the Ultralow user guide, the PCR products were bead purified and analyzed by Bioanalyzer.
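Because reagents are added to the same tube at each stage with no cleanup in between, the volumes accumulate. The sketch below simply re-adds the volumes stated in this example to confirm the stated totals and to compute what fraction of the final PCR volume is carried-over ligation reaction. The variable names are illustrative, and the carry-over fraction is a derived figure, not one reported in the source.

```python
# Volumes in microliters, as stated in Example 4.
FRAGMENTATION_UL = 15.0

LIGATION_ADDITIONS = {
    "Ligation Adaptor Mix": 3.0,
    "Ligation Buffer Mix": 5.0,
    "Ligation Enzyme Mix": 2.0,
}

PCR_ADDITIONS = {
    "Amp Buffer Mix": 25.5,
    "Amp Primer Mix": 2.5,
    "Amp Enzyme Mix": 2.0,
    "water": 45.0,
}

# Each stage is added on top of the previous one; nothing is removed.
ligation_total = FRAGMENTATION_UL + sum(LIGATION_ADDITIONS.values())
pcr_total = ligation_total + sum(PCR_ADDITIONS.values())

# Fraction of the PCR reaction that is un-purified ligation reaction.
carryover_fraction = ligation_total / pcr_total

print(ligation_total, pcr_total, carryover_fraction)
```

The totals reproduce the 25 ul ligation and 100 ul PCR volumes given in the text, so the un-purified ligation reaction makes up one quarter of the PCR volume in this example.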
(75)
REFERENCES
(76) TRUSEQ Nano DNA Library Prep guide, file name truseq-nano-dna-library-prep-guide-15041110-d.pdf, available from support.illumina.com, Illumina, Inc., San Diego, Calif. (40 pages). The OVATION Ultralow System V2 User guide, part No. 0344, 0344NB, file name M01379_v5_User_Guide_Ovation_Ultralow_Library_Systems_V2_(Part_No._0344)_2215.pdf, available from nugen.com, NuGEN Technologies Inc., San Carlos, Calif. (30 pages).