SINGLE MOLECULE CONTROLS
20210285035 · 2021-09-16
Inventors
Cpc classification
C12Q2525/161
CHEMISTRY; METALLURGY
C12Q2537/16
CHEMISTRY; METALLURGY
C12Q2545/107
CHEMISTRY; METALLURGY
C12Q1/6834
CHEMISTRY; METALLURGY
C12Q2537/16
CHEMISTRY; METALLURGY
C12Q1/6834
CHEMISTRY; METALLURGY
C12Q2545/107
CHEMISTRY; METALLURGY
International classification
Abstract
Methods for obtaining a reaction volume having a predetermined copy number, such as a single copy, of a known nucleic acid molecule therein are disclosed. Such a reaction volume may be useful as a control for nucleic acid amplification reactions. Nucleic acid constructs useful in such methods are also described. The constructs include a first region and a second region, wherein the second region is flanked by first and second primer binding sites to allow amplification across the second region, and wherein a selectively cleavable region is located between the first and second regions, with the selectively cleavable region being flanked by third and fourth primer binding sites to allow amplification across the selectively cleavable region. The second region acts as a reporter to indicate the presence of the entire nucleic acid molecule; and the selectively cleavable region can be cleaved to separate the first region from the second region, leaving an isolated copy of the first region.
Claims
1. A nucleic acid cassette comprising a nucleic acid molecule comprising a first region and a second region, wherein the second region is flanked by first and second primer binding sites to allow amplification across the second region, and wherein a selectively cleavable region is located between the first and second regions, with the selectively cleavable region being flanked by third and fourth primer binding sites to allow amplification across the selectively cleavable region.
2. The nucleic acid cassette of claim 1 wherein the first region is flanked by fifth and sixth primer binding sites to allow amplification across the first region.
3. The nucleic acid cassette of claim 2 wherein the fifth and sixth primer binding sites correspond to primer binding sites flanking a desired test nucleic acid.
4. The nucleic acid cassette of any preceding claim wherein the selectively cleavable region is a restriction enzyme binding and/or cleavage site.
5. The nucleic acid cassette of any preceding claim comprising multiple second regions, each flanked by primer binding sites.
6. The nucleic acid cassette of claim 5 wherein each second region is separated from its neighbour by a selectively cleavable region.
7. The nucleic acid cassette of claim 6 wherein each selectively cleavable region is identical.
8. The nucleic acid cassette of any preceding claim comprising multiple first regions.
9. The nucleic acid cassette of claim 8 wherein each of the multiple first regions is different.
10. The nucleic acid cassette of any preceding claim wherein the first region is a mimic region, with a sequence selected so as to have properties mimicking those of a desired test nucleic acid sequence.
11. The nucleic acid cassette of claim 10 wherein the properties mimicking those of a desired test nucleic acid sequence comprise one or more properties selected from the group consisting of length, G/C composition, absence of repeat sequences, and capacity to form secondary structures.
12. The nucleic acid cassette of any preceding claim wherein the second region is a reporter region.
13. The nucleic acid cassette of any preceding claim, wherein the cassette is functionalised with a reactive group in the vicinity of the first region, preferably at an end of the cassette.
14. The nucleic acid cassette of any preceding claim, wherein the cassette is immobilised on a solid support.
15. A reaction mix comprising the nucleic acid cassette of any preceding claim.
16. A method for obtaining a predetermined copy number of a known nucleic acid molecule, the method comprising the steps of: (a) preparing a plurality of reaction mix volumes containing a nucleic acid cassette according to any of claims 1 to 13 such that at least some of said volumes are statistically likely to contain a desired predetermined copy number of said cassette; (b) combining each of the plurality of volumes of (a) with an agent capable of cleaving the selectively cleavable region of the cassette, to thereby separate the first region and the second region; (c) combining each of the plurality of volumes of (b) with nucleic acid primers capable of binding to the third and fourth primer binding sites flanking the selectively cleavable region of the cassette, and conducting an amplification reaction, to thereby amplify a sequence across the selectively cleavable region in those volumes where cleavage did not take place; and discarding those volumes where amplification took place; (d) combining each of the remaining plurality of volumes of (c) with nucleic acid primers capable of binding to the first and second primer binding sites flanking the second region, and conducting an amplification reaction, to thereby amplify a sequence across the second region; and discarding those volumes where no amplification took place; (e) to thereby provide a remaining plurality of volumes each of which comprises a predetermined copy number of a nucleic acid molecule comprising the first region.
17. The method of claim 16 wherein the plurality of reaction mix volumes of (a) are prepared as a water-in-oil emulsion of an aqueous solution comprising the cassette.
18. The method of claim 17 wherein the concentration of the aqueous solution is selected such that this results in a statistical distribution within the volumes such that the majority of volumes will include no cassette, while at least some include the desired copy number of cassettes.
19. The method of any of claims 16 to 18 wherein the desired copy number of the nucleic acid molecule is one.
20. The method of any of claims 16 to 19 wherein the majority of reaction mix volumes in (a) include no copies of the cassette.
21. The method of any of claims 16 to 20 wherein one or more, and preferably all, of the combining steps are carried out by fusion or combination of two or more reaction mix volumes.
22. The method of any of claims 16 to 21 wherein the reaction mix volumes are in the form of droplets.
23. The method of any of claims 16 to 22 wherein nucleic acid amplification is polymerase chain reaction (PCR) amplification.
24. The method of any of claims 16 to 23 wherein step (d) further comprises quantitating the amplified second region, and discarding those reaction mixes where the amount of amplified nucleic acid is above and/or below a predetermined threshold.
25. The method of any of claims 16 to 24 further comprising the step of binding the nucleic acid molecules comprising the first region contained in the volumes of (e) to a solid support.
26. The method of claim 25 further comprising the step of combining the solid support with a reaction mix volume containing a test nucleic acid, and allowing the test nucleic acid to hybridise to the bound nucleic acid(s) on the solid support.
27. The method of claim 25 further comprising the steps of: a′) contacting i) the solid support having a nucleic acid molecule comprising the first region bound thereto with ii) a solution comprising a plurality of nucleic acid molecules, at least one of which is a target nucleic acid molecule, wherein at least a portion of the target nucleic acid molecule is complementary to a portion of the nucleic acid molecule attached to the solid support; b′) allowing the nucleic acid molecule attached to the solid support to hybridise to the target nucleic acid molecule; and c′) removing the solid support from the solution; to thereby isolate the hybridised target nucleic acid molecule in a known copy number.
28. The method of claim 27 wherein the nucleic acid molecule comprising the first region is double stranded, and wherein only a single strand of the double stranded molecule is attached to the solid support.
29. The method of any of claim 27 or 28, wherein at least a portion of the nucleic acid molecule comprising the first region is not complementary to a corresponding portion of the target nucleic acid molecule.
30. The method of claim 29 wherein a non complementary portion of the nucleic acid molecule comprising the first region is at an end of the molecule which is not attached to the solid support.
31. The method of any of claims 27 to 30 wherein the solution comprises significantly more copies of the target nucleic acid molecule than the known copy number.
32. The method of any of claims 27 to 31 wherein the solid support is a polymer bead.
33. The method of any of claims 27 to 32 further comprising the step d′) delivering the solid support and the target nucleic acid molecule to a reaction vessel or a reaction volume.
34. The method of claim 33 further comprising the step e′) extending the captured target nucleic acid molecule by a polymerisation reaction using the nucleic acid molecule comprising the first region as a template, thereby incorporating additional sequence into the captured target nucleic acid molecule.
35. The method of claim 33 or 34 further comprising the step f′) amplifying at least a portion of the target nucleic acid molecule to provide a plurality of copies of the amplified portion.
36. The method of any of claims 16 to 24 further comprising the step of adsorbing the nucleic acid molecules comprising the first region contained in the volumes of (e) onto or into a solid support.
37. The method of claim 36 wherein the solid support is hydrophilic and lipophobic.
38. The method of claim 36 or 37 wherein the solid support is a cellulose-based matrix.
39. The method of any of claims 16 to 24 or 36 to 38, wherein the first region is flanked by fifth and sixth primer binding sites which correspond to primer binding sites flanking a desired test analyte nucleic acid sequence; the method further comprising the step of combining the volumes obtained in (e) with a reaction mix volume containing a test nucleic acid and a primer pair which will bind to the fifth and sixth primer binding sites; performing nucleic acid amplification on the combined reaction mix; and determining whether nucleic acid amplification has taken place of i) the nucleic acid molecules comprising the first region; and/or ii) a portion of the test nucleic acid.
40. The method of any of claims 16 to 24 further comprising the step of combining the volumes obtained in (e) with an additional reaction mix having different physical properties.
41. A method for performing a nucleic acid assay to detect the presence of a test nucleic acid in a sample, the method comprising: (a) combining a sample containing a test nucleic acid with a nucleic acid molecule of known copy number having a first region which is flanked by fifth and sixth primer binding sites which correspond to primer binding sites flanking a desired test nucleic acid sequence; and with a primer pair which binds to the fifth and sixth primer binding sites; (b) performing a nucleic acid amplification reaction on the combined sample and reaction mix; (c) determining whether nucleic acid amplification has taken place of i) the nucleic acid molecule comprising the first region; and/or ii) a portion of the test nucleic acid.
42. The method of claim 41, wherein the nucleic acid molecule of known copy number is prepared according to the method of any of claim 12 to 38 or 40.
43. The method of claim 41 or 42 wherein the known copy number is one.
44. A product comprising a hydrophilic and lipophobic solid support, having a nucleic acid molecule of known copy number adsorbed thereon.
45. The product of claim 44, wherein the solid support comprises a cellulose-based matrix.
46. The product of claim 44, wherein the solid support comprises a polymeric bead.
47. The product of claims 44 to 46, wherein the nucleic acid molecule comprises a first region flanked by fifth and sixth primer binding sites which correspond to primer binding sites flanking a desired test nucleic acid sequence.
48. The product of any one of claims 44 to 47, wherein the nucleic acid molecule of known copy number is prepared according to the method of any of claim 12 to 38 or 40.
49. A method for isolating a target nucleic acid molecule in a known copy number, the method comprising the steps of: a) contacting i) a solid support having a nucleic acid molecule of known copy number attached thereto with ii) a solution comprising a plurality of nucleic acid molecules, at least one of which is a target nucleic acid molecule, wherein at least a portion of the target nucleic acid molecule is complementary to a portion of the nucleic acid molecule attached to the solid support; b) allowing the nucleic acid molecule attached to the solid support to hybridise to the target nucleic acid molecule; and c) removing the solid support from the solution; to thereby isolate the hybridised target nucleic acid molecule in a known copy number.
50. The method of claim 49, wherein the solid support having a nucleic acid molecule of known copy number attached thereto is prepared in accordance with any of claims 14 or 25 to 38.
51. The method of any of claim 49 or 50 wherein the nucleic acid molecule of known copy number is double stranded, and wherein only a single strand of the double stranded molecule is attached to the solid support.
52. The method of any of claims 49 to 51, wherein at least a portion of the nucleic acid molecule of known copy number is not complementary to a corresponding portion of the target nucleic acid molecule.
53. The method of claim 52 wherein a non complementary portion of the nucleic acid molecule of known copy number is at an end of the molecule which is not attached to the solid support.
54. The method of claim 52 or 53 wherein a non complementary portion of the nucleic acid molecule of known copy number is at an end of the molecule which is attached to the solid support.
55. The method of any of claims 49 to 54 wherein the solution comprises significantly more copies of the target nucleic acid molecule than the known copy number.
56. The method of any of claims 49 to 55 wherein the solid support is a polymer bead.
57. The method of any of claims 49 to 56 further comprising the step d) delivering the solid support and the target nucleic acid molecule to a reaction vessel or a reaction volume.
58. The method of claim 57 further comprising the step e) extending the captured target nucleic acid molecule by a polymerisation reaction using the nucleic acid molecule of known copy number as a template, thereby incorporating additional sequence into the captured target nucleic acid molecule.
59. The method of claim 57 or 58 further comprising the step f) amplifying at least a portion of the target nucleic acid molecule to provide a plurality of copies of the amplified portion.
Description
SUMMARY OF THE DRAWINGS
[0060]
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
DETAILED DESCRIPTION OF THE INVENTION
[0087] The present invention is intended to permit the generation of a reaction mix or reaction volume containing a controlled copy number (usually one) of a known and defined nucleic acid sequence. This volume may be used as a control in a molecular diagnostic assay. Although the invention is primarily described herein in terms of providing an artificial control sample, it will be appreciated that there are other fields where it may be desirable to generate a sample containing a known copy number of a known nucleic acid; the invention is thus not limited to generation of control sequences.
[0088] The sensitivity and specificity of DNA testing technologies has increased in recent years, such that it is now possible to detect a single molecule of a test analyte DNA sequence by a number of means, including clonal amplification of that test analyte DNA sequence and mass detection of the amplified product by, for example, Next Generation Sequence analysis.
[0089] However, the detection of single molecules (or very low single digit copies) is challenging. Any sufficiently developed assay must enjoy very high sensitivity and very high specificity, but frequently maximising one of these compromises the other. In the investigation of blood stream infection (BSI) for example, it may be clinically significant (and a patient may be very sick) where there is as low as 1 colony forming unit (CFU) of the pathogen per millilitre of whole blood. It is necessary to rapidly identify the pathogen and perhaps antimicrobial resistance (AMR) genes that might confer resistance to specific classes of antibiotic. With a minimal blood draw of 10 ml and acceptance that preparation of DNA from the pathogen will incur loss of a proportion of that DNA, it is not unreasonable to think that a molecular diagnostic test will have to be capable of accurately detecting low single digit copies of the target analyte in a background of large quantities (both physical and sequence-content) of ‘contaminating’ DNA, and reporting a result that can be relied upon by the clinician when selecting appropriate therapy.
[0090] The use of controls, run concomitantly with the test sample, has long been an accepted means of giving confidence that a diagnostic test is performing within the expected validation parameters. This is of particular relevance in demonstrating that the sensitivity of the assay is performing at the challenging levels of low single molecules in the test sample, but until now there has been no simple and reliable method that enables control samples to be generated to accurately reflect the very low copy number that might be encountered in a test sample.
[0091] The invention disclosed herein enables the provision of an artificial test sample that reliably mimics the very low copy number and the sequence context of a test analyte. Provision of such a control, which is co-amplified along with the test analyte, enables the performance of the assay to be confirmed, and the veracity of any test result generated to be confirmed. Even in situations where there is a negative test result, the veracity of this as a ‘true negative’ is confirmed to a greater extent where the control, run at a level of just a single molecule, does generate a convincing positive result.
[0092] Although amplification of very small quantities of DNA is prone to stochastic variability during early cycles, running a control that generates an output using the same amplification primer sequences as the test analyte, in the same tube as the test analyte, gives effective ‘normalisation’ of many experimental variables that might otherwise affect the efficiency of the test assay. Such variables as absolute concentration of reagents, pipetting errors and temperature fluctuations, plasticware and operator variables are automatically controlled for when the control and the test are run in the same reaction vessel. These variables can confound the usefulness of a control, if performed in a separate reaction vessel from the test assay. Other factors that can be designed to ensure that control and test sequences amplify with the same efficiency are for example the length of the amplicon, as there is a selective bias to amplify short amplicons with greater efficiency. The ‘GC’ content of the intervening sequence may also impact how efficiently sequences of identical length are amplified. These and other factors might be designed deliberately into the control, where it may be beneficial to make that control slightly ‘harder to amplify’, and therefore more prone to failure, as any good control should be. If designed to be as functionally equivalent as possible, then the primer binding sites, the length and the GC content and distribution may be normalised, together with other sequence considerations, such as runs of homopolymer and capacity to form secondary structures.
[0093] A detailed description of the invention is now given, for illustrative purposes only. In this example, the detection of a blood stream infection pathogen is used as an exemplar, in which it might be expected that any pathogen nucleic acids indicative of infection would likely be present at very low (single digit) copy number. The example also envisages the use of microdroplets as reaction volumes, and to contain the nucleic acid cassette; such microdroplets are amenable to manipulation and combination by using microfluidics techniques. Combination of a first droplet with additional reagents can be achieved relatively straightforwardly by fusion of two droplets. Again, however, the invention is not limited to the use of microdroplets.
[0094]
[0095] In order to provide a control for the diagnostic detection of the test analyte 101, an artificial nucleic acid construct is provided. This artificial construct is called the CONTROL CASSETTE 200, as shown in
[0099] Importantly, the susceptible site 203 is located between the analyte mimic 201 and the reporter 202. The susceptible site 203 could be, for example, a restriction endonuclease cleavage recognition sequence. The sequences of the analyte mimic 201 and the reporter 202 are not necessarily derived from any naturally occurring sequence, and therefore may be any optimally designed sequence. However, typically the analyte mimic 201 will be selected to have at least some similarity (for example, length, G/C content, etc) to the test analyte 101, but equally to be distinguishable therefrom when both sequences are amplified.
[0100] Whilst the DNA sequences of the analyte mimic 201 and reporter 202 are unconstrained in their selection, the DNA sequence of the susceptible site 203 and the sequences flanking all three of the regions 201, 202 and 203 are constrained such that specific amplification primers can be provided to hybridise to these sites.
[0101] Primers 301 and 302 are designed to amplify the reporter 202, and primers 303 and 304 are designed to amplify through the susceptible site 203 provided the nucleic acid cassette is intact. In certain embodiments, one or other or both of primers 303 and 304 may include 5′ tails of non-template nucleotides, to aid strand displacement of these primers when annealed. However, this is not believed to be essential, and in preferred embodiments the primers do not include tails.
[0102] Primers 102 and 103 are selected so as to allow hybridisation to both the cassette and the test sample, and are therefore based on DNA sequences found in the test sample. However, primers 301, 302, 303 and 304 are not necessarily derived from any naturally occurring sequences and can be freely optimised. In particular, primers 301, 302, 303, and 304 can be designed such that there is no (or minimal) unwanted hybridisation between these primers and the target sites of primers 102 and 103, or indeed these primers and their respective non-target sites. This design strategy reduces the chances of unintended amplification of analyte mimic 201; specifically, the primers should not be able to hybridise anywhere which could result in the unwanted duplication of the mimic region 201 since this could clearly compromise the ability to produce a single copy of that region in the final product. Likewise, the sequence of the mimic region 201 can also be designed so as to reduce or eliminate unwanted hybridisation. It will be understood, therefore, that only the sequences of the primers 102 and 103, and their respective binding sites, are constrained by the desired target sequence; other sequences may in certain embodiments be freely designed in order to optimise performance. In certain embodiments, the respective melting temperatures of a primer to its target may also or instead be designed to arrive at a preferred range of melting temperatures of different primer:target hybridisations. For example, primers 301 and 302 may have a lower melting temperature than primers 303 and 304. Such an arrangement may permit a further check against amplification of an unwanted region during performance of the present methods.
[0103] This invention provides a method whereby a single copy (or other known copy number) of the analyte mimic 201 may be delivered to a reaction volume, and which may therefore be used as a control in a diagnostic assay where the presence of the test analyte 101 is being assessed. The first step of this process is to distribute the control cassette 200 construct into isolated volumes, such as by formation of an emulsion of ‘water-in-oil’ droplets or other small volume reaction chambers. The format of ‘water-in-oil’ droplets will be used as an exemplar in
[0104] A known concentration of the control cassette 200 is prepared in aqueous solution, such that when combined with oil, small volume (nano, pico or femtolitre range) droplets are created. The concentration of the control cassette in the initial solution is sufficiently low that the overwhelming majority of the ‘water-in-oil’ droplets formed will be entirely devoid of the control cassette 200; these droplets are identified in
[0105] The concentration of the initial solution, and the volume of oil used, may be varied to achieve a desired final distribution of the cassette in the droplets.
[0106] There are a number of means that ‘water-in-oil’ droplets can be formed, including vigorous mixing, sonication or directly injecting the aqueous solution through a narrow constriction into the oil. This final method is preferred as it gives the greatest potential to control the uniform volume of the resulting droplets. Whichever method is chosen, the final result of the process is a very large number of droplets, maintained separately to each other (through, for example, the inclusion of a detergent within the aqueous solution).
[0107] The mixed population of droplets 401 and 402 must now be individually identified and separated, which is achieved by sequential utilisation of the susceptible site 203 and the reporter 202 that are linked to the analyte mimic 201 on the control cassette 200.
[0108] If the 401/402 population were to be analysed directly by, for example, amplifying the reporter 202, then this would risk unwanted amplification of the mimic 201 which is still on the same molecule as the reporter 202. For example, one possible method of confirming the presence of the reporter 202 would be to combine the individual droplets of the 401/402 mixture with a separate species of droplet enclosing biochemistry including primers 301 and 302, and subjecting the combined droplets to an amplification reaction. The presence of the reporter 202 could then be confirmed, through (for example) monitoring the accumulation of the reporter 202 amplicon. This can be achieved by including a dye that increases in fluorescence in the presence of increasing accumulation of dsDNA. However, if this is done on the intact cassette, there is the potential that the primer 302 would extend through the reporter 202, through the binding site for the primer 301, through the binding site for primer 304, through the intact susceptible site 203 and its primer site 303 and on through the region represented by the analyte mimic 201, and its associated primer binding sites 102 and 103. This undesirable duplication would mean that the analyte mimic 201 would no longer be present as a single copy within the droplet. This undesirable copying of the analyte mimic 201 would be linear (as opposed to exponential) and would only duplicate the analyte mimic 201 on one strand. Duplication of the analyte mimic 201 must therefore be prevented by introducing a physical blockage to the passage of the actively extending primer 302.
[0109] The nature of the blockade that would prevent the passage of the extending primer 302 could be, for example; [0110] Hybridisation of a high-affinity 5′ phosphorylated (or other 5′-3′ exonuclease digestion-refractory modification) oligonucleotide to the strand-separated template DNA, necessarily prior to initiation of extension from the reverse primer 302; [0111] Incorporation or generation of abasic sites; [0112] Inclusion of ‘PCR stoppers’, such as HEG (hexaethylene glycol); [0113] Generation of strand nicks or physical breaks in the DNA.
[0114] However, systems that rely on the hybridisation of blocking molecules cannot be guaranteed to be 100% effective, and systems that rely on the direct inclusion of abasic sites, HEG, or other chemical blockers mean the control DNA is no longer ‘natural’, and is less amenable to future manipulation, as might be necessary to introduce new control sequences for new targeted test analytes or to generate copies of the sequence for manufacturing purposes.
[0115] The present invention therefore makes use of a method whereby the mimic 201 and reporter 202 are physically separated prior to detection of the reporter 202. This ensures that amplification of the reporter 202 does not inadvertently also bring about the risk of amplification of the mimic 201. The most attractive and effective means of preventing the extension of reverse primer 302 through analyte mimic 201 during the polymerase-driven identification of the reporter 202 would be to physically break the covalent linkage of the analyte mimic 201 and reporter 202, necessarily after the intact control cassette 200 has been delivered to a specific droplet 401. This physical breakage can be achieved by the inclusion of the susceptible site 203 between analyte mimic 201 and the reporter 202 (but confirmed as being absent from any other regions within the control cassette 200). Restriction enzyme digestion of the cassette 200 using an enzyme which recognises or cuts at the susceptible site 203 will cleave the cassette 200 into two parts. As the susceptible site 203 is flanked by DNA sequences that can support hybridisation of oligonucleotide primers 303 and 304, then amplification across this region can be used to confirm whether cleavage has taken place or not. See
[0116] Initially, in order to determine which of the droplets within the mixture of 401 and 402 droplets do indeed contain the control cassette 200, each one of these droplets must be treated as if it does contain this construct and attempts to inactivate the susceptible site 203 must be made (by restriction digestion, for example). The droplets contained within mixture 401 and 402 are therefore individually merged with a further species of droplet, 701, that contains the necessary biochemistry to effect inactivation (cleavage) of the susceptible site 203, as shown in
[0117] In order to discriminate droplets 702, 703 and 704, polymerisation across the susceptible site 203 with primers 303 and 304 is effected (see
[0118] Note that it is preferable to initially assess the inactivation of the susceptible site 203, rather than identifying the (relatively small number of) droplets that contained the reporter 202 region first. This is because assessing reporter 202 followed by susceptible site 203 would necessitate discrimination of a positive fluorescent result building on a previously positive fluorescent result. Although it would not be impossible, it would be clearer and more preferred to have a positive result generated (from assessment of reporter 202) subsequent to a negative result (from failure to amplify across the susceptible site 203). This strategy however demands assessment of a very great number of species 704 droplets, the majority of droplet species devoid of the control cassette 200.
[0119] In certain embodiments of the invention, however, it may be possible to assess the reporter 202 first, if two different dyes (for example, fluorescing at different wavelengths) were used to assess the reporter 202 and susceptible site 203. This might be achieved for example by using TaqMan probes, which generate fluorescence after cleavage of a probe during Taq polymerase extension. TaqMan probes could be targeted against the reporter (Wavelength 1 positive result indicative of presence) and then the susceptible site 203 (Wavelength 2 failure to generate a positive response indicative of successful cleavage of the susceptible site 203).
[0120] Droplets 803 and 804, which failed to amplify across the susceptible site 203, must now be discriminated through assessment of the presence of the reporter 202. These droplets are individually merged with droplet species 901 (see
[0121] Once merged, droplets 902 and 903 are thermal cycled under conditions that will amplify the reporter 202 region, if present. Only droplet 902 will demonstrate this amplification and generate an increase in fluorescence as the SYBR Green dye binds to the amplicon generated.
[0122] Droplet 902 has therefore been generated from a progenitor droplet that has sequentially; [0123] Failed to demonstrate an intact susceptible site 203 [0124] Positively demonstrated the presence of reporter 202
[0125] This droplet 902 is therefore confidently determined to include a nucleic acid molecule comprising the analyte mimic 201 and flanking primer binding sites, which has been separated from the reporter 202 by cleavage of the susceptible site 203. This droplet will contain a single copy of the analyte mimic 201 (in the majority of cases), due to the low starting concentration and distribution within the original droplets of the control cassette 200, and will also contain the detritus of the various analyses used to confirm its identity. Given that this droplet can be manipulated into a diagnostic assay that seeks to confirm the presence of the test analyte 101 by virtue of amplification with primers 102 and 103, and that the carried over biochemistry droplet 902 harbours does not interfere with the performance of primers 102 and 103, this droplet 902 provides a single copy analyte mimic 201, amenable to amplification by primers 102 and 103.
[0126] An overview of the flow of droplets, mergers and rejections is given in
[0127] After thermal cycling, the droplet 902 is identified (within the black diamond) as being fluorescent due to reporter amplification, and is harvested as the final product of the process. Droplet 903, which remains non-fluorescent and therefore devoid of the reporter 202, is discarded to waste.
[0128] As a final demonstration of the effectiveness of this scheme, provision of a TaqMan probe designed to hybridise to the control mimic 201 sequence and biochemistry to amplify this region (primers 102 and 103) can be used to distinguish droplet 902 from droplets 903: only species 902 should support a positive TaqMan reaction, with the fluorescent emission of the TaqMan reaction being by necessity in a different channel to that of the dye already used to confirm the presence of the reporter 202 (SybrGreen for example). This confirmatory test is depicted in
[0129] During manufacture, no TaqMan assay will be performed to confirm the presence of control mimic 201; the control mimic 201 will remain as a single molecule. Modulating the physical nature of droplet 902 is desirable in order to enable it to be manipulated, such that its contents are easily introduced into a diagnostic assay.
[0130] Further possible manipulations might include desiccation of the droplets to form rehydrateable pellets, with each pellet containing just a single representation of the analyte mimic 201. It might be beneficial to remove certain (or all) of the detritus of previous analyses from the final desired single molecule droplet, in case these components interfere with the ultimate molecular diagnostic test, although in preferred embodiments, these carry-over components are tolerated in the final diagnostic assay and need not be removed.
[0131] In other embodiments, the single copy nucleic acid could be bound to a solid support (rather than being simply adsorbed); for example, the solid support could be a polystyrene bead, a derivatised glass surface, etc. This may allow alternative uses of the nucleic acid, such as acting as a nucleic acid probe. In certain embodiments of the invention, such solid supports may be useful in isolating a single copy of another nucleic acid to which the single copy nucleic acid may hybridise. The single copy nucleic acid can act as a molecular “hook” or “fishing rod” and placed in a reaction mix containing a desired target; the target will hybridise to the hook, while the solid support allows the hybridised nucleic acids to be manipulated thereafter. Further details of the use of a single copy nucleic acid as a “fishing rod” are described elsewhere in this document, with reference to
[0132] The diagnostic assay could be performed in a number of reaction vessels (as opposed to droplet-based water-in-oil), but is disclosed here as being a PCR amplification based system: the assay would require the combination of extracted test sample (Template DNA), the biochemistry required to amplify the test analyte 101, and the analyte mimic (in one of at least three formats disclosed here). This combination is represented diagrammatically in
[0133] Droplet 902, ‘modified’ droplet 1102 or matrix 1104 will each harbour a single copy of the analyte mimic 201. When one (and only one) of these species is combined with test template 1201 and the biochemistry 1202 to support amplification of the test analyte 101/analyte mimic 201, the reaction vessel 1203 will be capable of simultaneous amplification of the test analyte 101 (grey hatched box) and the analyte mimic 201 (black box). Subsequent detection/discrimination of the amplified species by any suitable means allows the anticipated positive control analyte mimic 201 to inform the significance of a positive or negative result generated from the test analyte 101.
[0134] After amplification within the vessel 1203 by, for example, thermal cycling, the vessel may contain multiple copies of the test analyte 101 and the analyte mimic 201. These amplicon species will have common sequence at their terminal ends (by virtue of having both been generated through the extension of primers 102 and 103), but will have distinct core sequences.
[0135] Irrespective of the presence or absence of the test analyte 101 in the vessel 1203, the analyte mimic 201 is confidently present and should be amplified to give multiple copies in the vessel 1301. If the test analyte 101 is also present this will amplify, but if it is absent, it will of course fail to amplify. The comparison of the number of copies of the analyte mimic 201 amplicon and the test analyte 101 may give some impression of the relative abundance of the test analyte 101 in the test sample, but this comparison will be at best semi-quantitative.
[0136]
[0137] Enhancements of the system are envisaged that allow the system to be used to control for more than one test analyte in a multiplex molecular diagnostic test analysis.
[0138] Total digestion of this cassette 1400 at both the susceptible sites 203 is necessary to demonstrate that duplication of one or other or both of the analyte mimics 201 and 1401 has been prevented. Detection of an intact susceptible site 203 post-inactivation (by amplification across the site 203) cannot reveal which of the susceptible sites 203 has remained intact since they are identical. Regardless, an intact site 203 will result in that droplet being discarded. It may be that in certain embodiments only the susceptible site 203 located between the analyte mimic 1401 and the reporter 202 is absolutely required to prevent duplication of the analyte mimics during analysis of the reporter 202 (and hence the two analyte mimics 201, 1401 are not separated by a susceptible site, and will not be separated in the final droplet). However, for certainty, and to prevent confusion of amplicons where there are primers amplifying both analyte mimics, it is preferable to position a susceptible site 203 between the analyte mimics 201 and 1401.
[0139] A further enhancement of the system (shown in
[0140] Comparison of the fixed-point (as opposed to end-point) responses from different representations of droplet 401 will enable the distribution of responses to be considered, and the potential for a secondary separation of the response to eliminate any overlap in the response, where the separation of the highest responses, together with any in a grey area, can be eliminated.
[0141] A final enhancement of the system, shown in
[0142] The above detailed description demonstrates the flexibility and reliability of the invention to provide a system that can be used to provide a single copy of a mimic of the analyte in a format that enables the detection of the analyte to be assessed.
[0143] Although the illustrated embodiments have been described from the perspective of preparation of droplets, the use of droplets per se may not be required, as there exist methods of carrying out ‘digital PCR’ that rely on very small reaction chambers (nanolitre wells; eg Wafergen or Life Technologies QuantStudio DX) to which additional components could be sequentially added. For example, after the Poisson distribution of the original diluted control cassette 200 into the wells of a digital PCR chip, the susceptible site 203 inactivation biochemistry, the susceptible site 203 amplification biochemistry and the reporter 202 amplification biochemistry could be introduced to these wells. However, this is not preferred, as it is envisaged that recovery of the entire contents of the well after identification of those wells that contained a single copy of the control cassette could be very difficult. However, this approach may be useful where such recovery is not needed and further reactions performed in the same well. This sequential addition scheme may also be used to demonstrate the serial biochemistries are preforming as anticipated.
[0144] Another enhancement that might be desirable is that rather than assessing the degree of amplification of the reporter 202 sequence after a limited and quantifiable level of amplification, the individual droplets would most beneficially be amplified in a device which allowed massively parallel amplification and simultaneous real time PCR assessment, perhaps allowing more reliable quantification of the presence of the reporter 202 region.
[0145] As noted above, single copy nucleic acid sequences may be attached to beads or other solid supports in order to be used as molecular “hooks” or “fishing rods”. The following section describes this in more detail.
CONTROLLED COPY NUMBERS AS SINGLE MOLECULE CAPTURE VEHICLES
[0146] Beyond the use as a sensitive control system, one attractive application of the ability to reliably isolate a single molecule of nucleic acid is the potential to use this (in its single stranded form) as a species-specific molecular ‘fishing hook’. If linked to a solid surface, for example a polymer bead, then the specific hybridization capacity of the single linked DNA sequence enables recovery of a second single DNA strand (bearing the complementary sequence) from a solution containing a plurality of DNA strands. The plurality of DNA strands may all bear the complementary sequence, or only a proportion of the plurality of DNA strands may bear the complementary sequence. Ideally, in order to maximize the potential for the bead-linked single molecule to encounter and capture a complementary DNA strand within a reasonable timeframe, the number of DNA strands in solution bearing the complementary sequence will be in massive excess compared to the single molecule linked to the bead. The complementary sequences can be designed to most advantageously allow high specificity, with little or no potential for cross-talk hybridization of the bead-linked capture sequence with any other DNA strand sequence that may exist within the plurality of DNA strands. Once captured, the DNA strand recovered from solution can then be manipulated as a non-covalently attached ‘passenger’ on the polymer bead.
[0147] The above system may be advantageously employed to seed the geographically-separated, clonal amplification of individual molecules (second single DNA strands) as the initiating step of an NGS reaction. Having delivered just one non-covalently attached passenger DNA strand to a specific geographical region, multiple (clonal) copies of identical sequence can be generated. Synchronous NGS interrogation of the bases of these copies maximizes the signal output generated at each individual base position of the DNA strand.
[0148] Delivery of a single copy of the DNA strand to be sequenced is ensured by virtue of the capacity to capture just one DNA strand onto each bead, and the geometric capacity to accommodate just a single polymer bead at each discrete geographic location. For example, after capture of a single DNA strand from a plurality of DNA strands, the bead harbouring the DNA strand may be delivered to a discrete well structure, where the well has dimensions sufficiently large to accommodate a single bead, but insufficiently large to accommodate more than one bead. This geometric limitation ensures that there will only ever be a single bead loaded to the well, and it follows only a single passenger DNA strand loaded per well. For example, such a system is described in WO2014/013263, to DNA Electronics Ltd. Reference is made particularly to page 15, line 25 to page 16, line 12; and to page 39, line 6 to page 43, line 15. These passages describe methods and systems for obtaining a limited number of beads per well, and for using such beads complexed with nucleic acids for sequence amplification and/or sequencing.
[0149] It is possible that non-specific adsorption of DNA stands from the plurality of DNA strands in solution onto the surface of the polymer bead may confound the aspiration to ensure just a single strand (passenger) of DNA delivered to the geographically distinct regions of the sequencing system. This potential may be minimized through stringent post-capture washing and/or selection of polymer bead/surface chemistry treatments that limit any such non-specific adsorption. Furthermore, it is possible to discriminate between legitimately captured DNA stands (hybridization captured) and non-specifically adsorbed sequences, by virtue of only the former having a potentially DNA polymerase-extendable, hybridized 3′0H end; the capture sequence attached to the bead may be specifically designed to include a non-hybridizable (i.e. not employed in capture) element that, upon extension of a legitimately captured DNA strand 3′ end, will drive DNA polymerase mediated incorporation of the complementary sequence to the non-hybridizable element (not employed in capture) onto the 3′ end of the captured DNA strand. The legitimately captured DNA strand is unique in its capacity to incorporate this additional sequence, which can subsequently be utilized as an essential element of the clonal amplification strategy (below).
[0150] The system detailed above is now illuminated using diagrammatic representations.
[0151]
[0152] Once the Control Cassette is cleaved by restriction digestion (for example) and confirmed through the amplification of Reporter 202, the single molecule component 201 released may resemble the DNA element presented in
[0153]
[0154]
[0155]
[0156]
[0157]