Methods for the analysis of dissociation melt curve data
10176277 ยท 2019-01-08
Assignee
Inventors
Cpc classification
C12Q2537/165
CHEMISTRY; METALLURGY
G16B20/20
PHYSICS
C12Q2537/165
CHEMISTRY; METALLURGY
International classification
Abstract
Methods are provided that operate on raw dissociation data and dissociation curves to generate calibrations of the detected data and to further improve analysis of the data. The data can be taken from each support region of a multi-region platform, for example, from each well of a multi-well plate. Each support region can be loaded with portions of the same sample. In some embodiments, a dissociation curve correction can be calibrated for the sample, prior to a run of an experiment using such sample. In some embodiments, a method is provided for generating a melting transition region of dissociation curves that show the melting characteristics of the sample. In some embodiments, dye temperature dependence correction can be performed on the dissociation curve data to further improve analysis. In some embodiments, a feature vector can be derived from the melt data, and the feature vector can be used to further improve genotyping analysis of the dissociation curves.
Claims
1. A method of a biological analysis system for analyzing melt curve data, wherein the biological analysis system includes a processor and a display, the method comprising: generating, by the processor, a calibration set of melt curve data for a calibration nucleic acid sample, wherein the calibration set is generated from subjecting the calibration nucleic acid sample deposited in a first plurality of support regions to conditions sufficient to denature nucleic acid replicates in the calibration nucleic acid sample; generating, by the processor, an experimental set of melt curve data for at least one test nucleic acid sample, wherein the experimental set is generated from subjecting the at least one test nucleic acid sample deposited in a second plurality of support regions to conditions sufficient to denature nucleic acids in the at least one test nucleic acid sample; correcting, by the processor, assay system noise in the experimental set of melt curve data using the calibration set of melt curve data, wherein the assay system noise is caused by at least thermal non-uniformity between support regions from the first plurality of support regions and the second plurality of support regions, and wherein correcting the experimental set of melt curve data for assay system noise using the calibration set of melt curve data further comprises: calculating, for the calibration set of melt curve data, a correction value comprising a difference between data from at least two support regions among the first plurality of support regions in which the calibration nucleic acid sample is deposited, and applying, for the experimental set of melt curve data, the calculated correction value to data from at least one of the second plurality of support regions in which the at least one test nucleic acid sample is deposited; scaling the corrected experimental set of melt curve data over an estimated temperature range; fitting the scaled corrected experimental set of melt curve data to an estimated asymptote for a low temperature region of a melting region of the melt curve data; clustering the experimental set of melt curve data based on the steps of correcting, scaling and fitting; and displaying, on the display, the corrected experimental set of melt curve data to a user.
2. The method of claim 1, wherein the method further comprises creating difference data from the experimental melt curve data for a plurality of nucleic acid samples, wherein the experimental melt curve data for one of the plurality of nucleic acid samples is selected as a reference, and the experimental melt curve data for the remaining nucleic acid samples are subtracted from the reference nucleic acid sample to create the difference melt curve data.
3. The method of claim 2, wherein feature vectors for one or more of the plurality of samples are generated from the difference melt curve data.
4. The method of claim 2, further comprising genotyping the plurality of nucleic acid samples based on the generated difference melt curve data.
5. The method of claim 3, wherein clustering the experimental set of melt curve data further comprises: clustering the experimental melt curve data based on the difference melt curve data.
6. The method of claim 3, wherein the feature vector for at least one sample from among the plurality of nucleic acid samples comprises one or more of a delta max, a melt temperature at delta max, and a sum of an absolute difference, wherein the delta max comprises a difference between a peak value for the at least one nucleic acid sample and the reference nucleic acid sample, the melt temperature at delta max comprises the melt temperature at the peak value, and the sum of the absolute difference comprises the area under the sample peak.
7. The method of claim 1, wherein the correcting is based on a derivative form of the experimental melt curve data.
8. The method of claim 1, wherein the calibration set of melt curve data is generated based on detected changes to an indicator exhibited by the plurality of support regions in which the calibration nucleic acid sample is deposited, the detected changes occurring during the subjecting to conditions sufficient to denature the nucleic acid replicates in the calibration nucleic acid sample.
9. A biological analysis system for analyzing melt curve data, the biological analysis system comprising: a processor configured to: generate a calibration set of melt curve data for a calibration nucleic acid sample, wherein the calibration set is generated from subjecting the calibration nucleic acid sample deposited in a first plurality of support regions to conditions sufficient to denature nucleic acid replicates in the calibration nucleic acid sample; generate an experimental set of melt curve data for at least one test nucleic acid sample, wherein the experimental set is generated from subjecting the at least one test nucleic acid sample deposited in a second plurality of support regions to conditions sufficient to denature nucleic acids in the at least one test nucleic acid sample; correct assay system noise in the experimental set of melt curve data using the calibration set of melt curve data, wherein the assay system noise is caused by at least thermal non-uniformity between support regions from the first plurality of support regions and the second plurality of support regions, and wherein correcting the experimental set of melt curve data for assay system noise using the calibration set of melt curve data further comprises: calculating, for the calibration set of melt curve data, a correction value comprising a difference between data from at least two support regions among the first plurality of support regions in which the calibration nucleic acid sample is deposited, and applying, for the experimental set of melt curve data, the calculated correction value to data from at least one of the second plurality of support regions in which the at least one test nucleic acid sample is deposited; scale the corrected experimental set of melt curve data over an estimated temperature range; fit the scaled corrected experimental set of melt curve data to an estimated asymptote for a low temperature region of a melting region of the melt curve data; cluster the experimental set of melt curve data based on the steps of correcting, scaling and fitting; and a display configured to: display the corrected experimental set of melt curve data to a user.
10. The biological analysis system of claim 9, wherein the processor is further configured to create difference melt curve data from the experimental melt curve data for a plurality of nucleic acid samples, wherein the experimental melt curve data for one of the plurality of nucleic acid samples is selected as a reference, and the experimental melt curve data for the remaining nucleic acid samples are subtracted from the reference nucleic acid sample to create the difference melt curve data.
11. The biological analysis system of claim 10, wherein feature vectors are generated from the difference melt curve data.
12. The system of claim 10, wherein the processor is configured to genotype the plurality of nucleic acid samples based on the generated difference melt curve data.
13. The biological analysis system of claim 11, wherein clustering the experimental set of melt curve data comprises: clustering the experimental melt curve data based on the difference melt curve data.
14. The system of claim 11, wherein the feature vector for at least one sample from among the plurality of nucleic acid samples comprises one or more of a delta max, a melt temperature at delta max, and a sum of an absolute difference, wherein the delta max comprises a difference between a peak value for the at least one nucleic acid sample and the reference nucleic acid sample, the melt temperature at delta max comprises the melt temperature at the peak value, and the sum of the absolute difference comprises the area under the sample peak.
15. The biological analysis system of claim 9, wherein the processor is configured to correct the experimental set of melt curve data for assay system noise using a derivative form of the experimental melt curve data.
16. The system of claim 9, wherein the calibration set of melt curve data is generated based on detected changes to an indicator exhibited by the plurality of support regions in which the calibration nucleic acid sample is deposited, the detected changes occurring during the subjecting to conditions sufficient to denature the nucleic acid replicates in the calibration nucleic acid sample.
17. A method of a biological analysis system for analyzing melt curve data, wherein the biological analysis system includes a processor and a display, the method comprising: generating a calibration set of melt curve data for a calibration nucleic acid sample, wherein the calibration set of melt curve data comprises fluorescence measurements collected by subjecting the calibration nucleic acid sample deposited in a first plurality of support regions to conditions sufficient to denature nucleic acid replicates in the calibration nucleic acid sample; generating an experimental set of melt curve data for at least one test nucleic acid sample, wherein the experimental set of melt curve data comprises fluorescence measurements collected by subjecting the at least one test nucleic acid sample deposited in a second plurality of support regions to conditions sufficient to denature nucleic acids in the at least one test nucleic acid sample; correcting assay system noise in the experimental set of melt curve data using a correction value calculated from the calibration set of melt curve data, wherein the assay system noise is caused by at least thermal non-uniformity between support regions from the first plurality of support regions and the second plurality of support regions; scaling the corrected experimental set of melt curve data over an estimated temperature range; fitting the scaled experimental set of melt curve data to an estimated asymptote for a low temperature region of a melting region of the melt curve data; clustering the experimental set of melt curve data based on the steps of correcting, scaling and fitting; and displaying, on the display, the corrected experimental set of melt curve data to a user.
18. The method of claim 17, wherein correcting the experimental set of melt curve data comprises: calculating, for the calibration set of melt curve data, the correction value, the correction value comprising a difference between data from at least two support regions from among the first plurality of support regions in which the calibration nucleic acid sample is deposited; and applying, for the experimental set of melt curve data, the calculated correction value to data from at least one of the second plurality of support regions in which the at least one test nucleic acid sample is deposited.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
DETAILED DESCRIPTION
(13) What is disclosed herein are various embodiments of methods for analyzing dissociation melt curve data, or as it is used throughout herein, melt curve data (MCD), where the differences in the melting points between various samples are small. For example, various embodiments of methods for analyzing dissociation melt curve data address samples sets where the differences in melting points may vary by only fractions of degrees. According to various embodiments of methods for the analysis of dissociation melt curve data, a calibration set of melt curve data may be used as a basis for correcting experimental sets of melt curve data, for example, with respect to assay system variance or noise. According to various embodiments, the melt curve data may be processed using curve-fitting techniques. In various embodiments of methods for analyzing dissociation melt curve data, different attributes of dissociation melt curve data, such those generated using a difference plot, may be used as the basis of cluster analysis of experimental melt curve data.
(14) One known approach for DNA melting curve analysis utilizes fluorescence monitoring with intercalating double-strand-DNA specific dyes, such as for example, SYBR Green. The SYBR Green dye attaches to the DNA as double-stranded DNA amplification products are formed, and continues to bind to the DNA as long as the DNA remains double-stranded. When melting temperatures are reached, the denaturation or melting of the double-stranded DNA is indicated and can be observed by a significant reduction in fluorescence, as SYBR Green dissociates from the melted strand. The detected dye fluorescence intensity typically decreases about 1000-fold during the melting process. Plotting fluorescence as a function of temperature as the sample heats through the dissociation temperature produces a DNA melting curve. The shape and position of the DNA melting curve is a function of the DNA sequence, length, and GC/AT content.
(15) Further, various approaches for validating the integrity of PCR reactions rely on melting curve analysis to discriminate artifact from real amplification product. Melting curve analysis can also be used to differentiate the various products of multiplexed DNA amplification, and to extend the dynamic range of quantitative PCR. DNA melting curve analysis is also used as a powerful tool for optimizing PCR thermal cycling conditions, because the point at which DNA fragments or other material melts and separate can be more accurately pinpointed.
(16) In some embodiments, dissociation curve analysis methods calculate and display the first derivative of multi-component dye intensity data versus temperature, i.e., the differential melting curve. The melting temperature, T.sub.m, at a peak of the differential melting curve can be used to characterize the product of a biochemical reaction. A sample with multiple amplification products will show multiple peaks in the differential melt curve. In some embodiments, melting curve detection involves very precise measurements of temperature and allows for the identification of a sample using the melting temperature, T.sub.m. The determination of T.sub.m using various embodiments of methods for differential dissociation and melting curve detection is disclosed in related in U.S. patent application Ser. No. 12/020,369, which is incorporated herein by reference in its entirety.
(17) According to various embodiments as shown in
(18) In various embodiments, replicate aliquots of a sample can be loaded into the plate to determine the melting temperature, Tm, of the each well. Ideally, these temperatures should be identical throughout the wells, given that the samples are replicates. In practice, variations in the analysis system, for example, non-uniformity of heating elements of the analysis system, create variations in the set of replicates. According to various embodiments of methods for the analysis of dissociation melt curve data, such melt curve data using replicates may be used as a calibration set of data. In
(19) According to various embodiments of methods for the analysis of dissociation melt curve data, as depicted in step 30 of
(20) According to various embodiments as indicated in
(21) For example, in
(22) According to various embodiments of methods for the analysis of dissociation melt curve data as depicted in step 40 of
(23) According to various embodiments of step 40 of
(24) In various embodiments of methods for the analysis of dissociation melt curve data, in addition to the curve-fitting of step 40 of
(25) In
(26) For example a temperature point of about 70.0 C. may be selected, with an interval of plus or minus 0.5 C. around the temperature point. From this narrow linear region, a line, such as line B in
(27) Step 40 and step 50 in
(28) As previously mentioned, as depicted in step 30 of
(29) As previously stated, the calibration melt curve data set is generated from replicates of the same sample dispensed in support regions of a sample support device, the variations in the calibration data are due to the inherent assay system noise. Accordingly, the information in the calibration melt curve data can be used to correct the experimental melt curve data for system noise. For example, a reference sample region in the EMCD may be selected. According to various embodiments, the frequency plot of the intensities of the sample regions, such as a well, in a sample support device may be determined, and a sample region within two standard deviations of the peak intensity of the EMCD may be selected as a reference sample region. In various embodiments, the reference sample region of the EMCD corresponding to the greatest intensity may be selected, however any sample region within two standard deviations would not be an outlier; i.e. either too dim or to bright, for the purpose of selecting a reference sample region, such as a well. According to various embodiments for correcting system noise as indicated in step 20 of
(30) A correction as described above for step 20 of
(31) According to various embodiments of methods for the analysis of dissociation melt curve data, the experimental melt curve data can be further analyzed to detect true differences in data that are different by only fraction of a degree. According to various embodiments, in step 150 of
(32) In the table of
(33) According to various embodiments of methods for the analysis of dissociation melt curve data as indicated by step 160 of
(34) Likewise, the block of data indicated with hatching; samples 5-10, all have melting temperatures of 84.5 C. Though most of the samples may be further discriminated by using Delta Max, samples 8 and 9 are only distinguished using the SAD feature vector. According to various embodiments of methods for the analysis of dissociation melt curve data in step 160 of
(35) While the principles of this invention have been described in connection with specific embodiments of methods for analyzing dissociation melt curve data, it should be understood clearly that these descriptions are made only by way of example and are not intended to limit the scope of the invention. What has been disclosed herein has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit what is disclosed to the precise forms described. Many modifications and variations will be apparent to the practitioner skilled in the art. What is disclosed was chosen and described in order to best explain the principles and practical application of the disclosed embodiments of the art described, thereby enabling others skilled in the art to understand the various embodiments and various modifications that are suited to the particular use contemplated. It is intended that the scope of what is disclosed be defined by the following claims and their equivalence.