DIFFERENTIAL DISSOCIATION AND MELTING CURVE PEAK DETECTION
20180314788 ยท 2018-11-01
Assignee
Inventors
Cpc classification
C12Q2537/165
CHEMISTRY; METALLURGY
C12Q2537/165
CHEMISTRY; METALLURGY
G16B99/00
PHYSICS
G16B45/00
PHYSICS
G16B40/10
PHYSICS
International classification
Abstract
Systems and methods are provided for processing a melting or dissociation curve of a DNA or other sample, for example, during PCR processing. In some embodiments, detection of the melting point and melting curve behavior can be enhanced by taking a derivative of the curve, and detecting peaks in the differential dissociation curve. In some embodiments, the derivative operation can comprise the use of edge-processing, or other detection algorithms. In some embodiments, the dissociation analysis can comprise removing low-frequency (or pedestal) components of the differential dissociation curve. In some embodiments, the differential dissociation curve can exhibit a smoothed or more regular appearance than the raw detected data.
Claims
1. A method for determining the differential dissociation curve of at least one sample, comprising: interpolating emission measurement data of the at least one sample taken at uneven temperature intervals into data at equally-spaced temperature intervals; and generating a differential dissociation curve by generating a derivative of the emission measurement data.
2. The method of claim 1, further comprising detecting at least one peak in the differential dissociation curve.
3. The method of claim 1, further comprising modifying the differential dissociation curve.
4. The method of claim 1, further comprising generating a power spectrum of the interpolated emission measurement data.
5. The method of claim 1, wherein the derivative comprises a first-order derivative.
6. The method of claim 1, further comprising performing a frequency-domain transform on the interpolated emission measurement data.
7. The method of claim 1, wherein the at least one sample comprises a plurality of samples each having associated emission measurement data.
8. A system for determining the differential dissociation curve of at least one sample, comprising: an input unit for receiving emission data of at least one sample taken at uneven temperature intervals; and a processor, communicating with the input unit, the processor being configure to interpolate the emission measurement data of the at least one sample taken at uneven temperature intervals into data at equally-spaced temperature intervals, and generate differential dissociation curve by generating a derivative of the emission measurement data.
9. The system of claim 8, wherein the processor is further configured to detect at least one peak in the differential dissociation curve.
10. The system of claim 8, wherein the processor is further configured to modify the differential dissociation curve.
11. The system of claim 10, wherein the modifying comprises removing emission measurement data associated with peaks that fall below a peak detection threshold.
12. A differential dissociation curve generated for at least one sample, the differential dissociation curve being generated by a method comprising: interpolating emission measurement data of the at least one sample taken at uneven temperature intervals into data at equally-spaced temperature intervals; and generating a differential dissociation curve by generating a derivative of the emission measurement data.
13. The differential dissociation curve of claim 12, wherein the method further comprises generating a power spectrum of the interpolated emission measurement data.
14. The differential dissociation curve of claim 13, wherein generating a power spectrum comprises generating a normalized variance of the power spectrum and removing the emission measurement data of the at least one sample when the normalized variance of the power spectrum exceeds a predetermined threshold.
15. A computer-readable medium, the computer-readable medium being readable to execute a method for determining the differential dissociation curve of at least one sample, the method comprising: interpolating emission measurement data of the at least one sample taken at uneven temperature intervals into data at equally-spaced temperature intervals; and generating a differential dissociation curve by generating a derivative of the emission measurement data.
16. The computer-readable medium of claim 15, wherein the method further comprises modifying the differential dissociation curve.
17. The computer-readable medium of claim 15, wherein the method further comprises generating a power spectrum of the interpolated emission measurement data.
18. The computer-readable medium of claim 15, wherein the derivative comprises a first-order derivative.
19. The computer-readable medium of claim 15, wherein the method further comprises performing a frequency-domain transform on the interpolated emission measurement data.
20. The computer-readable medium of claim 15, wherein the differential dissociation curve is generated in connection with a polymerase chain reaction power.
Description
FIGURES
[0010]
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
DESCRIPTION
[0019] According to various embodiments of the present teachings, systems and methods are provided that operate on raw dissociation data plots to generate a first-order or other derivative plot of the original emission data. According to various embodiments, the emission data can comprise a graph, chart, or other representation of the dye emission of one or more fluorescently-labeled samples, such as DNA samples, as a function of temperature. According to various embodiments, the raw emission data of the dissociation/melting curve or other data can be pre-processed or otherwise conditioned to improve the downstream analysis. According to various embodiments, for example, the analysis can comprise interpolating the measurement data taken at unevenly-spaced temperature intervals into data samples at equally-spaced temperature intervals. According to various embodiments, an equal spacing interpolation, or other resampling or oversampling step, can improve the mathematical integrity or capability of the subsequent calculations, including, for example, to permit Fourier or other frequency-domain transformations. According to various embodiments, the original raw or source data can comprise data sample at irregular temperature intervals, since the rate of change in temperature can vary at different points in the PCR or other cycle or process. According to various embodiments, resampling, oversampling, interpolating, or otherwise processing the fluorescent signal-versus-temperature graph to produce data points at equally-spaced temperature intervals can provide modified data which is capable of being subjected to frequency domain analysis. In some embodiments, raw dissociation data that is interpolated, oversampled, or resampled to produce data points at equally-spaced temperature intervals can be subjected to a Fourier transform, to develop a frequency-domain or spectral representation of the original melting curve, or of processed melting curves derived from the original melting curve. The frequency transform or operator can comprise a discrete-time Fourier transform, a continuous Fourier transform, a Fast Fourier Transform, a wavelet transform, or other transform, algorithm, or operator.
[0020] According to various embodiments, interpolation processing to produce equally-spaced data points along the temperature axis can comprise processing algorithms shown in the flow diagram of
[0021] According to various embodiments, further processing or data containing can be performed on the raw or interpolated dissociation curve or related data. For example, the dissociation analysis can comprise steps that detect and identify noisy data sample, to eliminate the effects of those sources on further analysis. Illustrations of dissociation curves exhibiting different good, marginal, and noisy detected patterns of melt curve behavior are shown, for example, in
[0022] Computed power spectra of a noisy, good, and marginal well or sample are shown in the upper-right graph of
[0023] According to various embodiments, the power spectrum of an interpolated well or sample series can be quantitatively processed to identify noisy wells or samples. For example, a normalized variance of the power spectrum curve of the sample series can be computed. In some embodiments, if the normalized variance of the dissociation curve is about a defined noise discrimination threshold, the sample data can be classified as noise. According to various embodiments, the noise discrimination threshold can comprise a user-defined threshold. According to various embodiments, the noise discrimination threshold can comprise an automatically-generated threshold, for instance based on statistical measures. According to various embodiments, the noise discrimination threshold can comprise an empirically-derived threshold, for instance, an average threshold of known good wells or samples. In some embodiments, the normalized, rather than absolute, variance or other statistical measure can be used to accommodate data from different samples, for example, to process samples displaying different initial fluorescent intensities.
[0024] According to various embodiments, the analysis can comprise filtering the interpolated temperature data by a Gaussian kernel or other function. According to various embodiments, the filtered, interpolated data can be further filtered or processed by the derivative of the Gaussian kernel, or other derivative or other function. According to various embodiments, application of a derivative function, for instance a first-order derivative function, can produce a differential melt or dissociation curve, such as, for example, the curves shown in
[0025] According to various embodiments, the dissociation analysis can further comprise extrapolating data points at the beginning and at the end of the raw or interpolated dissociation curve, before the first derivative calculation. This can, for instance, improve the correctness or accuracy of the first derivative calculations at the beginning and at the end of the dissociation curve.
[0026] According to various embodiments, the dissociation analysis can comprise detecting and analyzing the peaks of the first derivative of the dissociation curve (i.e., the differential melting curve), that sit on top of a low-frequency pedestal or offset. According to various embodiments, the pedestal can designate very low frequency components of the differential melting curve. According to various embodiments, the analysis can comprise removing the pedestal or low-frequency components, and evaluating the heights of the modified differential melting curve peaks left after the pedestal or baseline is subtracted or otherwise compensated for. According to various embodiments, techniques for removing the pedestal can comprise the processing shown in the flow diagram illustrated in
[0027] According to various embodiments, the dissociation analysis can comprise ranking the detected, pedestal-removed peaks by their relative heights with respect to the tallest peak. According to various embodiments, the user can specify a fractional score as the peak detection threshold, and the analysis can comprise reporting those peaks that have a relative height above that reporting threshold. For example, the tallest peak can be given a fractional score of 100. If a fractional score peak detection threshold is set at 40, then only peaks above 40% of the tallest peak will be reported, and the lower height peaks will be regarded as noise. According to various embodiments, the peaks falling below the peak detection threshold can be removed or discarded. According to various embodiments, the peak detection threshold can be automatically computed, for example based on standard deviation measures on the peaks, or other metrics or measures. According to various embodiments, any of the raw detection data, normalized differential melting curves, or other data, charts, graphs, or information can be stored to, and/or displayed or presented to a user by, a computer, instrument, or other hardware or device.
[0028] According to various embodiments, the dissociation or melting curve analysis can take place during, or subsequent to, amplification, or in the absence of amplification. Furthermore, while various embodiments herein are described in connection with PCR, according to various embodiments, other methods of amplification can be compatible with differential dissociation or melting curve analysis according to the present teachings. Moreover, while reference is made to amplification, according to various embodiments, the differential dissociation/melting curve analysis of the present teachings can be performed on nucleic acid samples that have been obtained without amplification, or can be applied to other processes or chemistries. Furthermore, while description is made herein of analyzing DNA or fragments of DNA to determine melting points and other data, according to various embodiments, chemicals, substances, samples, or materials can be analyzed according to the present teachings.
[0029] According to various embodiments, different aspects of the differential dissociation/melting curve analysis of the present teachings can be applied to commercial systems and implementations, such as the Step One machine commercially available from Applied Biosystems, Foster City, Calif., and described, for example, a publication entitled Applied Biosystems Step One Real-Time PCR System Getting Started Guide, which publication is incorporated by reference in its entirety herein.
[0030] The differential dissociation/melting curve analysis according to various embodiments of the present teachings can be utilized in automated systems and techniques such as those described, for example, in the publication, by Mann et al., entitled Automated Validation of Polymerase Chain Reactions Using Amplicon Melting Curves, Proceedings of the Computational Systems Bioinformatics Conference, Aug. 8-11, 2005, Stanford, Calif. pp. 377-385, which publication is incorporated by reference in its entirety herein.
[0031] Various embodiments of the present teachings can be implemented, in whole or part, in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations thereof. Apparatus of the invention can be implemented in a computer program, software, code, or algorithm embodied in machine-readable media, such as electronic memory, CD-ROM or DVD discs, hard drives, or other storage device or media, for execution by a programmable processor. Various method steps according to the present teachings can be performed by a programmable processor executing a program of instructions to perform functions and processes according to the present teachings, by operating on input data and generating output. The present teachings can, for example, be implemented in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system or memory, at least one input device such as a keyboard and mouse, and at least one output device, such as, for example, a display or printer. Each computer programs, algorithm, software, or code can be implemented in a high-level procedural or object-oriented programming language, or in assembly, machine, or other low-level language if desired. According to various embodiments, the code or language can be a compiled, interpreted, or otherwise processed for execution.
[0032] Various processes, methods, techniques, and algorithms can be executed on processors that can include, by way of example, both general and special purpose microprocessors, such as, for example, general-purpose microprocessors such as those manufactured by Intel Corp. or AMD Inc., digital signal processors, programmable controllers, or other processors or devices. According to various embodiments, generally a processor will receive instructions and data from a read-only memory and/or a random access memory. According to various embodiments, a computer implementing one or more aspects of the present teachings can generally include one or more mass storage devices for storing data files, such as magnetic disks, such as internal hard disks and removable disks, magneto-optical disks, and CD-ROM DVD, Blu-Ray, or other optical disks or media. Memory or storage devices suitable for storing, encoding, or embodying computer program instructions or software and data can include, for instance, all forms of volatile and non-volatile memory, including for example semiconductor memory devices, such as random access memory, electronically programmable memory (EPROM), electronically erasable programmable memory, EEPROM, and flash memory devices, as well as magnetic disks such as internal hard disks and removable disks, magneto-optical disks, and optical disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs. According to various embodiments, processors, workstations, personal computers, storage arrays, servers, and other computer, information, or communication resources used to implement features of the present teachings can be networked or network-accessible.
[0033] Other embodiments will be apparent to those skilled in the art form consideration of the present specification and practice of the present teachings disclosed herein. For example, resources described in various embodiments as singular can, in embodiments, be implemented as multiple or distributed, and resources described in various embodiments as distributed can be combined. It is intended that the present specification and examples be considered as exemplary only.