Robust Detection Of Variablility In Multiple Sets Of Data
20190228053 ยท 2019-07-25
Inventors
Cpc classification
G01D1/14
PHYSICS
G01N21/6428
PHYSICS
International classification
G06F17/18
PHYSICS
Abstract
The present teachings comprise systems and methods for calibrating the background or baseline signal in a PCR or other reaction. The background signal derived from detected emissions of sample wells can be subjected to a normalized statistical metric, and be compared to a threshold or other standard to discard outlier cycles or other extraneous data. According to various embodiments, a relative standard deviation (relativeSTD) for the background component can be generated by dividing the standard deviation by the median of differences across all wells, where the difference is defined as the difference between maximum and minimum pixel values of a well. The relativeSTD as a metric is not sensitive to machine-dependent variations in absolute signal output that can be caused by different gain settings, different LED draw currents, different optical paths, or other instrumental variations. More accurate background characterization can be achieved.
Claims
1. A method of calibrating a polymerase chain reaction (PCR) instrument, comprising: receiving reaction emission data comprising multiple sets of data; generating a relative standard deviation (relativeSTD) representing a normalized measure of variation in the multiple sets of data, by applying the equation:
2. The method of claim 1, wherein the emission data comprises emission data detected from a biological sample.
3. (canceled)
4. The method of claim 1, wherein reaction comprises a reaction with a nucleic acid.
5. The method of claim 1, wherein the receiving emission data comprises receiving emission data from a plurality of sample wells.
6. The method of claim 1, wherein the emission data comprises data detected using a plurality of filters, and the generating a relativeSTD comprises generating a relativeSTD on a per-filter basis.
7. A system for calibrating a polymerase chain reaction (PCR) instrument, comprising: an input unit configured to receive reaction emission data comprising multiple sets of data; and a processor unit, the processor unit being configured to generate a relative standard deviation (relativeSTD) representing a normalized measure of variation in the multiple sets of data, by applying the equation:
8. The system of claim 7, wherein the emission data comprises emission data detected from labeled nucleic acid samples.
9. (canceled)
10. The system of claim 7, wherein the input unit comprises a plurality of different filters configured to filter the emission data from the reaction.
11-15. (canceled)
16. A non-transitory computer-readable medium, the computer-readable medium being readable to execute a method of calibrating a polymerase chain reaction (PCR) instrument, the method comprising: receiving reaction emission data comprising multiple sets of data; generating a relative standard deviation (relativeSTD) representing a normalized measure of variation in the multiple sets of data, by applying the equation:
17. The computer-readable medium of claim 16, wherein the emission data comprises emission data detected from labeled nucleic acid samples.
18. (canceled)
19. The computer-readable medium of claim 16, wherein the reaction occurs within a plurality of sample wells.
20. The computer-readable medium of claim 16, wherein the emission data comprises data detected using a plurality of filters, and the generating a relativeSTD comprises generating a relativeSTD on a per-filter basis.
Description
FIGURES
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
DESCRIPTION
[0022] According to various embodiments of the present teachings, methods for background calibration and outlier detection can comprise steps for identifying an outlier cycle or outlier cycles for each filter and/or each well of a sample plate used in an amplification or other reaction. Systems for carrying out calibration analysis are also provided. According to various embodiments, the calibration systems and methods can be implemented in or applied to PCR scanning systems, in which a read head containing a photodetector, for example, a photodiode or other detector, can read the fluorescent output or other output from a single well or other location, then travel to a next well or location to read the spectral dye or other output at that location, and step or repeat across a plate or other container or platform to take spectra from the entire group of sample wells, one at a time. According to various embodiments, the calibration systems and methods can be implemented in or applied to PCR imaging systems in which a photodetector, for example, a CCD, CID or other detector, images an entire plate, and all sample wells contained in that plate, at one time or at substantially one time, for instance taking a spectral image of all 96 wells of a standard microtiter plate. According to various embodiments, each well or other container or location in a plate or other platform can contain samples, for example, samples of DNA fragments or other material, to which one or more spectrally distinct dye is attached for detection and analysis.
[0023] According to various embodiments, the calibration can comprise identifying a background or baseline signal detected before or during the initial stages of a PCR run, performed using a PCR system 102 reading samples contained in a plate 104, such as illustrated in
[0024]
[0025] According to various embodiments, the calibration analysis can comprise identifying entire scanning or imaging cycles within an amplification run or runs that deviate from statistically expected ranges, and that therefore can be removed to increase the accuracy of the background characterization. According to various embodiments, accurate background characterization can contribute to increased accuracy in the readings of a PCR or other analytic run.
[0026] According to various embodiments, and as, for instance, illustrated in
[0027] According to various embodiments, background calibration can comprise running a PCR or other detection cycle on some or all wells of a plate 104, or other container or support, having sample wells 106 that are loaded only with buffer. The fluorescent signal emitted from the plate itself, optical components, or other sources of residual emission, can then be detected. According to various embodiments, plate 104 can comprise 48, 96, 384 or more wells 106 arranged in a standardized, rectangular format, or those or other numbers of wells arranged in another configuration.
[0028] Systems and methods according to embodiments of the present teachings can identify an outlier cycle from a set of PCR cycles or other detection cycles. According to various embodiments, an outlier cycle can be detected using a thresholding operation, for example, determining if the captured or detected background signal is a predetermined percentage lower or higher than a computed mean signal. In various embodiments, the cycle can be labeled an outlier if the signal is at least two standard deviations away from the mean signal. According to various embodiments, if the cycle can be identified as an outlier for all filters and all wells, this cycle can be removed from further quality checking or other processing.
[0029] According to various embodiments, the background calibration analysis can comprise determining a measure of fluorescent or other baseline signal that is not sensitive to individual well, instrument, or other, variations, which variations would have an effect on metrics that are not made relative or otherwise normalized. According to various embodiments, for example, the detection system of a PCR instrument or other instrument can include an analog-to-digital converter (ADC) to convert analog optical intensity signals to digital quantities. According to various embodiments, the detection system can incorporate an amplification circuit, for example, comprising an operational amplifier (opamp) or other circuit or component, inserted before the ADC, to regulate signal levels. According to various embodiments, the opamp or other amplification circuit can be supplied with an offset or bias, to maintain a positive signal after A/D conversion is complete. According to various embodiments, the offset or bias setting of such a detection system can be set by a resistor connected to the opamp's feedback or other circuit. As an electronic component, however, resistors can suffer from a significant degree of variability in their resistance (ohm value) rating, and depending on resistor type and cost, achieve accuracy or consistency only on the order of 5-20%, or more or less. Two machines of identical type, even if made by the same manufacturer, can therefore exhibit a variance in resistor-based offset of roughly a factor of 2, or more or less. Conducting accurate machine-to-machine PCR calibrations or other calibrations or tests can therefore be complicated or affected by electronic tolerances, such as detector offset, or other gain, sensitivity, bias, or offset settings that can not be readily determined.
[0030] According to various embodiments, for further example, the digital output of detected intensity can be processed to consist of the pixel count for a subject well or other feature, plus a predetermined or other conversion offset to be added to the raw pixel count produced by the ADC. According to various embodiments, depending on factors including initial settings, reaction chemistries and concentrations, detector and filter efficiencies, and other factors, the digital conversion offset can, in some cases, be larger than the background fluorescence or other background signal being captured and calibrated. According to various embodiments, the PCR instrument or other instrument can provide an overall sensitivity or gain setting, for example, to more readily detect comparatively faint samples or other objects. According to various embodiments, comparison of outputs from two different PCR or other instruments set to two different overall sensitivity or gain settings can therefore be difficult. According to various embodiments, the amount of current delivered to, or drawn by, the LED or other illumination source can be set at the factory or otherwise at different levels, for instance, at 200 mA, at 450 mA, or at other levels, each of which can cause the LED or other illumination source to produce a different absolute brightness. According to various embodiments, the various filters used in the detection system of a PCR instrument or other instrument can have different efficiencies across different filter wavelengths in the same instrument, or at the same wavelength across different instruments due to filter manufacturing tolerances. According to various embodiments, therefore, the absolute detected quantities taken at different wavelengths can vary in the same PCR instrument or other instrument, and conversely, emission data for the same filter wavelength captured on two of the same type of machine can vary, due to variations in filter tolerances and efficiency. These and other effects can create and introduce instrumental variance, offsets, or other kinds of unwanted bias when attempting to calibrate the background contribution of plates, wells, non-reactant liquids, filters, or other components or aspects of a PCR detection system or other detection system.
[0031] Attempting to characterize the background contributions by means of an un-normalized standard deviation or other statistical measure can therefore result in background calibration at significantly skewed scales, reducing the usability of calibration data generated in that fashion. According to various embodiments of the present teachings, instead of a conventional standard deviation calculation, a normalized or relative STD measure can be generated, permitting broader and more useful integration of well, plate, cycle, filter, machine, and other calibration measures, on a consistent basis.
[0032] According to various embodiments, the background calibration can comprise the computation of a relative standard deviation (again, referred to as relativeSTD) that is independent of instrument factors or effects such as instrument gain, filter efficiency, current draw (or brightness) of an LED or other illumination source, detector sensitivity, or other factors. According to various embodiments, the relativeSTD value can be defined as the ratio between the standard deviation (STD) and the measured MedianDiffMinPeak, as defined below in Equation 1.
[0033] where the MedianDiffMinPeak is the median of differences across all wells, where the difference is defined as the difference between maximum pixel values and the minimum pixel values of a detected emission from a well, as expressed below in Equation 2.
MedianDiffMinPeak=median(S),Equation 2
[0034] where S={max(W.sub.i)min(W.sub.i)|I1, . . . , 48}, and W.sub.i is a set of pixel values for pixels configured to detect fluorescence from well i.
[0035] According to various embodiments, the STD can, for example, be the standard deviation of well data from individual wells 106 of plate 104 (see
[0036] According to various embodiments, the calibration can comprise, for each filter in a set of filters 108, or for another detection channel, identifying one or more outlier well or wells where the well background signal is lower than a threshold, as expressed below in Equation 3.
Background detected in Well(i)<mean(2relativeSTD)(MedianDiffMeanPeak).Equation 3
[0037] According to various embodiments, the calibration can also or instead comprise, for each filter in a set of filters 108, or for another detection channel, identifying one or more outlier well or wells where the well background signal is greater than a threshold, as expressed below in Equation 4.
Background detected in Well(i)>mean+(2relativeSTD)(MedianDiffMeanPeak),
[0038] where the relativeSTD is a built-in or defined parameter, for instance, generated according to Equation 1 above, and the MedianDiffMeanPeak is generated according to Equation 2 above, and, for instance, is calculated from the background calibration run in a PCR or other amplification or other reaction.
[0039] According to various embodiments, the background calibration can comprise, for each filter in a set of detection filters 108 and/or for each well 106, calculating the background well signal by averaging the background signal for cycles that are not identified or excluded as outliers and are instead considered useful or reliable.
[0040] According to various embodiments of the present teachings, the defined measure of MedianDiffMeanPeak and associated relativeSTD can cancel or compensate for the effect of instrumental variations that can come into play across different instruments, plates, or processes. As, for example, illustrated in
[0041] According to various embodiments, employing a conventional standard deviation computation can ultimately produce inconsistent results, since different instruments can be set to different amplification or gain values, can use LEDs or other illumination sources which are set to be driven with different amounts of current and/or have different efficiencies and thus produce different brightness, can have photodetectors of different sensitivities or efficiencies, or can otherwise differ in factors that affect the absolute value or amplitude of the output. According to various embodiments, the background calibration techniques described herein instead rely upon metrics for measuring background variation that are invariant under various instrumental fluctuations or deviations, and therefore permit comparison between instruments, plates, cycles, or other items that may or may not operate with the same instrumental biases.
[0042] According to various embodiments, the background calibration can utilize metrics for measuring background variation that do not depend on absolute values of detected background output, but instead, for example, can utilize relative values, such as a ratio or other scaled or transformed value or values. The relativeSTD described above meets all requirements for invariance. As, for example, illustrated in
[0043] According to various embodiments, the relativeSTD can be taken, for instance, of either well intensity data as a whole, or peak pixel density for a single pixel with greatest amplitude corresponding to a given well. According to various embodiments, the detector system within a PCR system 102 or other instrument can have imaging resolution of hundreds of lines per inch, such as 400 lines per inch, or more or less, so that a significant number of separate intensities based on narrow line widths can be reported moving across an individual well 106. According to various embodiments, use of the standard deviation of peak pixels improves on the calculation of standard deviation based on overall or total well signal, because the variance of standard deviation of total well data will therefore typically be several times higher than the variance of standard deviation of single peak pixels, due to increased sampling for the total well data.
[0044] Overall fluorescent background calibration processing according to various embodiments of the present teachings is illustrated in the flowchart of
[0045] In step 1010, the MedianDiffMinPeak quantity can be computed or generated, for example, according to Equation 2 above. In step 1012, the relativeSTD quantity or variable can be computed or generated, for example, according to Equation 1 above. In step 1014, a thresholding operation can be performed on the remaining or selected wells or cycles, for instance, according to Equations 3 and 4 above. In some embodiments, other thresholding equations or quantities can be used. In step 1016, wells or cycles lying outside the thresholding limits of step 1014 can be removed or discarded. In step 1018, the normalized or conditioned well, cycle, filter, or other data, can be used to process analytical PCR runs, or perform other operations. In step 1020, processing can end, repeat, return to a prior processing point, or proceed to a further processing point.
[0046] According to various embodiments, the background calibration can comprise reporting a problem with the background run such as detected non-uniform results, and/or outlier wells. In some embodiments, the problem can be reported to an operator, to an automated logging system, or to another destination, location, or storage. According to various embodiments, the quality check reflected in background or baseline calibration that identifies outliers is consistent and robust across different instruments, and is insensitive to instrument factors such as instrument gain, LED current, filter design, and offset. According to various embodiments, the background calibration can produce improved results by averaging cycle data classified as accurate or reliable, and removing outlier cycles.
[0047] According to various embodiments, different aspects of the differential dissociation/melting curve analysis of the present teachings can be applied to commercial systems and implementations, for example, can be applied to the STEPONE system commercially available from Applied Biosystems, Foster City, Calif., and described, for example, in the publication entitled Applied Biosystems Step One Real-Time PCR System Getting Started Guide, which publication is incorporated by reference in its entirety herein.
[0048] It will be appreciated that while various embodiments described above involve the calibration of one or more aspects of background or baseline signal behavior, according to various embodiments, more than one type of background or other calibration can be performed, together or in sequence.
[0049] Various embodiments of the present teachings can be implemented, in whole or in part, in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations thereof. Apparatus of the present teachings can be implemented in a computer program, software, code, or algorithm embodied in machine-readable media, such as electronic memory, CD-ROM or DVD discs, hard drives, or other storage devices or media, for execution by a programmable processor. Various method steps according to the present teachings can be performed by a programmable processor executing a program of instructions to perform functions and processes according to the present teachings, by operating on input data and generating output. The present teachings can, for example, be implemented in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system or memory, at least one input device such as a keyboard and mouse, and at least one output device, such as, for example, a display or printer. Each computer program, algorithm, software, or code, can be implemented in a high-level procedural or object-oriented programming language, or can be implemented in assembly, machine, or other low-level language, if desired. According to various embodiments, the code or language can be a compiled, interpreted, or otherwise processed for execution.
[0050] Various processes, methods, techniques, and algorithms disclosed herein can be executed on processors that can include, by way of example, both general and special purpose microprocessors, for example, general-purpose microprocessors such as those manufactured by Intel Corp. or AMD Inc., digital signal processors, programmable controllers, or other processors or devices. According to various embodiments in general, a processor will receive instructions and data from a read-only memory and/or a random access memory. In some embodiments, a computer implementing one or more aspects of the present teachings can generally include one or more mass storage devices for storing data files, such as magnetic disks, such as, internal hard disks, removable disks, magneto-optical disks, and CD-ROM, DVD, Blu-Ray, or other optical disks or media. Memory or storage devices suitable for storing, encoding, or embodying computer program instructions or software and data as described herein can include, for instance, all forms of volatile and non-volatile memory, including, for example, semiconductor memory devices, such as random access memory, electronically programmable memory (EPROM), electronically erasable programmable memory (EEPROM), and flash memory devices, as well as magnetic disks such as internal hard disks and removable disks, magneto-optical disks, and optical disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs. According to various embodiments, processors, workstations, personal computers, storage arrays, servers, and other computer, information, or communication resources, used to implement features of the present teachings, can be networked or network-accessible.
[0051] Other embodiments will be apparent to those skilled in the art from consideration of the present specification and practice of the present teachings disclosed herein. For example, resources described in various embodiments as singular can, in embodiments, be implemented as multiple or distributed, and resources described in various embodiments as distributed can be combined. It is intended that the present specification and examples be considered as exemplary only.