PEAK DETECTION METHOD AND DATA PROCESSING DEVICE

20200378933 ยท 2020-12-03

Assignee

Inventors

Cpc classification

International classification

Abstract

The peak detection method of the present invention is a method of detecting peaks from data of a graph representing change in measured value relative to a measurement variable, comprising: a wavelet transform step (S2) in which wavelet transform is performed on the aforementioned data using a mother wavelet having a single maximum value to find an evaluation function having said mother wavelet's scale and translation as parameters; and a peak candidate information acquisition step (S3 through S5) in which locations of peak candidates in the aforementioned data are found based on the translation at which said evaluation function has its maximum value, and the width of said peak candidates is determined based on the scale corresponding to said peak candidates. Performing wavelet transform makes it possible to detect peak candidates regardless of the strength or weakness of peaks, etc., and to determine the width of peak candidates, which serves as an index for discriminating whether or not the peak candidate is a true peak.

Claims

1. A peak detection method of detecting peaks from data of a graph representing change in measured value relative to a measurement variable, comprising: a wavelet transform step in which wavelet transform is performed on the data using a mother wavelet having a single maximum value to find an evaluation function having said mother wavelet's scale and translation as parameters; a peak candidate information acquisition step in which locations of peak candidates in the data are found based on the translation at which said evaluation function has its maximum value, and the width of said peak candidates is determined based on the scale corresponding to said peak candidates; and a wide peak candidate detection data generating step of generating wide peak candidate detection data by eliminating one or multiple peak candidates from the data in sequence starting with the peak candidate having the smallest scale value among the plurality of peak candidates; wherein, after executing the wide peak candidate detection data creation step, the data is replaced with the wide peak candidate detection data and the wavelet transform step and the peak candidate information acquisition step are performed again.

2. (canceled)

3. A data processing device, which performs data processing for detecting peaks from data of a graph representing change in measured value relative to a measurement variable, comprising: a wavelet transform unit which performs a wavelet transform on the data using a mother wavelet having a single maximum value to find an evaluation function having said mother wavelet's scale and translation as variables; a peak candidate information acquisition unit which finds locations of peak candidates in the data based on the translation at which said evaluation function has its maximum value, and determines the width of said peak candidates based on the scale corresponding to said peak candidates; and a wide peak candidate detection data generating unit that generates wide peak candidate detection data by by eliminating one or multiple peak candidates from the data in sequence starting with the peak candidate having the smallest scale value among the plurality of peak candidates; wherein, after generating the wide peak candidate detection data, the data is replaced with the wide peak candidate detection data and the wavelet transform step and the peak candidate information acquisition step are performed again.

Description

[BRIEF DESCRIPTION OF THE DRAWINGS]

[0027] [FIG. 1] A schematic diagram illustrating a first mode of embodiment of the data processing device of the present invention.

[0028] [FIG. 2] A flow chart illustrating a first mode of embodiment of the data processing method of the present invention.

[0029] [FIG. 3] A graph illustrating the data subjected to processing in embodiment example 1, in which processing was performed by the data processing method of the first mode of embodiment.

[0030] [FIG. 4] A graph illustrating an example in which ridge lines and the location of the maximum value of an evaluation function on said ridge lines have been found based on an evaluation function obtained through wavelet transform in embodiment example 1.

[0031] [FIG. 5] A schematic diagram illustrating a second mode of embodiment of the data processing device of the present invention.

[0032] [FIG. 6] A flow chart illustrating a second mode of embodiment of the data processing method of the present invention.

[0033] [FIG. 7] A graph illustrating an example in which narrow peaks have been eliminated from the original data based on width as determined through wavelength transform in embodiment example 2, in which processing was performed by the data processing method of the second mode of embodiment.

[0034] [FIG. 8] A graph illustrating an example in which ridge lines corresponding to wide peaks and the location of the maximum value of an evaluation function on said ridge lines corresponding to wide peaks have been found based on an evaluation function obtained by performing wavelet transform on data from which narrow peaks have been eliminated.

MODES FOR EMBODYING THE INVENTION

[0035] Modes of embodiment of the peak detection method and data processing device of the present invention will be described using FIG. 1 through FIG. 8.

[0036] The data processing device 10 of the first mode of embodiment is used together with a data recording unit 1, displaying device 2 and input device 3. The data recording unit 1 is a device which records the data obtained during measurement by a detector possessed by a liquid chromatograph, gas chromatograph, etc., and comprises a hard disk, memory, etc. The data recording unit 1, in the example shown in FIG. 1, is provided outside the data processing device 10, but it may also be provided inside the data processing device 10. The displaying device 2 is a display which displays information during data processing and the results of data processing performed by the data processing device 10. The input device 3 is a device for input of necessary information by the user into the data processing device 10, and comprises a keyboard, mouse, etc.

[0037] The data processing device 10 comprises a chromatogram generating unit 11, a wavelet transform unit 12, a peak candidate information acquisition unit 13 and a peak determination unit 14. The peak candidate information acquisition unit 13 comprises a peak candidate location determination unit 131 and a peak candidate width determination unit 132. These units are actually implemented by means of computer hardware, such as a CPU, memory, etc., and software. The first mode of embodiment of the data processing method of the present invention will be described below, along with the functions of each unit of the data processing device 10 of the first mode of embodiment, using the flow chart of FIG. 2.

[0038] First, the chromatogram generating unit 11 acquires data from data recording unit 1 and generates a chromatogram C(t) by the same methods as in the prior art (step S1). This chromatogram corresponds to the data of a graph representing change in measured value mentioned above, and t is time, which corresponds to the measurement variable mentioned above. When processing is performed on a spectrum, the operation of generating a chromatogram is unnecessary, and it suffices to simply acquire data from the data recording unit 1.

[0039] Next, the wavelet transform unit 12 performs wavelet transform on the chromatogram C(t) to find the evaluation function (step S2). For the wavelet transform, the chromatogram C(t) is used as the analyzed data x(t) in formula (1) above, and the inner product is found using a function having a single maximum value, such as the Mexican hat function or difference of Gaussians. The inner product found in this manner may itself be used as the evaluation function, or alternatively, the inner product multiplied by .sub.f.sup.m (here, .sup.f is scale), as shown in formula (5), may be used as the evaluation function. This evaluation function is a function having scale and translation as its parameters.

[0040] Subsequently, the peak candidate information acquisition unit 13 generates ridge lines (step S3) based on the evaluation function found in step S2, and finds the point where the evaluation function has its maximum value on the obtained ridge lines. To generate the ridge lines, the operation of fixing the scale .sub.f to a given value and then finding the point which yields the maximum value while changing the measurement variable t is performed for multiple values of scale .sub.f. The ridge lines obtained here are not limited to one, and in many cases, multiple ridge lines will be present, so the point at which the value of the evaluation function has its maximum value on the ridge lines is found separately for each maximum value. The peak candidate location determination unit 131 determines the values of the measurement variable at the maximum value obtained in this manner as locations (times) at which a peak candidate is present on the chromatograph C(t) (step S4), and the peak candidate width determination unit 132 determines the width of peak candidates based on the scale for each peak candidate (step S5). These steps S3 through S5 correspond to the aforementioned peak candidate information acquisition step. It will be noted that steps S4 and S5 may also be carried out simultaneously in parallel, and step S5 may also be performed first. The width of peak candidates, of example, when using formula (5) and formula (6), is a value 5.sup.1/2 times the scale at the maximum value when m=0 (the inner product is the evaluation function), and is the same value as the scale at the maximum value when m=1.

[0041] Subsequently, the peak determination unit 27 discriminates, based on the obtained width value, whether the corresponding peak candidate is a true peak or something other than a true peak, such as noise or background (step S6), thereby completing the processing. Whether or not a peak is a true peak is typically determined by setting an upper limited value and lower limit value for width, and discriminating the peak as being true if the obtained width is between the upper limit value and lower limit value, and as not being true if the width is not between these two values. Alternatively, the location (time) may be divided into multiple zones, with an upper limit value and lower limit value for width being set for each zone. In the present mode of embodiment, the peak determination unit 14 performs this discrimination automatically, but it is also possible to display the width values on the displaying device 2 and have a person perform the discrimination instead.

[0042] Next, an example (embodiment example 1) will be described, in which peak detection was performed by the data processing method of the first mode of embodiment. In embodiment example 1, the object of processing was the data shown in the graph of FIG. 3. In this data, one can observe ten peaks (P01 through P10), and one peak-shaped profile PB1 which is wider than those peaks.

[0043] In the present embodiment example, the inner product was found by performing wavelet transform on this data using the Mexican hat function as the mother wavelet, and that inner product multiplied by .sub.f.sup.1 (m=1 in formula (5)) was used as the evaluation function. In this example, the value of scale at which the evaluation function has its maximum value is determined directly as being the width of a peak candidate in the original data.

[0044] FIG. 4 shows the results of finding the ridge lines. In FIG. 4, data is displayed as follows, with the horizontal axis of the two-dimensional graph showing time (the measurement variable; same as the horizontal axis of the original data) and the vertical axis showing the scale .sub.f (where .sub.f is a base 2 logarithm). First, the scale .sub.f is fixed at one value and the measurement variable is changed to find the point at which the evaluation function has its maximum value for the given value of scale .sub.f. This operation is repeated while changing the value of .sub.f little by little. In this way, one series of points extending in the vertical axis direction appears for each peak candidate in the graph of FIG. 4. These series of points are the ridge lines. The locations of maximum values on the two-dimensional graph of FIG. 4 can then be identified by finding the scale .sub.f at which the evaluation function has its maximum value for each ridge line.

[0045] Based on the graph of FIG. 4, 10 points (the x marks in the figure) at which the evaluation function has its maximum value have been obtained. These 10 points correspond to the peaks P01 through P10, and all have substantially the same value of .sub.f i.e. width of the original data. There are multiple series of points in this graph labeled N, which constitute noise.

[0046] A series of points corresponding to peak-shaped profile PB1 can also be seen in the graph of FIG. 4, but due to the significant the influence of the overlap of P05 and P06 where .sub.f is large, it was not possible to estimate the width of the peak candidate corresponding to the peak-shaped profile PB1 based on this embodiment example 1. Next, the second mode of embodiment of the present invention will be described, which is a method which allows the width of peak candidates to be estimated even when there is such overlap with other peaks.

[0047] A schematic diagram of the data processing device 20 of the second mode of embodiment is shown in FIG. 5. This data processing device 20 has a configuration obtained by adding a wide peak candidate detection data generating unit 21 to the data processing device 10 of the first mode of embodiment. The second mode of embodiment of the data processing method of the present invention will be described below, along with the function of the wide peak candidate detection data generating unit 21.

[0048] First, in steps S11 through S15, the same operations are performed as in steps S1 through S5 of the peak processing method of the first mode of embodiment to determine the locations (times) at which peak candidates are present and the widths of the peak candidates.

[0049] Next, the wide peak candidate detection data generating unit 21 selects one or multiple peak candidates from among the peak candidates in sequence starting from the one with the smallest determined width (step S16). Subsequently, in step S17, the decision is made as to whether a wide peak detection operation is to be carried out. Methods of making this decision include, for example, deciding YES when it was not possible to find the maximum value as in the case of the aforementioned peak-shaped profile PB 1, or deciding YES until step S17 has been executed a predetermined number of times, regardless of the content of the data.

[0050] In the case of YES in step S17, it is determined whether or not a selected peak candidate is a true peak (step S18). Data for the range corresponding to the location and width of the selected peak candidate is then removed from the chromatogram C(t), and the data within that range is interpolated with a straight line or curve to generate data for wide peak candidate detection (step S19). Control then returns to step S12 and the operations of steps S12 through S16 are repeated based on the data for wide peak candidate detection. Here, the number of peaks or peak-shaped profiles in the data for wide peak candidate detection is smaller than in the original data, making it easier to detect wide peaks. The operations of these steps S12 through S16 are repeatedly executed until the decision in step S17 becomes NO.

[0051] Furthermore, when the decision in step S17 is NO, the peak determination unit discriminates, for all the peak candidates remaining at the time, whether or not the peak is a true peak (step S20), thereby completing the processing.

[0052] An example (embodiment example 2) in which peak detection was performed by the data processing method of the second mode of embodiment will be described below. In embodiment example 2, similarly to embodiment example 1, the object of processing is the data shown in the graph of FIG. 3. First, 10 peaks P01 through P10 were detected by the same method as in embodiment example 1. Then, based on the locations and widths found for the detected peaks P01 through P10, the data for the portion corresponding to peaks P01 through P10 was eliminated from the original data of FIG. 3, and the eliminated portion was interpolated to generate data for wide peak candidate detection. A graph of the generated data for wide peak candidate detection is shown in FIG. 7. Ridge lines were then generated based on this data for wide peak candidate detection. These ridge lines are shown in FIG. 8. One series of points extending in the vertical axis direction is formed corresponding to the peak-shaped profile PB1, and the location at which the evaluation function has its maximum value is identified. There are no other series of points near this series, making it possible to determine the location and width of the peak-shaped profile PB1 without the influence of other peaks, etc. It should be noted that the series of points other than that corresponding to the peak-shaped profile PB1 are noise due to discontinuities, etc. arising during data interpolation.

DESCRIPTION OF REFERENCE SYMBOLS

[0053] 2 . . . Data recording unit

[0054] 2 . . . Displaying device

[0055] 3 . . . Input device

[0056] 10, 20 . . . Data processing device

[0057] 11 . . . Chromatogram generating unit

[0058] 12 . . . Wavelet transform unit

[0059] 13 . . . Peak candidate information acquisition unit

[0060] 131 . . . Peak candidate location determination unit

[0061] 132 . . . Peak candidate width determination unit

[0062] 14 . . . Peak determination unit

[0063] 21 . . . Wide peak candidate detection data generating unit