Apparatus and method for processing spectrum

11550010 · 2023-01-10

Abstract

A spectrum y includes a waveform-of-interest component and a baseline component serving as a wide-band component. An optimum solution of a signal model x is determined according to a first condition to fit a corresponding portion S.sub.IFx of a baseline model Fx with respect to a representative portion y.sub.I of the baseline component, and a second condition to minimize an Lp norm (wherein p≤1) of the signal model x. An estimated baseline component determined from the optimum solution of the signal model x is subtracted from the spectrum y.

Claims

1. An apparatus for processing a spectrum, comprising: a processor that is programmed to: receive a spectrum which includes a baseline component, process the spectrum, wherein, when processing the spectrum, the processor is to: search for an optimum signal model according to: a first condition to fit a baseline model with respect to the baseline component, wherein the baseline model is a frequency space model, and a second condition to reduce an Lp norm (wherein p≤1) of a signal model which is another expression of the baseline model, wherein the signal model is a time space model in a predetermined space in which the baseline component is expressed as a sparse signal, wherein the predetermined space is a time space, and wherein the baseline model is defined by applying a transform matrix to the signal model, and generate an optimum baseline model corresponding to the optimum signal model, wherein the optimum baseline model is an estimated baseline component; and perform a spectrum analysis on the spectrum after subtracting the estimated baseline component from the spectrum.

2. The apparatus for processing the spectrum according to claim 1, wherein the processor is further to: subtract the estimated baseline component from the spectrum.

3. The apparatus for processing the spectrum according to claim 1, wherein the first condition is a condition to fit, with respect to a representative portion in the baseline component, a corresponding component in the baseline model.

4. The apparatus for processing the spectrum according to claim 1, wherein an evaluation value J is defined by an L2 norm of a residual determined from the baseline component and the baseline model and the Lp norm of the signal model, and a condition to minimize the evaluation value J forms the first condition and the second condition.

5. The apparatus for processing the spectrum according to claim 4, wherein the processor is further to: select a representative portion of the baseline component from the spectrum, and wherein the residual is a residual between the representative portion in the baseline component and a corresponding portion in the baseline model.

6. The apparatus for processing the spectrum according to claim 1, wherein the spectrum is an NMR spectrum.

7. An apparatus for processing a spectrum, comprising: a means for: receiving a spectrum which includes a baseline component; processing the spectrum; and performing a spectrum analysis on the spectrum after subtracting an estimated baseline component from the spectrum; wherein processing the spectrum comprises: searching for an optimum signal model according to: a first condition to fit a baseline model with respect to the baseline component, wherein the baseline model is a frequency space model, and a second condition to reduce an Lp norm (wherein p≤1) of a signal model which is another expression of the baseline model, wherein the signal model is a time space model in a predetermined space in which the baseline component is expressed as a sparse signal, wherein the predetermined space is a time space, and wherein the baseline model is defined by applying a transform matrix to the signal model, and generating an optimum baseline model corresponding to the optimum signal model, wherein the optimum baseline model is an estimated baseline component.

8. A method for processing a spectrum, comprising: receiving, with at least one processor, an NMR spectrum which includes a baseline component; and processing, with the at least one processor, the NMR spectrum, wherein processing the NMR spectrum comprises: searching for an optimum signal model according to: a first condition to fit a baseline model with respect to the baseline component, wherein the baseline model is a frequency space model, and a second condition to reduce an Lp norm (wherein p≤1) of a signal model which is another expression of the baseline model, wherein the signal model is a time space model in a time space, and wherein the baseline model is defined by applying a transform matrix to the signal model, and generating, with at least one processor, an optimum baseline model determined from the optimum signal model, wherein the optimum baseline model is an estimated baseline component; and performing a spectrum analysis on the spectrum after subtracting the estimated baseline component from the spectrum.

Description

BRIEF DESCRIPTION OF DRAWINGS

(1) Embodiment(s) of the present disclosure will be described by reference to the following figures, wherein:

(2) FIG. 1 is a block diagram showing a function of an apparatus for processing a spectrum according to an embodiment of the present disclosure;

(3) FIG. 2 is a conceptual diagram showing a flow of a spectrum processing;

(4) FIG. 3 is a block diagram showing an information processor device functioning as an apparatus for processing a spectrum;

(5) FIG. 4 is a diagram showing an algorithm according to a first configuration;

(6) FIG. 5 is a diagram showing a first processing result of a method of processing a spectrum according to an embodiment of the present disclosure;

(7) FIG. 6 is a diagram showing a second processing result of a method of processing a spectrum according to an embodiment of the present disclosure;

(8) FIG. 7 is a diagram showing an algorithm according to a second configuration; and

(9) FIG. 8 is a diagram showing an algorithm according to a third configuration.

DESCRIPTION OF EMBODIMENTS

(10) An embodiment of the present disclosure will now be described with reference to the drawings.

(11) FIG. 1 shows an NMR measurement system. The NMR measurement system comprises an NMR measurement apparatus 10 and a spectrum processing apparatus 12. NMR spectrum data are transferred from the NMR measurement apparatus 10 to the spectrum processing apparatus 12. The transfer is executed, for example, via a network, or via a storage medium.

(12) The NMR measurement apparatus 10 comprises a spectrometer and a measurement unit. The measurement unit comprises a static magnetic field generator, a probe, or the like. The static magnetic field generator has a bore serving as a vertical through hole, and an insertion unit of a probe is inserted into the bore. In a head of the insertion unit, a detector circuit which detects an NMR signal from the sample is provided. The spectrometer comprises a controller, a transmission unit, a reception unit, and the like. The transmission unit generates a transmission signal according to a transmission pulse sequence, and the transmission signal is sent to the probe. With this process, an electromagnetic wave is irradiated onto the sample. Then, at the probe, the NMR signal from the sample is detected. A reception signal generated by the detection is sent to the reception unit. In the reception unit, an NMR spectrum is generated by an FFT computation with respect to the reception signal. The NMR spectrum is sent to the spectrum processing apparatus 12 as necessary. Alternatively, the spectrum processing apparatus 12 may be incorporated in the NMR measurement apparatus 10.

(13) In the embodiment, the spectrum processing apparatus 12 is formed by an information processor device such as a computer. FIG. 1 shows a plurality of representative functions of the spectrum processing apparatus 12 by a plurality of blocks. A specific structure of the information processor device will be described later with reference to FIG. 3.

(14) In the following, with reference to FIGS. 1 and 2, a spectrum process executed by the spectrum processing apparatus 12 will be described.

(15) In FIG. 1, an observed NMR spectrum y (hereinafter also referred to simply as “spectrum y”) is input to the spectrum processing apparatus 12. The spectrum y is given to a searcher 14 and a selector 16. The searcher 14 functions as a search unit, and the selector 16 functions as a selection unit. First, the selector 16 will be described. The selector 16 is a unit for selecting, in the spectrum, a portion designated by a user as a representative portion in the baseline component. In general, the baseline component which is a wide-band component exists over the entirety of the spectrum y. The representative portion is selected such that components other than the baseline component, such as the waveform-of-interest component, are not set as the fitting target.

(16) FIG. 2 exemplifies at an upper left part the spectrum y. The spectrum y includes a waveform-of-interest component 30 and a baseline component 32. For example, in a state where the spectrum y is displayed on a screen, the user designates two sections 34 and 36 on the screen, avoiding the waveform-of-interest component 30. In the baseline component 32, the portions included in the sections 34 and 36 form a representative portion y.sub.I. FIG. 2 exemplifies at an upper right part the representative portion y.sub.I. Here, for example, I is a coordinate array which identifies a plurality of representative points of the representative portion y.sub.I. Alternatively, elements of the representative portion y.sub.I may be sequentially designated in units of the representative points (that is, elements). In this case, it is possible to designate the representative portion y.sub.I while avoiding fine noise. In either case, it is desirable to select as the representative portion y.sub.I a portion which is highly likely to belong to the baseline component 32. Alternatively, the representative portion y.sub.I may be automatically selected by waveform analysis or other methods.

(17) In FIG. 1, in the embodiment, the searcher 14 searches for an optimum solution of a signal model x according to a presumption condition (first condition) shown by the following Equation (1-2) and a basic condition (second condition) shown by the following Equation (1-1). In the computation, the signal model x is a vector.
[Equation 1]
min∥x∥.sub.p.sup.p  (1-1)
subject to y.sub.I=S.sub.IFx  (1-2)

(18) Equation (1-2) described above shows that a baseline component is fitted by a baseline model, and more specifically, shows that the representative portion y.sub.I in the baseline component is fitted by a corresponding portion S.sub.IFx of the baseline model. The signal model x is another expression of the baseline model Fx, and is an expression of the baseline model Fx in the frequency space as a model in the time space. In other words, the baseline model Fx is another expression of the signal model x. F represents a transform matrix from the time space to the frequency space, and S.sub.I represents a sampling matrix in the frequency space which extracts a corresponding portion corresponding to the representative portion from the baseline model. In a specific configuration described later, in the search of the optimum solution of the signal model x, a condition for minimizing the L2 norm of a residual between the representative portion y.sub.I and the corresponding portion S.sub.IFx is taken into consideration.
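As an illustration only, the matrices appearing in Equation (1-2) can be sketched in Python with NumPy. The function name build_operators, the choice of a unitary discrete Fourier transform as the transform matrix F, and the example sizes are assumptions made for this sketch; they are not taken from the disclosure.

```python
import numpy as np

def build_operators(n, idx):
    """Return (F, S, B) for an n-point spectrum and representative indices idx.

    Hypothetical helper: F is assumed to be the unitary DFT matrix.
    """
    # Transform matrix: maps a time-space vector x to the frequency-space vector F @ x.
    F = np.fft.fft(np.eye(n), norm="ortho")
    # Sampling matrix: M rows and N columns, a single 1 per row at a representative index.
    S = np.eye(n)[np.asarray(idx)]
    return F, S, S @ F

n = 8
idx = [0, 1, 6, 7]                 # coordinate array I (example values)
F, S, B = build_operators(n, idx)

x = np.zeros(n, dtype=complex)
x[1] = 1.0                         # a sparse signal model in the time space
baseline = F @ x                   # baseline model Fx in the frequency space
corresponding = B @ x              # corresponding portion S_I F x
```

In this sketch, S has M=4 rows and N=8 columns, and the product S @ F plays the role of the matrix that is later abbreviated as B.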

(19) Equation (1-1) described above shows a condition to minimize the Lp norm (wherein p≤1) of the signal model x. It is known that, when the value of p is less than or equal to 1, the Lp norm functions to increase the sparsity of the solution which is the norm computation target, in the process of solving the problem of minimization of the Lp norm. The value of p is greater than or equal to 0 and less than or equal to 1. When p is 0, the solution may become unstable. Thus, desirably, the value of p is greater than 0 and less than or equal to 1. The spectrum processing apparatus 12 estimates the baseline component utilizing the two properties that the baseline component is a sparse signal on the time axis and that the Lp norm increases the sparsity of the solution in the process of searching the optimum solution.

(20) In general, p is 1, but when the sparsity of the baseline component is expected to be high, p may be set to, for example, 0.75 or 0.5. When the number of elements of the baseline component is N and the number of elements of the representative portion y.sub.I is M, the representative portion y.sub.I is a matrix of M rows and 1 column, and the sampling matrix S.sub.I is a matrix of M rows and N columns.

(21) With regard to the norm, generally, the following expressions represented by Equation (2-1) and Equation (2-2) are permitted. In the present disclosure, the “Lp norm” basically refers to the norm defined by Equation (2-2). The parameter n represents the number of elements of the vector.
[Equation 2]
∥x∥.sub.p=(|x.sub.1|.sup.p+|x.sub.2|.sup.p+ . . . +|x.sub.n|.sup.p).sup.1/p  (2-1)
∥x∥.sub.p.sup.p=|x.sub.1|.sup.p+|x.sub.2|.sup.p+ . . . +|x.sub.n|.sup.p  (2-2)
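The two expressions of Equation 2 can be compared numerically with a short sketch. The function names lp_norm and lp_norm_pow and the two example vectors are hypothetical and serve only to show that, for equal L2 energy, a sparse vector yields the smaller value when p is less than or equal to 1.

```python
import numpy as np

def lp_norm_pow(x, p):
    """Equation (2-2): sum of |x_i|**p."""
    return float(np.sum(np.abs(x) ** p))

def lp_norm(x, p):
    """Equation (2-1): (sum of |x_i|**p) ** (1/p)."""
    return lp_norm_pow(x, p) ** (1.0 / p)

# Two example vectors with the same L2 norm (both equal to 2).
sparse = np.array([2.0, 0.0, 0.0, 0.0])
dense = np.array([1.0, 1.0, 1.0, 1.0])

l1_sparse = lp_norm_pow(sparse, 1.0)     # 2.0
l1_dense = lp_norm_pow(dense, 1.0)       # 4.0
half_sparse = lp_norm_pow(sparse, 0.5)   # sqrt(2)
half_dense = lp_norm_pow(dense, 0.5)     # 4.0
```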

(22) In FIG. 1, a portion shown by reference numeral 18 is a portion where the signal model x is to be generated (updated), and a portion shown by reference numeral 20 is a portion where the signal model x is to be evaluated. An optimum solution of the signal model x is searched to minimize the evaluation value to be described later. The signal model x at the time when a completion condition is satisfied is an optimum signal model x′.

(23) In the embodiment, an initial signal model x.sub.0 is generated from the representative portion y.sub.I. Specifically, the representative portion y.sub.I is transformed into a signal in the time space, to generate the initial signal model x.sub.0. A portion shown by reference numeral 22 is a portion where this function is realized. Alternatively, the initial signal model x.sub.0 may be generated by other conditions, or designated by the user.

(24) FIG. 2 exemplifies at a lower right part the optimum signal model x′ which is the optimum solution. The optimum signal model x′ has an approximated waveform or a simulated waveform, in the time space, fitted to a signal, on the time axis, corresponding to the baseline component. For example, the optimum signal model x′ has a pulse-shape portion 38 having a short time width and a flat portion 40 other than the pulse-shape portion 38. The flat portion 40 is formed as an array of zeros (or values near zero). The exemplified optimum signal model x′ is a sparse signal.

(25) In FIG. 1, a subtractor 24 subtracts an estimated baseline component Fx′ from the original spectrum y. The estimated baseline component Fx′ is an optimum baseline model which is generated by applying a transform matrix F to the optimum signal model x′. An analyzer 26 executes analysis on the spectrum after the subtraction.

(26) FIG. 2 exemplifies at a slightly lower part in the center the estimated baseline component Fx′ generated from the optimum signal model x′. The estimated baseline component Fx′ is a wide-band component on the frequency axis. The estimated baseline component Fx′ is subtracted from the original spectrum (refer to reference numeral 24A), to obtain the spectrum y′ after the subtraction. In the spectrum y′, the baseline component 32 included in the original spectrum y is removed, and, at the same time, the waveform-of-interest component 30 included in the original spectrum y is maintained.
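The relationship between a wide-band baseline on the frequency axis and a sparse signal on the time axis, and the subsequent subtraction, can be illustrated with synthetic data. In the following sketch, the baseline shape, the peak position, and the use of a unitary inverse FFT are assumptions; the optimum signal model is stood in for by the exact time-space expression of a known baseline, rather than being found by the search described above.

```python
import numpy as np

n = 64
f = np.arange(n)

# Synthetic wide-band baseline that varies slowly along the frequency axis.
baseline = 1.0 + 0.5 * np.cos(2 * np.pi * f / n)

# Synthetic waveform-of-interest: a single narrow peak.
peak = np.zeros(n)
peak[20] = 5.0
y = baseline + peak

# Time-space expression of the baseline (inverse of the assumed unitary F).
x_opt = np.fft.ifft(baseline, norm="ortho")
support = np.flatnonzero(np.abs(x_opt) > 1e-9)   # indices of non-zero elements

# Estimated baseline component F x' and the spectrum after subtraction.
estimated = np.fft.fft(x_opt, norm="ortho")
y_sub = y - estimated.real
```

Only three of the 64 time-space elements are non-zero in this example, and the subtraction leaves the peak untouched.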

(27) FIG. 3 shows an example structure of an information processor device which functions as the spectrum processing apparatus 12. The information processor device includes a CPU 100, a memory 102, an inputter 106, a display 108, and the like. In the memory 102, a spectrum processing program 104 and a spectrum analysis program 105 are stored. The spectrum processing program 104 is a program for realizing the spectrum process described above with reference to FIGS. 1 and 2. The spectrum analysis program 105 is a program for analyzing the spectrum after the pre-process. These programs 104 and 105 are executed by the CPU 100.

(28) The CPU 100 functions as the search unit, the selection unit, the subtraction unit, and the like. The inputter 106 is formed with a keyboard, a pointing device, or the like, and the representative portion is designated by the user using the inputter 106. The display 108 is formed with, for example, an LCD, and displays a spectrum before the processing. Alternatively, a range of the representative portion or a group of representative points may be designated on the displayed spectrum by the user. Alternatively, another processor which executes the spectrum processing program 104 may be provided. Alternatively, a plurality of processors which execute the spectrum processing program may be provided. The concept of the processor includes various computation devices which execute data computation.

(29) Next, with reference to FIG. 4, a first configuration will be described. In this configuration, p=1, and an optimum signal model is searched as the optimum solution according to a condition to minimize the evaluation value J shown in Equation (3).

(30) [Equation 3]
$$J = \frac{1}{2}\left\| y_I - S_I F x \right\|_2^2 + \frac{\lambda}{2}\left\| x \right\|_1^1 \tag{3}$$

(31) A first term in Equation (3) is a term corresponding to the first condition, and is a portion where the L2 norm for a residual (y.sub.I-S.sub.IFx) between the representative portion y.sub.I of the baseline component and the corresponding portion S.sub.IFx of the baseline model in the frequency space is to be computed. A second term in Equation (3) is a term corresponding to the second condition, and is a portion where the L1 norm of the signal model x in the time space is to be computed. The term λ is a regularization weight.

(32) In general, solving the problem of minimizing the evaluation value requires differentiation of the evaluation value computation formula. While the first term is differentiable, differentiating the second term is difficult. Thus, according to the known IST (Iterative Soft Thresholding) method, the above-described problem is solved. Alternatively, the IRLS (Iterative Reweighted Least Squares) method, the SIFT (Spectroscopy by Integration of Frequency and Time domain) method, or the like may be used. When p=1, use of the IST method is desirable. When p is less than 1, use of the IRLS method is desirable. When a portion where the baseline component is dominant in the time space is known, use of the SIFT method is desirable. The second and third configurations to be described later are based on the IRLS method.

(33) FIG. 4 shows an algorithm of the first configuration. In S10, the spectrum y and a coordinate array I for specifying the group of representative points are read. In S12, the sampling matrix S.sub.I and the transform matrix F are formed. The formulas shown in S14 are formulas that replace the matrix S.sub.IF included in Equations (1-2) and (3) with B for the purpose of convenience. In S16, based on the spectrum y and the coordinate array I, the representative portion y.sub.I is formed. In S18, a Lipschitz coefficient L.sub.f is computed. The coefficient is a maximum value of a plurality of eigenvalues for B.sup.H×B; that is, a maximum eigenvalue (alternatively, the Lipschitz coefficient L.sub.f may be set to be greater than or equal to the maximum eigenvalue). B.sup.H is a Hermitian transpose of B. In S20, the initial signal model x.sub.0 is generated from the representative portion y.sub.I. B.sup.+ is a pseudo-inverse matrix of the matrix B. In addition, in S20, k is initialized.

(34) In S22, a first completion condition is judged. Specifically, a judgment is made as to whether or not k, which represents a number of computations, is less than or equal to a maximum value k.sub.max. When k≤k.sub.max, S24 and S26 are executed. The computation formula shown in S24 and the computation formula shown in S26 update the signal model x in two stages. The computation formula shown in S24 is determined by once differentiating the first term in Equation (3), and updates the signal model x to minimize the L2 norm of the first term. The computation formula shown in S26 corresponds to the second term of Equation (3). Because it is difficult to differentiate the second term, a soft threshold function soft( ) is used. The function re-forms, based on a computation result x.sub.k+1 of S24, the value of each element of x.sub.k+1 as follows.
[Equation 4]
If λ/L.sub.f<x.sub.k+1 Then x.sub.k+1=x.sub.k+1−λ/L.sub.f  (4-1)
If −λ/L.sub.f≤x.sub.k+1≤λ/L.sub.f Then x.sub.k+1=0  (4-2)
If x.sub.k+1<−λ/L.sub.f Then x.sub.k+1=x.sub.k+1+λ/L.sub.f  (4-3)

(35) With the use of the soft threshold function soft( ), the signal model x is updated to minimize the L1 norm. In S28, k is incremented. A signal model x.sub.k+1 at the time when the first completion condition described above is satisfied is output as the optimum signal model.
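The two-stage update of the first configuration can be sketched as follows. This is a minimal illustration under stated assumptions and is not the flowchart of FIG. 4 itself: the function names soft, ist, and objective are hypothetical, F is assumed to be a unitary DFT matrix, the gradient step is the standard IST step with step size 1/L.sub.f, the threshold λ/L.sub.f follows Equation 4, and the soft threshold is written in a magnitude form that reduces to Equations (4-1) to (4-3) for real values while also accepting complex values.

```python
import numpy as np

def soft(v, tau):
    """Soft threshold: shrink each magnitude by tau; values with |v| <= tau become 0."""
    mag = np.abs(v)
    safe = np.where(mag > 0, mag, 1.0)            # avoid division by zero
    return np.where(mag > tau, (1.0 - tau / safe) * v, 0.0)

def ist(y_rep, B, lam, k_max=200):
    """Iterative soft thresholding with a fixed number of iterations k_max."""
    Lf = np.linalg.norm(B, 2) ** 2                # maximum eigenvalue of B^H B
    x = np.linalg.pinv(B) @ y_rep                 # initial signal model x_0 = B^+ y_I
    for _ in range(k_max):
        x = x + (B.conj().T @ (y_rep - B @ x)) / Lf   # step on the L2 residual term
        x = soft(x, lam / Lf)                         # shrinkage for the L1 term
    return x

def objective(x, y_rep, B, lam):
    # Objective decreased by this iteration; the L1 weight convention (lam rather
    # than lam/2) is an assumption matching the threshold lam/Lf.
    return 0.5 * np.linalg.norm(y_rep - B @ x) ** 2 + lam * np.sum(np.abs(x))

# Synthetic example: 24 representative points out of 64, sparse time-space signal.
n, m = 64, 24
rng = np.random.default_rng(0)
idx = np.sort(rng.choice(n, size=m, replace=False))
F = np.fft.fft(np.eye(n), norm="ortho")
B = F[idx]                                        # equals S_I @ F
x_true = np.zeros(n, dtype=complex)
x_true[[0, 1, n - 1]] = [8.0, 2.0, 2.0]
y_rep = B @ x_true

x_hat = ist(y_rep, B, lam=0.01)
estimated_baseline = F @ x_hat                    # estimated baseline component F x'
```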

(36) Alternatively, a second completion condition may be added to the above-described algorithm. For example, immediately after S26, an index e defined by Equation (5) may be computed, and the algorithm may be completed when the index e becomes a certain value or less.

(37) [Equation 5]
$$e = \frac{\left\| x_{k+1} - x_k \right\|_2^2}{\left\| x_{k+1} \right\|_2^2} \tag{5}$$

(38) The denominator in Equation (5) is the squared L2 norm of the signal model x.sub.k+1 after the update, and the numerator in Equation (5) is the squared L2 norm of a difference between signal models x.sub.k and x.sub.k+1 before and after the update. Alternatively, another condition may be set as the second completion condition.
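The index e can be computed as in the following sketch. The function name relative_change and the handling of an all-zero updated model are assumptions introduced here for illustration.

```python
import numpy as np

def relative_change(x_new, x_old):
    """Index e of Equation (5): squared L2 change relative to the squared L2 norm of x_new."""
    num = np.linalg.norm(x_new - x_old) ** 2
    den = np.linalg.norm(x_new) ** 2
    if den == 0.0:
        # Assumed convention: no change counts as converged, otherwise not converged.
        return 0.0 if num == 0.0 else float("inf")
    return num / den

x_old = np.array([0.0, 0.0])
x_new = np.array([3.0, 4.0])
e_first = relative_change(x_new, x_old)    # 25 / 25
e_second = relative_change(x_new, x_new)   # no change between iterations
```

In an iteration loop, the update would stop once the returned value falls to a chosen tolerance or below.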

(39) Next, with reference to FIGS. 5 and 6, an advantage of the first configuration will be described. FIG. 5 shows a first estimation and removal result, and FIG. 6 shows a second estimation and removal result.

(40) In FIG. 5, reference numeral 40 shows a group of representative points forming the representative portion. In this example configuration, a representative point designation method is employed in place of a section designation method. A number of elements in the spectrum in the frequency axis direction is 1024, and a number of the representative points is 256. These conditions are similarly applicable in the second estimation and removal result shown in FIG. 6.

(41) An NMR spectrum 42 shown by a solid line includes a baseline component which is artificially added at a later timing. A result of estimation according to the first configuration for this spectrum is an estimated baseline component 44 shown by a broken line and serving as an optimum baseline model. A result of subtraction of the estimated baseline component 44 from the NMR spectrum 42 is an NMR spectrum 46 after the pre-process.

(42) In FIG. 6, reference numeral 50 shows a group of elements of the representative portion. In this example configuration also, the designations are made in units of elements. An NMR spectrum 52 shown by a solid line also includes a baseline component which is artificially added at a later timing. A result of estimation related to the first configuration on this spectrum is an estimated baseline component 54 shown by a broken line and serving as an optimum baseline model. A result of subtraction of the estimated baseline component 54 from the NMR spectrum 52 is an NMR spectrum 56 after the pre-process.

(43) In both the first and second estimation and removal results, the baseline component which changes with a relatively long period on the frequency axis is almost completely removed. This is due to the use of the L1 norm in the model fitting for the baseline component, under a presumption that the baseline component has the sparsity on the time axis. In addition, in both the first and second estimation and removal results, the waveform-of-interest is maintained.

(44) FIG. 7 shows an algorithm related to a second configuration. The algorithm is according to the IRLS method. The evaluation value J is defined based on Equation (3) described above. For example, p is 0.75. Alternatively, another numerical value which is less than or equal to 1 may be given as p. According to the IRLS method, the second term in Equation (3) which is difficult to differentiate can be expressed as follows, using a weight matrix W.

(45) [Equation 6]
$$J = \frac{1}{2}\left\| y_I - S_I F x \right\|_2^2 + \frac{\lambda}{2}\left\| W x \right\|_2^2 \tag{6}$$

(46) The second term in Equation (6) described above is a computation formula of the L2 norm, which can be once differentiated and twice differentiated.

(47) In FIG. 7, processes similar to the processes of FIG. 4 are assigned the same reference numerals and will not be described again. In S10a, the spectrum y and the coordinate array I are read, and a value of p is defined. For example, 0.75 is given to p. In S30, numerical values are given to a coefficient ε for preventing division by zero and to the regularization weight λ. The subsequent processes of S12 to S22 are already described above.

(48) In S32, according to the IRLS method, a plurality of weight elements w.sub.i are defined. In S34, a weight matrix W is defined in which the plurality of weight elements are employed as diagonal elements, and all other elements are set to zero. A computation formula shown in S36 is an updating formula of the signal model x. The formula is determined by once differentiating and twice differentiating Equation (6) described above. By repeatedly executing the computation formula shown in S36, an optimum solution of the signal model x which satisfies the condition to minimize the evaluation value J is determined.
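One way to realize such an update is sketched below. The formulas shown in S32 and S36 are not reproduced in the text, so the following are assumptions: the weights are taken as the common IRLS choice with w.sub.i squared equal to (|x.sub.i|.sup.2+ε).sup.p/2−1, and the update is the Newton step obtained from the gradient and the Hessian of the quadratic form ½∥y.sub.I−Bx∥.sub.2.sup.2+(λ/2)∥Wx∥.sub.2.sup.2 with the factor λ/2 absorbed into the weight lam. The function names irls_update and irls are hypothetical.

```python
import numpy as np

def irls_update(x, y_rep, B, p=0.75, lam=1e-3, eps=1e-8):
    """One reweighted least-squares update of the signal model x."""
    # Assumed squared weights: w_i**2 * |x_i|**2 approximates |x_i|**p.
    w2 = (np.abs(x) ** 2 + eps) ** (p / 2.0 - 1.0)
    # Hessian of the quadratic objective; a Newton step from any x lands on
    # the solution of H x = B^H y_I.
    H = B.conj().T @ B + lam * np.diag(w2)
    return np.linalg.solve(H, B.conj().T @ y_rep)

def irls(y_rep, B, p=0.75, lam=1e-3, eps=1e-8, k_max=30):
    x = np.linalg.pinv(B) @ y_rep                 # initial signal model x_0
    for _ in range(k_max):
        x = irls_update(x, y_rep, B, p, lam, eps)
    return x

# Synthetic example with the same layout as a 64-point spectrum.
n, m = 64, 24
rng = np.random.default_rng(0)
idx = np.sort(rng.choice(n, size=m, replace=False))
F = np.fft.fft(np.eye(n), norm="ortho")           # assumed unitary transform matrix
B = F[idx]
x_true = np.zeros(n, dtype=complex)
x_true[[0, 1, n - 1]] = [8.0, 2.0, 2.0]
y_rep = B @ x_true

x_hat = irls(y_rep, B)
```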

(49) FIG. 8 shows an algorithm of a third configuration. This algorithm is based on the underdetermined IRLS method. This method can be used when the number of elements M of the representative portion y.sub.I is smaller than the total number of elements N of the baseline component. For example, p is 0.75. Alternatively, another numerical value less than or equal to 1 may be given as p.

(50) Equations (1-1) and (1-2) described above are respectively rewritten as Equations (7-1) and (7-2) based on the IRLS method and using the weight matrix W. Equation (7-2) is identical to Equation (1-2).
[Equation 7]
min∥Wx∥.sub.2.sup.2  (7-1)
subject to y.sub.I=S.sub.IFx  (7-2)

(51) When the matrix S.sub.IF described above is a wide matrix having fewer rows than columns (that is, underdetermined), Equations (7-1) and (7-2) described above can be rewritten into the following equivalent equations based on the underdetermined IRLS method.

(52) [Equation 8]
$$\min \left\| z \right\|_2^2 \tag{8-1}$$
$$\text{subject to } y_I = S_I F W^{-1} z = B W^{-1} z \tag{8-2}$$

(53) A solution which satisfies this equation is computed by Equation (9).
[Equation 9]
x=W.sup.−1B.sup.H(BW.sup.−1W.sup.−1B.sup.H).sup.−1y.sub.I  (9)

(54) The algorithm shown in FIG. 8 is based on this concept. In FIG. 8, processes identical to the processes shown in FIGS. 4 and 7 are assigned the same reference numerals, and will not be described again.

(55) In S10a, the spectrum y and the coordinate array I are input, and then, in S30a, a predetermined value is given to ε. The processes of S12 to S22, S32, and S34 are identical to the processes already described. The computation formula shown in S38 is an updating formula of the signal model x, and is the same as Equation (9).
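A constrained reweighting loop of this kind can be sketched as follows. This is not a transcription of S38. Here the minimum-norm z of Equations (8-1) and (8-2) is computed in closed form and mapped back to the signal model, assuming the substitution z=Wx, which gives x=W.sup.−1z. The weight formula, the function name irls_underdetermined, and the synthetic data are likewise assumptions.

```python
import numpy as np

def irls_underdetermined(y_rep, B, p=0.75, eps=1e-8, k_max=30):
    """Reweighted minimum-norm iterations that keep y_I = B x at every step."""
    x = np.linalg.pinv(B) @ y_rep                 # initial signal model x_0
    for _ in range(k_max):
        # Diagonal of W^-2 for the assumed weights w_i**2 = (|x_i|**2 + eps)**(p/2 - 1).
        q = (np.abs(x) ** 2 + eps) ** (1.0 - p / 2.0)
        BQ = B * q                                # B @ diag(q) by broadcasting over columns
        # x = W^-2 B^H (B W^-2 B^H)^-1 y_I, i.e. W^-1 times the minimum-norm z.
        x = q * (B.conj().T @ np.linalg.solve(BQ @ B.conj().T, y_rep))
    return x

# Synthetic example: M = 24 representative points, N = 64 elements (M < N).
n, m = 64, 24
rng = np.random.default_rng(0)
idx = np.sort(rng.choice(n, size=m, replace=False))
F = np.fft.fft(np.eye(n), norm="ortho")           # assumed unitary transform matrix
B = F[idx]
x_true = np.zeros(n, dtype=complex)
x_true[[0, 1, n - 1]] = [8.0, 2.0, 2.0]
y_rep = B @ x_true

x_hat = irls_underdetermined(y_rep, B)
```

Because each update solves the constrained problem exactly, the representative portion is reproduced by B @ x_hat up to rounding, and no regularization weight λ is needed in this sketch.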

(56) According to the embodiment described above, the baseline component included in the spectrum can be precisely estimated. In this process, designation of functions and designation of parameters to be given to the function are not necessary. Thus, the baseline component can be precisely estimated without causing a heavy burden for the user. The spectrum analysis can be accurately executed on the spectrum from which the baseline component is removed. The spectrum process described above may be applied to spectra other than the NMR spectrum.