METHOD AND SYSTEM FOR DETERMINING THE CONCENTRATION OF CHEMICAL SPECIES USING NMR
20210063330 ยท 2021-03-04
Inventors
Cpc classification
G01R33/4625
PHYSICS
G01N24/085
PHYSICS
G16C10/00
PHYSICS
International classification
Abstract
Method for determining the concentrations of constituent chemical species in a mixture, including the steps of: using nuclear magnetic resonance spectroscopy, acquiring an NMR measurement for a sample of the mixture; and for each of the constituent chemical species, retrieving a reference model representative of the NMR FID signal or frequency domain spectra from a database. Each model has a number of parameters, and for at least one of the constituent chemical species, the reference model is a quantum mechanical model. The method further includes, using a computer, generating a model signal for the mixture and adjusting some or all of the model parameters to fit the model signal to the measured data; and based on the fitted model signal, calculating and displaying the concentrations of the constituent species in the sample.
Claims
1. A method of determining the concentrations of constituent chemical species in a mixture, comprising: using nuclear magnetic resonance spectroscopy, acquiring an NMR measurement for a sample of the mixture; for each of the constituent chemical species, retrieving a reference model representative of the NMR FID signal or frequency domain spectra from a database, each model having a number of parameters; wherein for at least one of the constituent chemical species, the model is a quantum mechanical model; using a computer, generating a model signal for the mixture and adjusting some or all of the model parameters to fit the model signal to the measured data; and based on the fitted model signal, calculating and displaying the concentrations of the constituent species in the sample.
2. A method as claimed in claim 1, wherein the acquired NMR measurement is obtained using a benchtop-type NMR spectrometer.
3. A method as claimed in claim 1 or 2, wherein the acquired NMR measurement is obtained using an NMR spectrometer of a type having a permanent magnet.
4. A method as claimed in claim 1, wherein the acquired NMR measurement is obtained using an NMR spectrometer with an operating frequency of less than 100 MHz.
5. A method as claimed in any preceding claim, further comprising the step of hierarchically arranging the reference models for the chemical species.
6. A method as claimed in claim 5, wherein the step of hierarchically arranging the reference models comprises forming one or more groups containing multiple constituent species, wherein constituent species within the same group display similar responses to specific experimental conditions; and assigning one or more model parameters at the group level to reduce the overall number of parameters.
7. A method as claimed in any preceding claim, wherein at least one of the constituent chemical species reference models is specified in terms of the transition peaks with parameters found by diagonalization of the spin Hamiltonian.
8. A method as claimed in any preceding claim, wherein at least one of the constituent chemical species reference models is a quantum mechanical model utilising temporal propagation.
9. A method as claimed in any preceding claim, wherein different types of models are used to generate the reference signals of different constituent species, the models each being selected from: a base model of spectra peaks, a quantum mechanical model utilising diagonalization, a quantum mechanical model utilising temporal propagation, and experimental data.
10. A method as claimed in any preceding claim, wherein the reference signal for at least one of the chemical species is independent of the NMR instrument's field strength.
11. A method as claimed in any preceding claim, wherein the mixture contains K chemical species, and the NMR signal (x) for the mixture is a superposition of the reference signals u.sub.k of the constituent chemical species, modelled according to:
12. A method as claimed in claim 11, wherein the model parameters comprise one or more of: chemical shifts of the peaks, relaxation rates, peak intensities, and J-coupling constants.
13. A method as claimed in any preceding claim, wherein the acquired NMR measurement is obtained using single pulse .sup.1H NMR
14. A method as claimed in any preceding claim, the method further comprises the step of using marginal posterior distributions of the intensity estimators to analyse the uncertainties in their calculated values.
15. A method as claimed in claim 14, further comprising the step of using an MCMC algorithm to sample the posterior distribution of the fitted model to estimate the uncertainty of the model parameters.
16. A method as claimed in claim 14 or 15, wherein, along with the step of displaying the concentrations of the constituent species in the sample, the confidence or credible intervals relating to the concentrations are displayed.
17. A method as claimed in claim 16 wherein the confidence or credible intervals are calculated using a robust variance estimator taking into account the residual signal between the model signal and the NMR measurements.
18. A method as claimed in any preceding claim, wherein the step of fitting the model signal, or calculating the concentrations of the constituent species further comprises the step of performing line shape correction.
19. A method as claimed in any preceding claim, wherein the step of fitting the model signal, or calculating the concentrations of the constituent species comprises utilising a generalized least squares (GLS) estimator.
20. A method as claimed in claim 19, wherein the generalized least squares (GLS) estimator treats possible model misspecification as additional non-isotropic noise.
21. A method as claimed in claim 19 or 20, wherein, in the frequency domain, the variance of the noise assumed to be proportional to the absolute value of the derivative of the modelled NMR spectra.
22. A method as claimed in any preceding claim, wherein the mixture comprises sugars.
23. A method as claimed in claim 22, wherein the chemical species comprise one or more of glucose, fructose and sucrose.
24. A method as claimed in claim 22 or 23, wherein the mixture is a fruit juice.
25. A method as claimed in any one of claims 1 to 21, wherein the chemical species comprise one or more alcohols.
26. A system for determining the concentrations of constituent chemical species in a mixture, comprising: a nuclear magnetic resonance (NMR) spectrometer, for acquiring an NMR measurement of a sample of the mixture; a computer storage media comprising a database of NMR FID signal or frequency domain spectra reference models for each constituent species, each model having a number of parameters; wherein for at least one of the constituent chemical species, the model is a quantum mechanical model; a model signal generator configured to generate a model signal for the mixture and adjust some or all of the model parameters to fit the model signal to the measured data, and thereby calculate the concentrations of each of the constituent species in the sample; and a user interface for receiving input commands from a user and for displaying the calculated concentrations of each of the constituent species.
27. A system as claimed in claim 26, wherein the model generator comprises means for combining the reference models for each constituent species to generate the model signal.
28. A system as claimed in claim 26 or 27, wherein the NMR spectrometer is a benchtop-type spectrometer.
29. A system as claimed in any one of claims 26 to 28, wherein the NMR spectrometer of a type having a permanent magnet.
30. A system as claimed in any one of claims 26 to 29, wherein the NMR spectrometer operates at less than 100 MHz.
31. A system as claimed in any one of claims 26 to 30, comprising means for hierarchically arranging the chemical species and their respective reference models.
32. A system as claimed in any one of claims 26 to 31, wherein at least one of the constituent chemical species reference models in the database is specified in terms of the transition peaks with parameters found by diagonalization of the spin Hamiltonian.
33. A method as claimed in any one of claims 26 to 32, wherein at least one of the constituent chemical species reference models in the database is a quantum mechanical model utilising temporal propagation.
34. A system as claimed in any one of claims 26 to 33, wherein the database comprises different types of reference models for different constituent species, the model types being selected from: a base model of spectra peaks, a quantum mechanical model utilising diagonalization, a quantum mechanical model utilising temporal propagation, and experimental data.
35. A system as claimed in any one of claims 26 to 34, wherein at least one of the reference models in the database is independent of the field strength of the NMR spectrometer.
36. A system as claimed in any one of claims 26 to 35, wherein the model signal generator combines the reference signals u.sub.k for a mixture containing K chemical species using superposition, and generates the NMR signal (x) for the mixture according to:
37. A system as claimed in claim 36, wherein the model parameters comprise one or more of: chemical shifts of the peaks, relaxation rates, peak intensities, and J-coupling constants.
38. A system as claimed in any one of claims 26 to 37, further comprising the step of using an MCMC algorithm to sample to posterior distribution of the fitted model to estimate the uncertainty of the model parameters.
39. A system as claimed in any one of claims 26 to 38, along with the step of displaying the concentrations of the constituent species in the sample, the confidence intervals relating to the concentrations are displayed.
40. A non-transient computer readable medium containing program instructions for causing a computer to: upon receiving an NMR measurement for a sample of a mixture of a number of known constituent chemical species, retrieve a reference model representative of the NMR FID signal or frequency domain spectra for each specie from a database, each model having a number of parameters; at least one of the retrieved reference models being a quantum mechanical model; generate a computer readable model signal for the mixture and adjust some or all of the model parameters to fit the model signal to the measured data; and based on the fitted model signal, calculate and display the concentrations of the constituent species in the sample.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0058] The present invention will now be described by way of example only and with reference to the accompanying drawings in which:
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080]
DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT
[0081] Theory
[0082] An NMR signal for a mixture of K chemical species can be modelled as a parametric model x consisting of a superposition of corresponding signature signals u.sub.k, k=1, . . . ,K weighted by the amount of the k.sup.th chemical species in the solution, c.sub.k:
[0083] The sets of model parameters .sub.k determine the appearance of each reference signal;
[0084] they are related to the nature of particular compounds and may include, for example, chemical shifts of the peaks, their relative intensities and widths, as well as values of J-coupling constants. Additionally, the global phase shift .sub.0 and the ringdown delay bear the meaning of the zero- and first-order phasing terms, respectively.
[0085] The weighting coefficients c.sub.k, herein intensity estimators, are directly proportional to the amount of the corresponding species k. Therefore, estimating the weighting coefficients is the main goal of model-based quantification.
[0086] The generalised model above may be defined in the time domain or in the frequency domain by using suitable reference signals u.sub.k specified in the desired domain. Equation 1 does not set any inherent requirements on the reference signals u.sub.k and their parameterization. In general, the resulting model x is a complex valued vector of length N; depending on the chosen domain, it may represent either a free induction decay (FID) signal or the resulting spectrum.
[0087] Multiple approaches are available for modelling u.sub.k for a given chemical species and will be described in due course. For example, it is known to model u.sub.k by fitting a number of separate Lorentzian peaks or using experimental data. However, as discussed in the background section, these approaches have some shortcomings and are limited in their application.
[0088] Alternatively, a quantum mechanical model can be used to define u.sub.k. Such models utilise the characteristic that, even though peaks in an NMR spectrum can move as a response to changing experimental conditions, the spectrum is necessarily determined by the underlying molecular structure. Specifically, in an NMR experiment, one observes transitions between certain quantum states of the studied spin system. The appearance of all P transition peaks can be found based on the spin Hamiltonian of the quantum mechanical (QM) system, and in general can be expressed as some non-explicit function:
{.sub.p, b.sub.p, .sub.p}.sub.p=1.sup.P=f.sup.QM(, J, r) (2)
[0089] This function takes into account shielding effects experienced by different nuclei, and as a resulttheir individual chemical shifts the set of mutual J-coupling constants J,and possibly a relaxation model with rates r that defines the resulting peak widths [1] [2] [3]. Here, bold symbols denote arrays of parameters, with each entry corresponding to a specific nucleus in and r, or pairs of nuclei in J. Typically, due to splitting of resonances of coupled nuclei, the number of peaks observed in the spectrum is much larger than the number of spin chemical shifts and J-coupling constants.
[0090] Adopting the QM model formulation therefore reduces the number of free parameters to fit for improved computational efficiencies. In addition, because a QM model does not make any assumptions about the experimental conditions but rather describes a molecule itself, the same specification of a chemical specie is suitable for modelling NMR experiments with any types of pulse sequences run at any field strength.
[0091] In the example of propanol shown in
[0092] The high-resolution structure of an NMR spectrum in terms of all transition peaks may be obtained using matrix formalism for representation of angular momentum operators and diagonalizing the Hamiltonian H of the spin system directly [4]. H is defined as a sum of the chemical shift (Zeeman) H.sub.Z and coupling terms H.sub.J as:
Where: the indexes s and s run over all spins in the system and I.sub.X; I.sub.Y; I.sub.Z denote their respective Cartesian spin operators; is a gyromagnetic ratio, and B.sub.0 is the magnetic field strength in Tesla [5]. For a system of n spins, the above operators are defined as Kronecker tensor products of series of n .sup.22 matrices:
I.sub.x(1)=.sub.x.Math..sub.0 . . . .sub.0,
I.sub.x(2)=.sub.0.Math..sub.x . . . .sub.0,
I.sub.x(s)=.sub.0.Math..sub.0 . . . .sub.x . . . .sub.0,
with similar equations for I.sub.y and I.sub.z, where .sub.0 and .sub.x, .sub.y and .sub.z are the identity operator and the three Pauli matrices, respectively:
That is, for each s=1, . . . ,n, in the sequence of Kronecker products .sub.0.Math..sub.0.Math. . . . .Math..sub.0, the s.sup.th term is substituted with a certain Pauli matrix resulting in the corresponding 2.sup.n2.sup.n operator, I.sub.x(s), I.sub.y(s), or I.sub.z(s). Each eigenpair (h.sub.q, .sub.q) of the diagonalized Hamiltonian H=.sub.q.sub.qh.sub.qh.sub.q.sup.T corresponds to a certain quantum state from . . . to . . . in the decreasing order of eigenvalues .sub.q for q=1, . . . , 2n. If the transition from state q to state q is allowed, i.e. if an odd number of spins flip their state, the frequency of the corresponding resonance is found as .sub.p=2(B.sub.0+.sub.q.sub.q), and the intensity of the peak is b.sub.p=[h.sub.q.sup.Th.sub.q].sup.2. Finally, a simple isotropic damping model for relaxation is used to assign widths to the resulting transition peaks, assuming that all nuclei in the same spin system relax with equal exponential rates leading to peaks of the same width. Optionally, for example, where high levels of computing power is available or where an isotropic damping model is less appropriate, other relaxation models may be used and the relaxation rates tailored for each transition peak.
Modelling Individual Chemical Species
[0093] There are several distinct ways in which to specify a signature signal u.sub.k for a chemical specie. Table 1 outlines these various approacheseach approach takes a certain set of parameters as the input and results in a model for the NMR signal. Four approaches are detailed below, although other approaches may be possible. The choice of a suitable representation mainly depends on the nature of a particular chemical specie and the given application.
[0094] Referring to option 1 in Table 1, to obtain a modelled FID signal, a first option is to explicitly specify frequencies, intensities, and widths for all peaks in the spectrum. This approach is suited to relatively simple spectra with few peaks. Unless the spectrum contains only singlet peaks, such a model is field specific and not easily transferable to different field strengths. However, this approach can be useful to represent simple commonly occurring components, such as solvents.
[0095] Alternatively, a reference model can be specified in terms of the transition peaks with parameters found by diagonalization of the spin Hamiltonian (model 2), as described above. This approach offers the advantage of working with only a few higher-level parameters, such as chemical shifts and coupling constants attributed directly to the nuclei rather than to the observed peaks. Additionally, this modelling approach is not specific to the instrument field strength, so the same set of parameters can advantageously be used to easily find the transition peaks and generate a reference model for other operating frequencies. This method rapidly increases in complexity with increasing numbers of coupled spins due to the necessity of evaluating the exponential base model for each of the transition peaks, significantly slowing down computations. To improve computation time, it is possible to take advantage or the observation that many transition peaks have very small intensities, and many peaks are clustered together in just a few groups (observed as usual peaks in the spectrum). Therefore, after diagonalization, closely located peaks can be combined and peaks with an intensity lower than a certain threshold can be discarded. It will be apparent that these simplifications will introduce some error, but depending on the thresholds selected, the error is usually negligible, and the process significantly improves computational time.
[0096] Even with such simplifications, for larger spin systems, it is more appropriate to use temporal propagation (model 3 in Table 1). Fast algorithms for temporal propagation have been developed recently to overcome the computational hurdle of Hamiltonian eigen decomposition [3] [6] [7]. Given values of chemical shifts, coupling constants, and a relaxation model, they directly compute the resulting FID and effectively realize a function u=f.sup.tQM (, J, r). Such techniques are capable of handling much larger spin systems of over 40 spins or even 200 spins if state-space approximations are used [2]However these methods may result in suboptimal running times for smaller molecules (e.g. sugars), in which cases the direct diagonalization approach (model 2) is preferred.
[0097] For sufficiently large spin systems, polynomially-scaling propagation algorithms is the viable option to generate, possibly approximate, model signals. A potential downside is that since propagation techniques work by gradually evolving the resulting signal directly in the time domain, they effectively exclude information about separate transition peaks provided by the diagonalization methods, and this data can be valuable for certain applications.
[0098] Finally, a fourth option is to use an empirical signal of a pure component as a reference model. This option utilising experimental data is considered the least desirable approach but may be necessary for modelling very large molecules, whose complexity prevents quantum mechanical simulation or for specifying chemical species with unknown structure and for which spectra of the pure species are available. This method is expected to introduce the largest error into the final quantification results due to inevitable imperfections of phasing and baseline correction of the experimentally obtained reference signal as well as the presence of noise and possible lineshape distortions.
TABLE-US-00001 TABLE 1 Different methods of generating reference model signals . Input parameters Modeled FID 1. Base model global shift {tilde over ()} and rate {tilde over ()} = e
peaks positions u = .sub.p b.sub.pe.sup.[.sup.
chemical shifts , b, = .sup.
chemical shifts u = .sup.
u = .sup.exp (.Math.) 5. Group of species in fixed proportion global shift {tilde over ()} and rate {tilde over ()} = e
u = .sub.jc.sub.ju.sub.j
indicates data missing or illegible when filed
Hierarchical Structure
[0099] Fitting a parametric model x to experimental data is the most computationally demanding step of the quantification procedure, which entails solving a multidimensional non-convex optimization problem. For example, to properly fit a QM model of an isomer of glucose, one needs to accurately estimate seven chemical shifts and possibly also seven J-coupling values. Other monosaccharides require finding similar numbers of parameters. Given a mixture of sugars, when multiple resonances overlap in a small spectral region, simultaneous fitting of all QM parameters becomes practically impossible if the spectral resolution is insufficient (e.g. with benchtop instruments).
[0100] Fortunately, observations indicate that, at least for some species, modelling deviations in individual peaks independently is often not necessary. However, their location as a group on the chemical shift scale may change (particularly when there is no unambiguous reference to set the 0 ppm mark). This suggests that when fitting signal models to datasets of certain chemical mixtures, to account for the move of the entire group of peaks, one may simply offset all chemical shifts in the QM model by the same amount {tilde over ()}. Since such shift deviations are negligible comparing to the absolute resonance frequencies, they do not affect corresponding coupling patterns. Therefore, the computationally demanding QM simulations can be performed only once for some values in the middle of the range, and then the entire resulting spectrum can be shifted accordingly to fit the observations.
[0101] Furthermore, line-broadening caused by the loss of magnetic field homogeneity is likely to affect all peaks in the spectrum as well, and thus can be taken into account by controlling the widths of all peaks simultaneously with a single higher-level parameter.
[0102] If a reference model is generated using any of the methods discussed above, moving its peaks by {tilde over ()} and applying Lorentzian broadening of rate {tilde over ()} can be easily achieved by multiplying the modelled FID with a decaying harmonic signal, =e.sup.[i{tilde over ()}{tilde over ()}]t. Optionally the broadening function may be a non-Lorentzian broadening, such as described later. By extension, one may even combine models of separate chemicals and adjust them simultaneously if desired (item 5 in Table 1). For example, if in a measured spectrum, all peaks of both anomers of glucose are shifted by the same amount with respect to their database values, this deviation is corrected by adjusting a single parameter {tilde over ()} instead of fourteen chemical shifts. Such aggregation significantly reduces the effective dimension of the parameter space and facilitates model fitting. Nevertheless, after fitting these global parameters {tilde over ()} and {tilde over ()}, individual chemical shifts and J-couplings of each spin can be fine tuned if necessary.
[0103] The resulting reference signals u.sub.k defined in Table 1 can now be used to represent separate quantified constituents of the chemical mixture, either a single chemical, or a group of chemicals.
[0104] To allow for multilevel parameter organization, this idea of grouping chemicals can be performed in a hierarchical manner.
[0105] A further exemplary tree structure is shown in
[0106] In the structure in
[0107] As discussed above in relation to Table 1, depending on the chemical specie, different mathematical models may be used for different species. Therefore, different blocks may be parameterised differently depending on the model. For example, a signal from water is more conveniently described with the base model (BM, block 7) of single exponentially decaying sinusoid (time domain) or Lorentzian peak (frequency domain), while proper modelling of spectra of other molecules such as sugars can be performed by diagonalizing the spin system Hamiltonians (dQM, blocks 6 and 8). Moreover, for sufficiently large spin systems of more than 12-13 coupled nuclei, time-dimensional propagation of the quantum mechanical model may be feasible (tQM, block 5). Finally, experimentally obtained spectra can also be used as signature models (EXP, block 9). These different cases are represented by the different signal-generating blocks of four types as leaves in the tree model.
[0108] The second internal signal, a modifier =e.sup.[i{tilde over ()}{tilde over ()}]t, is used solely to move and broaden the modelled peaks. Both signals are parametric functions of time. The specific sets of parameters depend on the type of a block and the represented chemical; in addition to the relative peak position {tilde over ()} and decay rate {tilde over ()} attributed to each block, they may also directly include absolute positions , decay rates , and intensities b of each peak separately or express them as functions of chemical shifts and J-coupling constants as indicated in Table 1 and
[0109] Each block 2, 5, 6, 7, 8, 9 produces two output signals that are passed in one or both directions along the tree to the parent and/or child nodes, if any exist. The output in the upwards direction aggregates signals of all species included in the corresponding subgroup. Given the internal signals and , it is defined as =.sub.c.sub.c where the summation is over the output signals of all node's children. On the other hand, the output signal passed in the downwards direction encompasses shifting and broadening effects determined by all nodes located above. We define it as {hacek over (u)}={hacek over (u)}.sub.p, where {hacek over (u)}.sub.p is the output of the node's parent. Now, a model signature signal for a group of chemical species represented by an intermediate node in the tree can be defined as:
which represents a distinct quantified component of the mixture. Finally, by convention, we set {hacek over (u)}.sub.p=1 for the root node and .sub.c.sub.c=1 for the leaves, and the above equation holds for all nodes in the tree.
[0110] For the exemplary mixture in
[0111] It may be desirable to quantify the chemicals A, B, and C separately. For this, the reference signals u.sub.1, u.sub.2, and u.sub.3 of quantified mixture components are defined according to u={hacek over (u)}.sub.p={hacek over (u)}={hacek over (u)}.sub.p.sub.c .sub.c at the corresponding blocks in the tree. Furthermore, as an example, chemicals D and E may be known to be present in 1:3 ratio, but their total amount is unknown. Therefore, their individual intensities are weighted accordingly (using c=0.25 and c=0.75 respectively), and single reference signal to be used for quantification, u.sub.4, is defined at the Group 2 node.
[0112] Initialising and Fitting the Model
[0113] The above parametric model of an NMR signal for the measured data y can be concisely formulated in a matrix form as:
y=Zce.sup.i.sup.
where Z .sup.NK is the model matrix, whose columns contain ideal responses for each component/group that is, the reference signals, c
.sup.K2 is a vector of intensities, .sub.0 is the phasing term, and n is a normally distributed random vector of noise.
[0114] Typically, good initial values of model parameters (chemical shifts and J-couplings) are known either as a result of previous experiments or can be found in numerous NMR databases. However, to accurately match the reference models with spectra measured at different experimental conditions, often the model parameters require adjustment. For example, chemical shifts of individual peaks may differ slightly from the values reported in databases as a response to varying pH, temperature, concentration or other factors beyond control of the experimentalist.
[0115] As for any model-based quantification method, the objective is to find the estimates of model parameters {circumflex over ()}.sub.1, . . . , {circumflex over ()}.sub.K, {circumflex over ()}.sub.0, and {circumflex over ()} that best explain the measured NMR data y. In turn, this provides a straightforward way for estimating the associated signal intensities .sub.k (equation 1) and the mole fractions
of chemical species. Assuming that the affecting noise n is isotropic Gaussian, the optimal model is obtained by minimizing the Euclidean norm of the residual, yx,
where, in compact matrix notation, Z .sup.NK is a model matrix with columns Z.sub.:,k=u.sub.k(.sub.k, ) for k=1, . . . , K and c=[c.sub.1, . . . , c.sub.K].sup.T is a column vector of intensities. Z implicitly depends on all {circumflex over ()}.sub.K and . Given a fixed model matrix z evaluated with some values of {circumflex over ()}.sub.1, . . . , {circumflex over ()}.sub.K, {circumflex over ()}.sub.0, and {circumflex over ()}, this leads to the ordinary least squares problem with respect to the intensities .sub.k. Following known derivations [8], we obtain a real-valued (ordinary least squares OLS) estimate for the vector of component intensities:
.sub.OLS=Re({circumflex over (Z)}.sup.H{circumflex over (Z)}).sup.1 Re({circumflex over (Z)}.sup.H) (7)
where [.Math.].sup.H denotes the conjugate transpose of a matrix and {circumflex over ()}.sub.0 is an estimate of the global phase shift [9] [10]:
={[
where [.Math.] denotes the argument of a complex number. The sought mole fractions X.sub.k are computed as ratios of the entries of .
[0116] These results assume that suitable values of the underlying model parameters {.sub.k}.sub.k=1.sup.K and are known. In practice, they may need to be estimated as minimizers of the original model fitting problem in Equation (6). After substituting the found estimates and {circumflex over ()}.sub.0 and rearranging its terms, this problem reduces to unconstrained maximization of the variable projection functional (.sub.1, . . . , .sub.K, )=.sup.T|{circumflex over (Z)}.sup.H{circumflex over (Z)}|.
[0117] The derived model fitting solution can be viewed as a special case of a more general Bayesian framework [8] [11], which potentially allows us to incorporate prior information about the fitted parameters into the problem and estimate the uncertainty of the found results. Specifically, the above result can be derived as the maximum likelihood estimator of the component intensities under the assumption that the affecting noise n is circularly symmetric zero-mean complex Gaussian with variance .sup.2. This assumption on the noise distribution is supported by the principle of maximum entropy and the Central Limit Theorem as the observed noise is likely to be contributed by a large number of independent additive sources (e.g. thermal noise). Furthermore, since the Gaussian distribution of the noise is preserved by the Fourier transform, the same optimality conditions hold for models defined in the time as well as frequency domains allowing us to use either Equation (7) or (8) to define the reference models and resulting in the same estimates of intensities in both cases.
[0118] With these assumptions, the variance of the estimate of intensity c.sub.k is found as the scaled diagonal term of the inverse matrix Re({circumflex over (Z)}.sup.H{circumflex over (Z)}).sup.1 [8]. The associated 95% confidence interval is specified as c.sub.k2{tilde over (c)}.sub.k for k-1, . . . K, where is the corresponding standard deviation and can be found as:
{tilde over (c)}.sub.k={square root over ([Re({circumflex over (Z)}.sup.{circumflex over (Z)}).sup.1].sub.kk, )}(9)
[0119] Where [.].sub.kk denotes the k.sup.th diagonal entry of a matrix in the brackets. This definition of uncertainty takes into account the influence of noise and also the peak overlap. Indeed, if spectra of all signals described by the model matrix Z do not overlap, their inner productsthe off-diagonal terms in the matrix Z.sup.HZare 0, leading to a diagonal inverse matrix. In this case, assuming that the model fit is perfect with respect to the parameters, the uncertainty of the estimates is determined solely by the variance of noise .sup.2. However, spectral overlaps and non-zero inner products between different columns of Z raise the diagonal terms in Re({circumflex over (Z)}.sup.H{circumflex over (Z)}).sup.1 and increase the variance of the resulting estimates accordingly.
[0120] The covariance matrix associated with Equation (7) can be expressed as .sub.c.sub.
[0121] As for any model-based approach, the present method is limited in its capacity to account for unforeseen variability in experimental datasets not fully explained by the strict model assumptions. Most notably, deviations of the peaks' lineshapes from the perfect Lorenzian curves has been long known to pose a significant problem to model fitting methods. Such deviations can be caused by various factors, for example, diffusion processes in the sample, in which case Gaussian or Voigt functions have been found to be more faithful representations of the underlying peak lineshapes. On the other hand, inevitable inhomogeneity of the magnetic field across the measured sample volume leads to additional distortions that usually affect all peaks in the spectrum similarly. Finally, weak couplings between distant nuclei, while not easily observed directly, may cause broadening of certain peaks, in which case simple single-resonance models can no longer accurately represent the resulting composite lineshapes. These imperfections do not change the total signal intensity and thus, unless several peaks overlap, they typically remain unnoticed in peak integration. However, even small deviations from the ideal lineshape may have significant effects on the results of model-based quantification.
[0122] Referring to the example spectra in
[0123] Most importantly, the difference between the experimental spectrum and the estimated model, which ideally should be Gaussian noise, exhibits strong residual components. This violates the underlying assumption of the majority of model-based techniques that the error term n is circularly-symmetric Gaussian with zero-mean and fixed variance. As such, this model misspecification biases the estimated intensities and their variances. Assuming heteroscedasticity of the noise, a robust variance estimator can be constructed by taking into account the residual,
.sub.c.sub.
where is a diagonal matrix with entries equal to the squared values of the residual, yZ.sub.OLS. However, this approach does not remove the bias in the intensities estimator .sub.OLS.
[0124] The effects of lineshape imperfections can be partially alleviated by including a lineshape correction mechanism into the model as shown in the
[0125] For this, noise in
.sub.GLS=Re({circumflex over (Z)}.sup.H.sup.1{circumflex over (Z)}).sup.1 Re({circumflex over (Z)}.sup.H.sup.1) (11)
and the covariance matrix .sub.c.sub.
[0126] Model Formulation in the Frequency Domain
[0127] The above described model is expressed in the time-domain, enabling simpler mathematical expression. However, it will be apparent to skilled persons that the model alternatively may be formulated in the frequency domain.
[0128] Assuming that an NMR signal is a superposition of P.sub.k exponentially decaying harmonic components, an FID sampled at times t is defined as:
[0129] The intensity coefficients b.sub.p in front of each component are proportional to the number of corresponding nuclei. The angular frequencies .sub.p=2(f.sub.pf.sub.o) are usually defined with respect to some absolute) reference f.sub.0 offset by the frequencies of chemical shifts of the peaks f.sub.p (in Hz). Finally, the decay rates .sub.p, also expressed in Hz, are related to the full width at half maximum (FWHM) of each peak and the relaxation time T.sub.2* through .sub.p=FWHM=1/T*.sub.w. In this specific example, the triplets of parameters {b.sub.p, .sub.p, .sub.p}.sub.p=1.sup.p constitute the sets .sub.k for each mixture component. However, under the least squares criterion, when the noise is Gaussian, the equivalent frequency-domain formulation is possible. It can be easily obtained via the Fourier transform as is usually done in the traditional spectral analysis; alternatively, one may directly evaluate the Lorentzian function on the frequency grid f:
where t is the sampling interval (dwell time).
[0130] As such, the model matrix z and the associated vectors x and y represent spectra instead of FID signals. This representation leads to the same formulation of the weighting coefficients, but is a preferable approach in cases when one is only interested in analysing only a portion of the spectrum because it is possible to select data only for the frequency range of interest, and thereby reduce the number of peaks/variables to fit.
[0131] Method
[0132]
[0133] A user inputs information identifying the constituent chemical species, 106, 203 for example via a user interface on a laptop or PC, or via an input on the NMR instrument itself. The laptop or PC is preferably arranged to automatically or manually receive outputs from the NMR instrument.
[0134] The user constructs the hierarchical model, depending on the properties of the chemical species present in the sample, assigning each species to a group or sub-group of species in the sample for modelling purposes. Alternatively, this step may be automated and the chemical species analysed and grouped according to predetermined rules. That is, if there are two or more species having peaks that are expected to move together in response to changes in experimental conditions, they may be grouped together. A hierarchical model of the type illustrated in
[0135] In a next step 106, 107, 207 models for each constituent species are pulled from a database 108, 109. As described above, the nature of the model depends on the specific chemical species, but where appropriate, the models are quantum mechanical models that are not specific to the field strength of the NMR instrument. Once model information for the constituent species is loaded, a model signal is generated at a desired Larmor frequency corresponding to the field strength of the NMR instrument (.sub.0= B.sub.0, where B.sub.0 is the field strength).
[0136] In a fitting step 111, 211, the computer takes the experimental measurements from the NMR sample and adjusts the model parameters to fit the model to the measured data using a least squares fit. Adjustment of parameters is carried out iteratively 113, 213 until an acceptable fit is reached, a maximum number of iterations is reached, or until the change in the values of the parameters between iterations is below a threshold value.
[0137] The amount/concentration of each chemical species is then calculated based on the intensity estimators from the resulting model.
[0138] Finally, the resulting model for the NMR signal is analysed at step 115, 215 to estimate the uncertainty of the intensity estimates due to noise and peak overlap, using one or more of the methods described above in relation to Equations 9 to 11.
[0139] Additionally, a Bayesian approach may be utilised. Methods for applying Bayesian statistics to modelling and sampling posterior distributions are known, for example, outlined in [8]. With the Bayesian approach, the joint distribution may be used to analyse the uncertainty in other model parameters such as chemical shifts and/or peak widths. When integrating out the parameters is difficult or not possible, the uncertainty can be estimated using a Markov chain Monte Carlo (MCMC) algorithm, sampling the Bayesian posterior distribution. MCMC sampling is only useful in the cases. This may be useful for characterizing the correctness of the models, for example.
[0140] The amount/concentration of each chemical specie is then output on the user interface along with the calculated uncertainty, which may be expressed as credible intervals at step 115, 215.
[0141]
[0142] Applications
[0143] The present method enables a much wider application of benchtop-type NMR systems than is currently available. Therefore, this method has wide ranging applications where mixture analysis is required but large high-field NMR spectroscopy is impractical.
[0144] Food screening is one such potential application. Glucose, fructose, and sucrosethe three sugars most commonly occurring in fruit juices, wine, soft drinks, and other beverageshave different chemical properties and sweetness. Their relative concentrations (in addition to that of acids) largely determine the taste of a product. Therefore, knowledge of the exact composition of sugars in the mixture is important in the food industry, where it provides a means for process monitoring (e.g. fermentation), quality control, or even detection of adulteration (e.g. undisclosed sweetening of fruit juices or dilution of them with apple juice [12] [13]). High-field NMR spectroscopy has been shown to be particularly effective in these analyses, but these applications are often deemed too challenging for low-resolution benchtop instruments. The presence of multiple isomers of sugar species in relatively low concentrations, whose very complex overlapping spectra are further obstructed by the strong solvent peak are the main obstacles that prevent effective and accurate quantification.
[0145] Other food and beverage related applications include analysing alcohol content, testing for vitamins such as thiamine or pyridoxine, for example, or substances such as caffeine, or simple acids (e.g. malic, citric, maleic, lactic, etc.) and aminoacids.
[0146] The method also has application in forensic science, particularly in testing for certain opioids. Or in the chemical industry, for example to detect CO.sub.2 absorption in amines, or the purity of chemical products.
[0147] In general, the method is suitable for .sup.1H NMR, for any molecule provided that reference values of chemical shifts and J-coupling constant are given or can be obtained from a high resolution dataset. Molecules containing (any number of) spin systems with no more than 12-13 coupled protons each can be modelled with the diagonalization approach, and temporal propagation techniques are used to model larger spin systems. The success of the quantification method for the analysis of a particular mixture critically depends on the number of mixture components, the field strength of the spectrometer, and as a result, the extent of peak overlap. This can be analysed numerically on a case by case basis by considering the properties of the model matrix Z.sup.HZ.
[0148] For .sup.13C NMR spectroscopy for this application where no coupling is observed, the present method still provides advantages. The limiting factor in .sup.13C NMR is the level of noise in the dataset. In this regard, the robustness of the present model-based approach to high levels of noise is important for accurate quantification and the present method compares favourably with traditional peak integration.
[0149] Experimental Results
[0150] Analysis of Sugar Mixtures
[0151] The above described system and method is herein illustrated experimentally for samples containing controlled mixtures of sugars and samples of natural fruit juices. High-field
[0152] NMR spectroscopy has been proven an excellent technique for analysing compositions of various food samples as well as in numerous conformational studies of carbohydrates. However, most sugars have a highly complex .sup.1H NMR spectra, largely precluding the traditional peak integration analysis, especially of data observed on medium-field benchtop instruments. In addition, the water peak is located closely to the spectra of sugars, significantly distorting the baseline and complicating the analysis. Therefore, using the sugar mixtures described herein cannot be readily quantitatively analysed using medium-field benchtop NMR instruments and existing techniques.
[0153] In addition to the relatively high number of coupled protons (6-7 in most monosaccharides), reducing sugars such as glucose and fructose, exist in solutions in several tautomeric forms. For example, - and -glucopyranose, which under normal conditions account for approximately 37.5% and 62.5% respectively of glucose in aqueous solutions [14]. For fructose, four tautomers, namely - and -fructopyranose and - and -fructofuranose, can be observed in solution in considerable amounts (2.67%, 68.23%, 6.24%, 22.35% in tautomeric equilibrium at 20 C. [15]. (We disregard the intermediate open aldehyde form, which typically accounts for only 0.05% of fructose in solution). The tautomeric differences in the molecular structure affect magnetic shielding of even distant protons and thus alter their chemical shifts and J-coupling constants. As a result, these different isomers are distinguished by NMR spectroscopy and thus need to be modelled separately with their own reference models.
[0154] For the below described experiments, samples were measured using two NMR spectrometers: a medium-field Magritek Spinsolve benchtop system operating at a .sup.1H frequency of 43.6 MHz; and on a 400 MHz Agilent 400MR spectrometer equipped with a OneNMR probe. On the benchtop system, .sup.1H FID signals were acquired with 32768 points and dwell time of 200 s using a one-pulse sequence with pulse angle 545 of 90; datasets with variable numbers of scans were acquired with repetition time of up to 60 s. Likewise, on the high-field instrument, 16384 time points were measured with dwell time of 312.5 ps and pulse angle of 45; a coaxial capillary with D.sub.2O was inserted into the sample tube to establish the lock signal for the high-field spectrometer.
[0155] Development and Validation of Reference Models for Sugars
[0156] Using the quantum mechanical description of the spin systems as given in Table 1, reference signals are defined for each isomer. For the purpose of analysing .sup.1H NMR datasets, each isomer of the considered sugars is modelled as a spin system of 7 coupled protons. A molecule of sucrose is regarded as comprising two uncoupled moieties, glucose and fructose, present in equal amounts and model them with separate QM formulations. Therefore, 7 chemical shifts and 6-7 J coupling constants need to be estimated for each spin system to completely specify the model of the isomer. These parameters have been previously assigned using high-field NMR data and are reported in the literature [15] [16] [14]. However, differences in experimental conditions in these sources (for example the presence and amount of D.sub.2O in solutions added to provide a lock signal for the spectrometer) may notably affect the reported chemical shifts and are difficult to account for consistently.
[0157] To avoid these inaccuracies and confirm the parameter values, for each or glucose, fructose, and sucrose, a spectra of 0.5M pure equilibrated sugar solutions in 100% deionized water is acquired on the high field 400 MHz spectrometer. QM model signals are generated for each case using Equation 2, and their parameters are then fine-tuned. The values found for each studied isomer are given in Table 2. These values agree with the assignments reported elsewhere [17] [16] [18] [14] [15].
TABLE-US-00002 TABLE 2 H.sub.1 H.sub.1a H.sub.2 H.sub.3 H.sub.4 H.sub.5 H.sub.6a H.sub.6b .sup.3J.sub.1-2 .sup.3J.sub.1a-1b H.sub.1b .sup.3J.sub.2-3 .sup.3J.sub.3-4 .sup.3J.sub.4-5 .sup.3J.sub.5-6a .sup.3J.sub.5-6b .sup.3J.sub.6a-6b -GP 5.230 3.531 3.709 3.408 3.824 3.840 3.764 3.817 9.732 9.336 10.357 1.870 5.349 12.129 -GP 4.641 3.239 3.483 3.401 3.459 3.896 3.722 8.326 9.158 9.389 9.604 2.001 5.962 12.419 -FF 3.646 3.658 4.105 3.993 4.054 3.817 3.695 12.090 4.220 7.244 3.043 5.508 11.900 -FF 3.590 3.551 4.108 4.105 3.822 3.799 3.674 12.350 8.500 7.572 2.976 6.368 12.660 -FP 3.707 3.650 4.033 3.941 3.863 3.857 3.707 11.680 6.729 2.954 1.574 2.550 12.790 -FP 3.708 3.560 3.792 3.891 3.995 4.017 3.701 11.730 9.978 3.554 1.121 1.753 12.930 Su-F 3.676 3.676 4.211 4.044 3.884 3.828 3.809 12.500 8.751 8.469 2.723 6.952 12.310 Su-G 5.440 3.558 3.756 3.470 3.832 3.816 3.818 3.350 9.997 9.076 10.130 2.244 4.329 14.440
[0158]
[0159] To validate the models for lower-field data, .sup.1H spectra of the same three samples of pure sugars were obtained using the benchtop NMR instrument. Using the proton chemical shifts and J-coupling constants measured using the high field instruments, signal models were generated corresponding to the 43 MHz operating frequency, and fitted to the data by adjusting only the position of the entire group of the peaks and their relaxation rate.
[0160] Instead, the spectra is an ensemble of closely spaced and highly overlapping peaks corresponding to different quantum transitions of these large spin systems.
[0161] NMR spectra obtained using Benchtop instruments typically have lower signal to noise ratios than high field data and are considered to have poorer line shape. However, the line shape imperfections seen in the benchtop NMR data in
[0162] In a next step, the models obtained for each sugar isomer are used to quantify the relative concentrations in solution, both as measured with the high and medium field strength spectrometers. Table 3 compares the results with their reference values found in the literature ([14] for glucose and [15] for fructose) or the ground truth (for sucrose). The error for the medium-field values does not exceed 1% in relative concentrations of isomers and is less than 0.5% with high-field data. The reported 95% credible intervals are computed according to Equation (9) , where G is estimated based on the difference between the measured data and the fitted model. Therefore, in addition to the effect of noise, the estimated uncertainties include the effect of slight model misfit due to possible errors in model parameters. Notably, the estimates obtained at both field strengths have comparable confidence bounds.
TABLE-US-00003 TABLE 3 Relative equilibrium concentrations of sugar isomers in aqueous solutions found using data from 400 MHz spectrometer and 43 MHz benchtop instrument; the estimates are reported along with their 95% credible intervals. Reference 400 MHz 43 MHz -GP 0.375 0.373 0.007 0.379 0.006 -GP 0.625 0.627 0.009 0.621 0.007 -FF 0.062 0.051 0.005 0.053 0.008 -FF 0.224 0.228 0.007 0.241 0.010 -FP 0.027 0.023 0.007 0.019 0.009 -FP 0.682 0.698 0.011 0.688 0.016 Su-F 0.5 0.498 0.006 0.502 0.005 Su-G 0.5 0.502 0.006 0.498 0.005
[0163] The present model-based quantification of sugar mixtures with benchtop NMR, relies on the field invariance property of QM modelsassuming that chemical shifts and J-couplings obtained using well resolved high-field data remain unchanged in different samples.
[0164] To estimate possible errors due to model misfit, we run the following numerical simulation. Using the model values listed in Table 2, we generate synthetic signals for the mixture of glucose isomers taken in equal amounts at 400 and 43 MHz .sup.1H spectrometer frequencies. We then apply the present quantification method, but, to simulate the uncertainty in the experimental parameters, we randomly deviate all chemical shifts in the model with respect to the true values used to generate the data. Interestingly, the same deviation in chemical shift leads to much higher discrepancy between the model and the data at higher operating frequency than at 43 MHz as shown in 12 (i) to (iii). This demonstrates that, all other conditions being equal, model-based quantification of benchtop data is more robust to inaccuracies in chemical shifts of the peaks (e.g. as a result of changing pH) than its higher resolution 400 MHz counterpart. Model misfit errors such as these will give a lower limit on the estimated error that cannot be removed even with improvement in the signal to noise ratio of the measured data, but can be mitigated by additional fitting of underlying model parameters where possible.
[0165] In contrast, the uncertainty in the estimates inevitably increases at lower field strength due to significantly reduced spectral resolution. The effects of peak overlap on the estimation accuracy can be conveniently quantified with Equation (9) assuming fixed variance of noise .sup.2=1. Using datasets simulated at various field strengths,
for mixtures of isomers of glucose and fructose and also for the two components of sucrose modelled separately. The solid lines correspond to estimating relative concentrations of isomers of glucose and fructose as well as the two components of sucrose; the dashed line is for the case of analyzing mixtures of the three sugars (while holding ratios of their isomers fixed in the models. As expected, the analysis of mixtures containing four fructose isomers is a more challenging problem with higher associated uncertainty than the quantification of binary mixtures of glucose isomers or sucrose components. In the latter cases, the non-overlapping peaks of anomeric protons in spectra of glucose located further downfield from the main group of peaks enable more accurate quantification. Importantly, these simulations assume that the true chemical shifts and J-couplings are known. With experimental data, their estimation inevitably contributes to additional errors. Since high-field models are more sensitive to the accuracy of parameters, their resulting uncertainty is comparable to that of the values obtained at 43 MHz, as we observe in Table 3.
[0166] Monitoring Interconversion of Glucose in H.sub.2O
[0167] The ability of NMR spectroscopy to distinguish different structural isomers makes it an important tool for studying kinetics of interconversion reactions of reducing sugars. For example, the glucose isomer a-glucopyranose, upon dissolving in water, undergoes conversion into its [3] form until the equilibrium at approximately 37.5:62.5 in the respective mole fractions is reached [19]. The rate of reaction, determined by fitting a monoexponential model, is highly dependent on experimental conditions, primarily the temperature.
[0168] In high-field NMR spectroscopy, glucose isomers are typically quantified by integrating the peaks of the anomeric proton that occur at 5.2 and 4.6 ppm for - and -forms respectively (see
[0169] To illustrate the suitabilitiy of the present technique, a 0.5M solution of glucose was prepared in 100% deionized H2O, and immediately placed into a sample tube, and transfered to the benchtop NMR instrument. A series of spectra was acquired in 45 s intervals over the course of several hours. The temperature inside the bore of the magnet was kept at approximately 30 C.
[0170]
[0171] Quantification of Controlled Mixtures
[0172] In most applications, the total amount of a specific sugarbut not each of its isomers separatelyis of primary interest.
[0173] Mixtures of solid glucose, fructose, and sucrose were gravimetrically prepared with predefined mole fractions of each constituent and dissolved in 100% deionized H2O. About 4-5 g of each solid mixture was measured and water added to obtain 15 mL of the resulting solution (which, depending on the ratio of sugars, corresponds to molarity of 1 M). The manufacturer-specified purities of the components was over 99.0%, and the balance had an instrument accuracy of 0.1 mg (provided in the calibration protocol of the manufacturer).
[0174] Since the equilibrium concentrations of different isomers c.sub.j are known (see Table 3), we construct a hierarchical model similar to the one shown in
[0175] The results of quantification are summarized in Table 4 and are compared to the relative concentrations estimated gravimetrically. The uncertainties in the gravimetric mole fractions are determined by means of error propagation based on the accuracy of laboratory balances and are expected to be no more than 10-4 mol/mol. The values estimated with the present method are equipped with respective 95% confidence intervals tighter than 0.01 mol/mol. We note that the maximum average deviation between the estimated and gravimetric values is in the order of 0.03 mol/mol. The error here arises primarily due to lineshape imperfections in the measured data and the resulting misfit of model signals. Specifically, accurate estimation of chemical shifts and J-coupling constants of fructose is notably challenging even with well-resolved high-field data due to significant overlaps of the spectra of its multiple isomers. Furthermore, closely located peaks of coupled protonsfor example, H.sub.6a and H.sub.6b in sucrose (see Table 2 above)are extremely sensitive to deviations in their parameter values. These imperfect input models inevitably contribute to the final quantification error. Therefore, Equation 9 is regarded here as a lower bound on the confidence intervals of quantification. More comprehensive analysis of the contribution of uncertainty in specific model parameters to the final accuracy of estimation can be performed in a probabilistic setting.
TABLE-US-00004 TABLE 4 Results of model based quantification of five gravimetrically prepared ternary mixtures of sugars obtained with a benchtop instrument expressed in relative mole fractions, mol/mol. Glucose Fructose Sucrose Avg. diff. S1 grav. 0.332 0.332 0.336 0.012 estim. 0.317 0.35 0.334 S2 grav. 0.168 0.666 0.166 0.019 estim. 0.196 0.639 0.165 S3 grav. 0.666 0.168 0.166 0.031 estim. 0.62 0.211 0.169 S4 grav. 0.168 0.168 0.664 0.012 estim. 0.157 0.186 0.657 S5 grav. 0.417 0.417 0.167 0.008 estim. 0.418 0.427 0.155
[0176] Composition Analysis of Fruit Juices
[0177] Fruit juice samples were prepared fresh and filtered using a Millex syringe filter unit with 0.22 m pore size to reduce turbidity. To avoid altering the natural composition of the samples, there was no pasteurization, or pH equilibration, and no deuterated water was added to the samples. The samples were measured using both a benchtop and a high-field NMR spectrometer.
[0178] An example of acquired spectra on both instruments along with models fitted to them are shown in
[0179] The amounts of glucose, sucrose, and fructose are estimated using the method described above.
[0180] There is strong agreement between data from the high-field instrument (
[0181] A sample of bottled apple juice measured twice approximately six months apart (bottom left corner of the diagrams) shows the same trend towards decreasing of the amount of sucrose as the juice fermented notably. Similar changes occur with the yellow kiwifruit juice: this sample was measured on the high-field spectrometer immediately after preparation but two weeks later with the benchtop instrument. This indicates the potential application of portable benchtop instruments for accurate on-line monitoring of fermentation reactions, which are of great importance in the food industry, using the techniques described herein.
[0182] Finally,
[0183] Analysis of Alcohol Mixture
[0184] Figure B15 is a further illustration of the application of the present quantitative analysis techniques, this time applied to a mixture of alcohols, namely ethanol and propanol.
[0185] Two samples with different relative concentrations of ethanol and propanol were prepared gravimetrically and analysed using the aforementioned benchtop spectrometer operating at 43 MHz frequency. The resulting spectra displayed significant overlap of the peaks of the two chemical species. This, along with the effects of the closely located strong peak of water, make this exemplary alcohol mixture challenging to analyse using conventional peak integration techniques with benchtop NMR data.
[0186] Due to the significant peak overlap of different species, using peak integration, one can only find aggregate integral values for groups of peaks at approximately 3.5 ppm and 0.5 to 2.0 ppm. Then, the sought molar fractions xEth and xpro can be recovered by taking into account different number of protons of ethanol and propanol contributing to each integral (see equations in the Figure). However, difficulties in achieving accurate baseline correction, specifically due to the influence of the water peak, contribute to poor performance of peak integration.
[0187] The present method overcomes this difficulty by using a separate model for the water peak to eliminate the baseline. Moreover, the quantum mechanical formulation of the chemical species accounts for higher order coupling effects most notably seen in the spectrum of propanol. This leads to much better agreement of the estimated mole fractions with the gravimetric values (see the tables in the top left corners of each plot).
[0188] Analysis of Thiamine and Pyridoxine Vitamin Mixture
[0189] As a further example,
[0190] A sample containing vitamins B1 (thiamine), B6 (pyridoxine), and maleic acid in deuterated water is measured using the aforementioned high-field spectrometer operating at 400 MHz .sup.1H frequency. Models of each chemical specie and water are fitted using the above described methods. These models are shown in the various curves of in
[0191] For comparison, the spectra for the mixture was analysed using peak integration Peaks chosen for integration are indicated in
[0192] The relative mole fractions estimated using the present method strongly agree with those of peak integration (see the Table in
[0193] The experimental results presented above are restricted to simple one-pulse .sup.1H experiments and homonuclear couplings, augmenting the set of proton chemical shifts and J-couplings {.sup.H, J.sup.HH} with information about heteronuclear coupling, {.sup.C, J.sup.HC}, allows one to take into account .sup.13C satellites in proton spectra in a principled way or even easily model higher dimensional experiments, such as HSQC, if desired.
[0194] Preferred embodiments of the invention have been described by way of example only and modifications may be made thereto without departing from the scope of the invention.
REFERENCES
[0195] [1] M. Tiainen, P. Soininen and R. Laatikainen, Quantitative Quantum Mechanical Spectral Analysis (qQMSA) of 1H NMR spectra of complex mixtures and biofluids, Journal of Magnetic Resonance, vol. 242, pp. 67-78, 2014. [0196] [2] I. Kuprov, N. Wagner-Rundell and P. J. Hore, Polynomially scaling spin dynamics simulation algorithm based on adaptive state-space restriction, Journal of Magnetic Resonance, vol. 189, pp. 241-250, 2007. [0197] [3] A. M. Castillo, L. Patiny and J. Wist, Fast and accurate algorithm for the simulation of NMR spectra of large spin systems, Journal of Magnetic Resonance, vol. 209, pp. 123-130, 2011. [0198] [4] J. Canavagh, W. J. Fairbrother, A. G. Palmer II, M. Rance and N. J. Skelton, Theoretical Description of NMR Spectroscopy, Protein NMR Spectroscopy 2nd Edit, 2007. [0199] [5] S. A. Smith, W. E. Palke and J. T. Gerig, The Hamiltonians of NMR. part I, Concepts in Magnetic Resonance, vol. 4, pp. 107-144, 4 1992. [0200] [6] H. J. Hogben, M. Krzystyniak, G. T. P. Charnock, P. J. Hore and I. Kuprov, SpinachA software library for simulation of spin dynamics in large spin systems, Journal of Magnetic Resonance, vol. 208, pp. 179-194, 2011. [0201] [7] L. J. Edwards and I. Kuprov, Parallel density matrix propagation in spin dynamics simulations, Journal of Chemical Physics, vol. 136, 2012. [0202] [8] Y. Matviychuk, E. Harbou and D. J. Holland, An experimental validation of a Bayesian model for quantification in NMR spectroscopy, Journal of Magnetic Resonance, vol. 285, pp. 86-100, 12 2017. [0203] [9] M. Bydder, Solution of a complex least squares problem with constrained phase, Linear Algebra and its Applications, vol. 433, pp. 1719-1721, 2010. [0204] [10] I. Markovsky, On the Complex Least Squares Problem with Constrained Phase, SIAM Journal on Matrix Analysis and Applications, vol. 32, pp. 987-992, 2011. [0205] [11] G. L. Bretthorst, Bayesian spectrum analysis and parameter estimation, Springer-Verlag, 1988. [0206] [12] C. W. P. d. S. Grandizoli, F. R. Campos, F. Simonelli and A. Barison, Grape juice quality control by means of 1H NMR spectroscopy and chemometric analyses, Quimica Nova, vol. 37, pp. 1227-1232, 2014. [0207] [13] N. Ogrinc, I. J. Kosir, J. E. Spangenberg and J. Kidrc, The application of NMR and MS methods for detection of adulteration of wine, fruit juices, and olive oil. A review, Analytical and Bioanalytical Chemistry, vol. 376, pp. 424-430, 2003. [0208] [14] M. U. Roslund, P. Thtinen, M. Niemitz and R. Sjoholm, Complete assignments of the 1H and 13C chemical shifts and JH, H coupling constants in NMR spectra of d-glucopyranose and all d-glucopyranosyl-d-glucopyranosides, Carbohydrate Research, vol. 343, pp. 101-112, 2008. [0209] [15] T. Barclay, M. Ginic-Markovic, M. R. Johnston, P. Cooper and N. Petrovsky, Observation of the keto tautomer of D-fructose in D(2)O using (1)H NMR spectroscopy, Carbohydrate Research, vol. 347, pp. 136-141, 2012. [0210] [16] A. De Bruyn and J. Van Loo, The identification by 1H- and 13C-n.m.r. spectroscopy of sucrose, 1-kestose, and neokestose in mixtures present in plant extracts, Carbohydrate Research, vol. 211, pp. 131-136, 1991. [0211] [17] A. De Bruyn, M. Anteunls and G. Vereegge, A 1H-n.m.r. study of D-fructose in D2O., vol. 41, pp. 295-297, 1975. [0212] [18] M. Jaseja, A. S. Perlin and P. Dais, Two-dimensional NMR spectral study of the tautomeric equilibria of D-fructose and related compounds, Magnetic Resonance in Chemistry, vol. 28, pp. 283-289, 1990. [0213] [19] M. Nelson and F. M. Beegle, Mutarotation of Glucose and Fructose, J. Am. Chem. Soc., pp. 559-575, 1919. [0214] [20] United States Department of Agriculture, Agricultural Research Service, USDA Food Composition Databases. [0215] [21] M. Tiainen, Quantitative Quantum Mechanical Analysis of 1 H NMR Spectra.