Systems and Methods for Measuring Concentration of an Analyte
20230017186 · 2023-01-19
Inventors
- Ieva Simonyte (Vilnius, LT)
- Augustinas Vizbaras (Vilnius, LT)
- Tadas Buciunas (Panevezys, LT)
- Arunas Miasojedovas (Vilnius, LT)
- Stephan Heinz Sprengel (Vilnius, LT)
Cpc classification
A61B5/14532
HUMAN NECESSITIES
A61B5/14546
HUMAN NECESSITIES
A61B2560/0247
HUMAN NECESSITIES
A61B5/0075
HUMAN NECESSITIES
A61B2562/0233
HUMAN NECESSITIES
A61B5/1455
HUMAN NECESSITIES
A61B2562/028
HUMAN NECESSITIES
Abstract
Techniques for acquiring and processing data in combination with a photonic sensor system-on-a-chip (SoC) (1) to provide real-time calibrated concentration levels of an analyte (e.g., a constituent molecule within a biological substance) are described. A raw signal (1300) to be analyzed is collected by the sensor chip (1) via diffuse reflectance or transmittance. Determination of the analyte concentration is based on, in part, Beer-Lambert principles and facilitated by applying (2240) scattering correction to the raw signal (1300) prior to decomposition and analysis thereof.
Claims
1. A method for calibrating a sensor for measurement of concentration of an analyte, the method comprising: collecting, using a hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC), a plurality of raw spectra from an object having the analyte; partitioning the plurality of raw spectra according to respective spectral shapes thereof into a set of clusters, each cluster comprising a group of raw spectra; and within each cluster: applying a respective local scattering correction (LSC) to each raw spectrum belonging to the cluster to obtain a group of locally corrected spectra; and deriving, using the locally corrected spectra and gold standard analyte concentration values corresponding to the group of raw spectra belonging to the cluster, a cluster-specific optimized set of pre-processing parameters and a cluster-specific calibration vector.
2. The method of claim 1, wherein deriving the cluster-specific optimized set of pre-processing parameters and the cluster-specific calibration vector for a particular cluster comprises: evaluating each of a plurality of candidate sets of pre-processing parameters, evaluation of a particular candidate set comprising: pre-processing each locally corrected spectrum belonging to the particular cluster using the particular candidate set; deriving a candidate calibration vector by applying multivariate regression calibration to the pre-processed locally corrected spectra and using the gold-standard analyte concentration values corresponding to the group of raw spectra belonging to the particular cluster; and computing a corresponding accuracy measure for the candidate calibration vector via cross-validation; and designating the candidate set and the corresponding candidate calibration vector associated with a maximum accuracy measure as the cluster-specific optimized set of pre-processing parameters and cluster-specific calibration vector, respectively.
3. The method of any preceding claim, wherein: the object comprises tissue; and the analyte comprises at least one of: blood glucose, blood lactate, ethanol, urea, creatinine, troponin, cholesterol, albumin, globulin, ketones-acetone, acetate, hydroxybutyrate, collagen, keratin, or water.
4. The method of any preceding claim, wherein partitioning the plurality of raw spectra according to respective spectral shapes thereof comprises: applying a global scattering correction (GSC) to each of the plurality of raw spectra to obtain a plurality of globally corrected spectra; clustering the plurality of globally corrected spectra according to: (A) a specified number of clusters, or (B) a specified maximum distance of a globally corrected spectrum from a centroid of a cluster, or (C) both a specified number of clusters and a specified maximum distance to a globally corrected spectrum from a centroid of a cluster; and within each cluster, designating to that cluster a respective raw spectrum corresponding to a globally corrected spectrum belonging to the cluster.
5. The method of claim 4, wherein the clustering comprises at least one of: k-means clustering, affinity propagation, or agglomerative clustering.
6. The method of any preceding claim, further comprising: storing in the SoC a GSC reference spectrum.
7. The method of any of claim 4 or claim 5, wherein the global scattering correction comprises global multiplicative scattering correction, global standard normal variate (SNV) correction, Kubelka-Munk correction, Saunderson correction, or global mean centering and normalization correction.
8. The method of any of claim 4 or claim 5, where the local or global scattering correction comprises particle-size difference correction or pathlength-difference correction, each correction comprising Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination thereof.
9. The method of any preceding claim, further comprising: storing in the SoC, for each cluster: (i) a corresponding LSC reference spectrum, (ii) a corresponding calibration vector, and (iii) cluster centroid.
10. The method of claim 9, further comprising: storing in the SoC, for each cluster: (iv) the cluster-specific optimized set of pre-processing parameters.
11. The method of any preceding claim, further comprising: storing in the SoC the optimized set of pre-processing parameters for each cluster.
12. The method of any preceding claim, wherein the local scattering correction comprises local multiplicative scattering correction, local standard normal variate (SNV) correction, Kubelka-Munk correction, Saunderson correction, or local mean centering and normalization correction.
13. The method of any preceding claim, wherein determining the respective spectral shapes of the plurality of raw spectra comprises: pre-processing the plurality of raw spectra by applying thereto a linear transformation and a baseline correction based on a reference spectrum of a selected analyte.
14. The method of claim 13, wherein the pre-processing comprises Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination thereof.
15. A method for measuring concentration of an analyte, the method comprising: obtaining, using a hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC), a raw spectrum from an object having the analyte; identifying from a plurality of clusters of spectra a cluster to which the raw spectrum belongs based on spectral shape of the raw spectrum; applying a local scattering correction (LSC) to the raw spectrum to obtain a locally corrected spectrum; pre-processing the locally corrected spectrum using a cluster-specific optimized set of pre-processing parameters; and multiplying the preprocessed locally corrected spectrum with a cluster-specific calibration vector to obtain a calibrated concentration value for the analyte.
16. The method of claim 15, wherein obtaining the raw spectrum comprises: directing from the SoC to the object electromagnetic radiation (EMR) tunable at a plurality of wavelengths; measuring using the SoC intensities of EMR received from the object at each of the plurality of wavelengths; and converting the intensities into absorbance values, wherein the raw spectrum comprises an absorbance spectrum.
17. The method of claim 16, wherein the plurality of wavelengths are selected from a range 1000 nm-3500 nm or a range 1900-2500 nm.
18. The method of any of claims 15 to 17, wherein: the plurality of clusters of spectra correspond to spectra collected previously using the SoC; and each of the plurality of clusters is represented via a respective LSC reference, cluster centroid and a respective calibration vector, the respective LSC reference, the respective cluster centroid, and the respective calibration vector for each cluster being stored on the SoC.
19. The method of any of claims 15 to 18, wherein identifying from the plurality of clusters of spectra the cluster to which the raw spectrum belongs comprises: deriving a globally corrected spectrum using a global scattering correction (GSC) reference; within each cluster from the plurality of clusters: comparing the globally corrected spectrum with a respective LSC reference to obtain a distance corresponding to that cluster; and selecting a cluster for which the corresponding distance is minimum.
20. The method of claim 19, wherein the global scattering correction comprises global multiplicative scattering correction, global standard normal variate (SNV) correction, Kubelka-Munk correction, Saunderson correction, global mean centering and normalization correction, or a combination thereof.
21. The method of claim 19, where the local or global scattering correction comprises particle-size difference correction or pathlength-difference correction such as Kubelka-Munk, Saunderson correction, multiplicative scattering correction, or a combination thereof.
22. The method of any of claims 15 to 21, wherein the local scattering correction comprises local multiplicative scattering correction, local standard normal variate (SNV) correction, or local mean centering and normalization correction, Kubelka-Munk correction, Saunderson correction, or a combination thereof.
23. The method of any of claims 15 to 22, wherein determining the spectral shape of the raw spectrum comprises: pre-processing the raw spectrum by applying thereto a linear transformation and a baseline correction based on a reference spectrum of a selected analyte.
24. The method of claim 23, wherein the pre-processing comprises Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination thereof.
25. A system for measuring concentration of an analyte, comprising: a hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC) for obtaining a raw spectrum from an object having the analyte; and a processing unit, comprising a processor and memory, and configured to: obtain, using the hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC), a raw spectrum from an object having the analyte; identify from a plurality of clusters of spectra a cluster to which the raw spectrum belongs based on spectral shape of the raw spectrum; apply a local scattering correction (LSC) to the raw spectrum to obtain a locally corrected spectrum; preprocess the locally corrected spectrum using a cluster-specific optimized set of pre-processing parameters; and multiply the preprocessed locally corrected spectrum with a cluster-specific calibration vector to obtain a calibrated concentration value for the analyte.
26. The system of claim 25, wherein: to obtain the raw spectrum, the SoC is configured to: direct to the object electromagnetic radiation (EMR) tunable at a plurality of wavelengths; and measure intensities of EMR received from the object at each of the plurality of wavelengths; and the processor is programmed to convert the intensities into absorbance values, wherein the raw spectrum comprises an absorbance spectrum.
27. The system of claim 26, wherein the plurality of wavelengths comprises a range 1000 nm-3500 nm or a range 1900-2500 nm.
28. The system of any of claims 25 to 27, wherein: the plurality of clusters of spectra correspond to spectra collected previously using the SoC; each of the plurality of clusters is represented via a respective LSC reference, a respective cluster centroid, and a respective calibration vector; and the SoC comprises memory for storing, for each cluster, the respective LSC reference, the respective cluster centroid, and the respective calibration vector.
29. The system of any of claims 25 to 28, wherein the SoC comprises memory for storing the optimized set of pre-processing parameters for each cluster.
30. The system of any of claims 25 to 29, wherein to identify from the plurality of clusters of spectra the cluster to which the raw spectrum belongs, the processor is programmed to: derive a globally corrected spectrum using a global scattering correction (GSC) reference; within each cluster from the plurality of clusters: compare the globally corrected spectrum with a respective LSC reference to obtain a distance corresponding to that cluster; and select a cluster for which the corresponding distance is minimum.
31. The system of claim 30, wherein the global scattering correction comprises global multiplicative scattering correction, global standard normal variate (SNV) correction, Kubelka-Munk correction, Saunderson correction, or global mean centering and normalization correction.
32. The system of claim 30, where the local or global scattering correction comprises particle-size difference correction or pathlength-difference correction, each correction comprising Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination thereof.
33. The system of any of claims 25 to 32, wherein the local scattering correction comprises local multiplicative scattering correction, local standard normal variate (SNV) correction, Kubelka-Munk correction, Saunderson correction, or local mean centering and normalization correction or a combination thereof.
34. The system of any of claims 25 to 33, wherein the SoC comprises: a wavelength shift tracker to track a shift in wavelength of radiation emitted by the SoC, a wavelength tracker to track absolute wavelength of the radiation emitted by the SoC; a temperature sensor to measure the temperature of the SoC; and an SoC output power monitor to monitor the intensity of the EMR emitted by the SoC during a wavelength sweep.
35. The system of any of claims 25 to 34, wherein to determine the respective spectral shapes of the plurality of raw spectra, the processing unit is configured to: pre-process the plurality of raw spectra by applying thereto a linear transformation and a baseline correction based on a reference spectrum of a selected analyte.
36. The system of claim 35, wherein while performing the pre-processing, the processing unit is configured to apply Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination thereof.
Description
DETAILED DESCRIPTION
[0042] Optical remote sensing is a developed technique for a broad range of applications. Sensing can be performed as a form of ranging—i.e., measuring distance by means of time-of-flight or frequency modulated continuous wave (FMCW) technique, or sensing can be performed to remotely detect, identify, and quantify the presence or absence of one or more molecules within an object by spectroscopic sensing.
[0043] The term spectroscopic sensing, as used herein, refers to deployment of a hybrid III-V/IV semiconductor photonic system-on-a-chip (P-SoC), which emits wavelength-tunable laser radiation and is in communication with a remote target object. The wavelength change and absolute value are monitored and accounted for within every sweep, such that the SoC is autocalibrated in terms of absolute wavelength, wavelength shift, and power spectrum.
[0044] The light impinges on the object and penetrates to a certain depth, defined by the optical length, which depends on the individual specificity of the object, such as its scattering matrix, content, etc. For example, using tunable laser radiation in the 1900-2500 nm spectral region to perform a transdermal sensing experiment with a living object, the light penetrates up to about 1 mm below the skin surface, where it is scattered and partially absorbed by the tissue, blood, and interstitial fluid. Such absorption is molecule-specific, and each constituent molecule modifies the light spectrum with a unique spectral absorption signature. After interaction with the object, the transmitted, scattered, or reflected light is collected and detected with a photodetector.
[0045] A schematic block diagram describing an embodiment of the invention is shown in
[0046] In the illustrated embodiment, the hybrid III-V/IV semiconductor chip 1 includes a hybrid III-V/IV external cavity laser 100, which emits swept-wavelength laser radiation via an optical path 10. A portion of the beam is split off and fed into the wavelength shift tracker 120 via optical path 11, the absolute wavelength reference 130 via optical path 14, the laser power curve monitoring block 140 via optical path 17, and the output section via optical path 19. Chip 1 may also include a temperature sensor 110 for sensing the temperature of the chip, which in turn can be used for absolute wavelength reference calibration.
[0047] Wavelength shift tracker 120 can be any type of non-balanced interferometer, such as Mach-Zehnder, Michelson, Fabry-Perot, etc. A non-balanced interferometer provides a beat signal at the output of the tracker 120 via the optical path 12, and the photodetector block 121 registers an oscillating signal whose oscillation period depends on the optical path difference within the interferometer and on the wavelength. The optical path difference is defined by the design and is a known parameter. The wavelength shift value can thus be extracted if the absolute value of the wavelength at any given moment is known. This is provided by the absolute wavelength reference block 130, coupled to monitoring photodetector 131 via optical path 15. The absolute wavelength reference can be a distributed Bragg reflector (DBR), micro-ring resonator (MRR), distributed feedback grating (DFB), or any other optical cavity structure with an unambiguous characteristic transmission or reflectance feature within the spectral region covered by the sweep of the hybrid laser 100. In this way, the photodetector blocks 121 and 131 cooperatively provide information about the absolute wavelength value and the wavelength shift value at any given moment within the sweep.
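As a rough illustration of how the two monitoring channels can be combined, the sketch below converts a fringe count from the wavelength shift tracker into a wavelength, starting from the absolute reference. It assumes an idealized, dispersion-free interferometer whose phase is 2π·OPD/λ, so that each full beat period changes 1/λ by 1/OPD. The function names and the 1 mm optical path difference are illustrative assumptions, not values taken from this description.

```python
def fringe_spacing_nm(wavelength_nm, opd_nm):
    """Approximate wavelength change per beat period: lambda^2 / OPD."""
    return wavelength_nm ** 2 / opd_nm


def wavelength_after_fringes(reference_nm, opd_nm, fringes):
    """Wavelength after `fringes` beat periods counted from the absolute reference.

    Assumes the sweep moves toward longer wavelengths; pass a negative
    count for a sweep toward shorter wavelengths.
    """
    inverse_wavelength = 1.0 / reference_nm - fringes / opd_nm
    return 1.0 / inverse_wavelength


if __name__ == "__main__":
    opd_nm = 1.0e6  # assumed optical path difference of 1 mm
    print(fringe_spacing_nm(2000.0, opd_nm))
    print(wavelength_after_fringes(2000.0, opd_nm, 10))
```

Under these assumptions a 1 mm path difference gives a fringe spacing of about 4 nm near 2000 nm, which is in the range quoted for liquid-phase sensing; a longer path difference would give finer spacing.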
[0048] Tracking the wavelength shift and the absolute wavelength value is often necessary in order to decouple system effects from object-related effects. For example, the emission wavelength might change in a non-linear manner on the system side, and thus, without precise knowledge of the wavelength shift and the absolute wavelength value, it can be difficult to convert the signal from the time domain to the wavelength (or frequency) domain. Another aspect is that the collected spectra will change due to changes on the object side, such as water displacement due to temperature or changes in other strong baseline contributors. Without knowing the system output at all times, it is impossible to determine whether the collected spectrum from the object is shifted due to changes at the output of the system or due to changes within the object. Therefore, tracking the wavelength shift and absolute wavelength within every sweep allows one to decouple the system-specific modulation of the collected spectra from the object-specific modulation, the latter being the useful signal.
[0049] In practical cases, target molecules such as glucose, lactate, ethanol, etc. have concentrations that are very small compared to the main baseline contributors, which in the case of transdermal sensing are the main proteins (collagen, albumin, keratin) and water. These main contributors provide a signal that is 10,000 or more times stronger than that of the target molecules, and thus a small change in water displacement due to a temperature effect can lead to a baseline change that, if unnoticed, can smear out any useful signal attributable to glucose. Therefore, the ability to track the wavelength shift and absolute value within every sweep makes it possible to track the baseline change within every sweep.
[0050] The wavelength shift may be monitored as the beat signal during the sweep, whereas the absolute value is measured once per sweep, and the information from both the wavelength shift and the absolute wavelength is used to calibrate the recorded information immediately after the sweep is complete. The accuracy of the wavelength shift determination depends on the system design, such as the optical path difference within the wavelength shift tracker, which in turn provides the beat signal. In a practical scenario, the required accuracy depends on the finesse of the absorption feature of the target molecular species within the object. Where the object is a biological substance and the molecules are in the liquid phase, which is characterized by very broad spectroscopic signatures, the wavelength shift tracker can have an accuracy of 0.1 nm to a few nm, with 3-5 nm being a typical value.
[0051] In the case of gas sensing, where the absorption line width of interest can be in the range of 100 MHz, the wavelength shift tracking needs to be designed for better resolution, and the absolute wavelength reference needs to provide the absolute wavelength with sufficiently high resolution. In practical cases, this can be achieved with very good accuracy. For instance, typical group-IV semiconductor fabrication technologies rely on node sizes as low as 160 nm or even down to 7 nm, which is up to roughly three orders of magnitude smaller than a typical emission wavelength. The time duration of one sweep is defined by the system architecture and can last from minutes, when tuning is performed by mechanical motion of the tuning element, to a few microseconds, if the tuning is electronic. In a practical case for a hybrid III-V/IV sensor chip, the sweep rate can be from a few tens of Hz up to the MHz range, depending on the actual system design and the application requirements.
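To put the 100 MHz gas line width on the same scale as the nanometer-level accuracy quoted for liquid-phase sensing, the following sketch converts a frequency width to a wavelength width using Δλ = λ²Δν/c. The 2000 nm center wavelength is an assumption chosen only for illustration, and the function name is hypothetical.

```python
SPEED_OF_LIGHT_M_S = 299_792_458.0


def linewidth_nm(center_wavelength_nm, linewidth_hz):
    """Convert a spectral width in Hz to nm using d_lambda = lambda^2 * d_nu / c."""
    wavelength_m = center_wavelength_nm * 1e-9
    width_m = wavelength_m ** 2 * linewidth_hz / SPEED_OF_LIGHT_M_S
    return width_m * 1e9  # metres back to nanometres


if __name__ == "__main__":
    # Assumed example: a 100 MHz wide line centred near 2000 nm.
    print(linewidth_nm(2000.0, 100e6))
```

At the assumed center wavelength the result is on the order of a picometer, roughly three orders of magnitude finer than the 3-5 nm typical for broad liquid-phase signatures.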
[0052] Depending on the sensor design and the requirement for spectral bandwidth coverage, a single sweep can contain from several tens to several hundreds of discrete wavelengths. A typical practical case for transdermal glucose sensing requires around 100 or more discrete wavelengths to perform accurate prediction. Based on existing state-of-the-art widely tunable (swept-wavelength) laser concepts, the sweep can be almost continuous when the Vernier filter is operated in combination with phase control. In some embodiments, the absolute value of the wavelength of emission is tuned within a specified range, e.g., 1000 to 3000 nm, 1900 nm to 2500 nm, etc. Thus, the tuned value of the wavelength of emission at a particular time may be 1898 nm, 1905 nm, etc. The corresponding wavelength shift can be 1 nm, 2 nm, 10 nm, etc.
[0053] The EMR received from the medium of interest is converted from the optical domain into electrical signals within the photodetectors 121 and 131, and the electrical signal from each photodetector is routed via electrical paths 13 and 16 to the electrical path 30, which connects to the drive and control electronics block 2 and to the analog-to-digital converter (ADC) and amplifier block 210 therein. Here, the analog signal from the photonic chip is amplified and digitized. The digitized signal is fed into CPU 220, which performs signal filtering, averaging, and other processing. The CPU 220 contains a memory block with a calibration model. This calibration model is applied to the collected data to retrieve a calibrated concentration level value, which is then fed to an output port, e.g., the display 240 via electrical route 39. Another function of the CPU 220 is to provide control signals to the driver and digital-to-analog converter (DAC) block 230 via path 38, which in turn provides control and drive signals to the SoC via path 40. The entire sensor system is powered by the power supply 200 via electrical bus 31, 32, 33, 34, 35, 36.
[0054] A simplified version of the sensor system of
[0055] When deployed in the field, the photonic sensor on a chip 1 sends a wavelength tunable signal to a remote object 3 via optical path 20. The intensity I of the signal can be represented as an arbitrary function of frequency ω (or wavelength) and time t:
I=ƒ(ω,t) (1)
[0056] The light interacts with the object 3, and undergoes numerous scattering and absorption events within the object. A portion of scattered and diffusely reflected light is collected with signal photodetector 150 via the optical path 21. This light signal intensity can be represented by a frequency and time function I′:
I′=ƒ′(ω,t) (2)
[0057] This signal is modulated due to the interaction with the object and carries object-specific information such as the concentration level of constituent elements. The latter can be evaluated as absorbance A, which can be represented as a linear superposition of individual absorbances $A_i$:
$A = \sum_i A_i = \sum_i \varepsilon_i(\omega)\, c_i\, l$ (3)
Here, $\varepsilon_i(\omega)$ is the frequency-dependent molar absorptivity of constituent i, $c_i$ is the molar concentration of constituent i, and l is the effective optical length within the object.
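The linear superposition described above can be sketched numerically as follows. This is a minimal forward model only: the absorptivity values, concentrations, and optical length are made-up numbers for illustration, and the function name is hypothetical.

```python
def mixture_absorbance(absorptivities, concentrations, path_length):
    """Total absorbance per wavelength: A = l * sum_i(eps_i * c_i).

    absorptivities: one list per constituent, one value per wavelength.
    concentrations: one molar concentration per constituent.
    """
    n_wavelengths = len(absorptivities[0])
    return [
        path_length * sum(eps[j] * c for eps, c in zip(absorptivities, concentrations))
        for j in range(n_wavelengths)
    ]


if __name__ == "__main__":
    # Two hypothetical constituents sampled at two wavelengths.
    eps = [[1.0, 2.0], [3.0, 4.0]]
    print(mixture_absorbance(eps, [0.5, 0.1], path_length=2.0))
```

Because the model is linear in the concentrations, a measured spectrum with at least as many wavelengths as constituents can in principle be decomposed back into individual contributions, which is what makes the sensing approach possible.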
[0058] In a practical case, where the object is a living body, the individual absorbance contributions can be represented as contributions by different constituent elements such as for example: 1—keratin, 2—glucose, 3—lactate, 4—urea, 5—collagen, etc. This provides a path of elemental decomposition of complex matrix and thus offers a possibility for sensing. The procedure for collecting and processing data and deriving calibrated concentration values is shown in the form of a block diagram in
[0059] The basic operation method for performing sensing includes first using the calibration algorithm in combination with the hardware to create a calibration model and store it in the memory of the CPU. This model can be considered universal and be deployed with every sensor in the field without needing to modify it during use. The next step is to then use the sensing algorithm in accordance with
[0060] According to an embodiment of the invention, when deployed in the sensing configuration, the photonic system on a chip provides several output channels that contain information about the state of the photonic chip, such as wavelength shift value via photodetector 121, absolute wavelength reference value via photodetector 131, laser intensity curve via laser power curve monitoring block 140, and/or the reflected signal containing object-specific information via signal photodetector 150. These electrical signals are routed to the control and signal processing electronics block 2. Here, the signals are fed into the analog-digital-converter and amplifier block 210.
System Calibration for Analyte Measurement
[0061] The algorithm for processing the acquired analog signals received from the photonic SoC 1 starts by first amplifying and digitizing the received signal in the ADC and amplifier block 210. At this phase the signals are still processed as time domain signals. These amplified and digitized signals are then fed to the central processing unit (CPU) 220, where the object-specific signal 22 is processed and converted from time-domain into the frequency-domain using the information of wavelength shift received via electrical path 13, and absolute wavelength calibration received via electrical path 16, and is normalized with regard to the laser power curve received via electrical path 18. This procedure allows first having the signals in the frequency domain and also addresses the system-related nonlinearities, to further process the signal that primarily carries object-specific data, indicated as step 2210 in
[0062] Multiple spectra are collected, averaged, and filtered to reduce noise. For example, in
[0063] Referring to
[0064] As illustrated, the global MSC corrected data is only used for assigning raw spectra to each cluster. Thus, the assigned cluster contains raw or unprocessed data. Clustering can be performed in a number of ways. Two possible paths are shown in
[0065] Once the clustering is complete, an individual calibration model within each cluster is created at step 2280. An individual calibration model assigns a calibrated concentration level value to every spectrum within each cluster as measured by the gold standard as indicated. This set of calibration models is then stored in the CPU memory next to the MSC reference vector in step 2300.
[0066] The algorithm for constructing an individual calibration model 2280 is depicted in
[0067] The local reference from 2281 is then used to construct a partial least squares (PLS) model within each cluster and to obtain optimal model parameters, such as noise filtering parameters, derivative order, and number of PLS latent vectors, using a cross-validation method within step 2282. The task yields an optimal set of data preprocessing parameters 2283 that are then applied to every cluster containing raw spectra to construct individual calibration model 2284. In other words, within each cluster, the raw spectra are modified using the local scattering correction reference. This ensures that all data is treated in the same manner, with the same set of parameters. A calibration model then assigns the calibrated concentration level(s) of the analyte(s) of interest, as measured by a selected reference technique (also called the gold standard), to each locally corrected spectrum. The calibration model maps the absorbance represented by a spectrum at a particular wavelength to an analyte concentration level. Referring to
[0068] $c = w_1 A_1 + w_2 A_2 + \dots + w_n A_n$, where n is the number of wavelengths in the spectrum. In some cases, when the sample is associated with a relatively simple scattering matrix, and when the sample includes fewer constituents, a reasonable concentration prediction can be obtained simply by preprocessing the spectral data obtained from the sample to correct for the nonlinear effects of scattering using Kubelka-Munk correction, MSC, Saunderson correction, or a combination thereof, and then by removing the baseline to obtain the spectrum of a constituent of interest. For greater accuracy, and especially for more complex samples such as biological tissue, scattering correction (or linearizing transformation) may be used in combination with multivariate linear regression such as PLS or similar.
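The weighted sum above amounts to a dot product between a weight vector and an absorbance spectrum. A minimal sketch follows, with a hypothetical function name and made-up weights; in practice the weights would come from the regression step.

```python
def predict_concentration(weights, absorbances):
    """Return c = w_1*A_1 + ... + w_n*A_n for one preprocessed spectrum."""
    if len(weights) != len(absorbances):
        raise ValueError("weights and absorbances must have the same length")
    return sum(w * a for w, a in zip(weights, absorbances))


if __name__ == "__main__":
    # Illustrative numbers only: three wavelengths, three weights.
    print(predict_concentration([0.5, -1.0, 2.0], [2.0, 1.0, 0.25]))
```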
[0069] In general, during calibration, EMR is directed to a sample (also called a medium), where the EMR is swept through a range of wavelengths. In response, EMR is received from the sample, where the received EMR is diffusely reflected by or transmitted through the sample. The received EMR, having components at different wavelengths, is converted into a raw absorbance spectrum (also called a raw spectrum). This process may be repeated several times to obtain a number of raw spectra, which are then averaged to obtain an averaged raw spectrum. In the discussion below, we omit the term "averaged" for simplicity. These raw spectra may be denoted $X_i^{\mathrm{raw}}$, where the index i denotes the respective averaged raw samples and can range from 1 to M, where M can be any number such as 50; 100; 2000; 10,000, or more. The above-described process is repeated at different times, on different regions of the same sample, or on different samples, since the analyte concentration may differ over time, between regions of the same sample, and between samples.
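The conversion and averaging steps described above might look like the sketch below. It assumes that absorbance is computed as the negative base-10 logarithm of received intensity over emitted intensity at each wavelength, with the emitted intensity taken from a power monitor; the description does not spell out this formula, and the function names are hypothetical.

```python
import math


def to_absorbance(received, emitted):
    """Per-wavelength absorbance, assuming A = -log10(received / emitted)."""
    return [-math.log10(r / e) for r, e in zip(received, emitted)]


def average_spectra(spectra):
    """Average several sweeps wavelength by wavelength to reduce noise."""
    count = len(spectra)
    return [sum(column) / count for column in zip(*spectra)]


if __name__ == "__main__":
    # Two illustrative sweeps over two wavelengths.
    sweeps = [
        to_absorbance([10.0, 1.0], [100.0, 100.0]),
        to_absorbance([12.0, 0.9], [100.0, 100.0]),
    ]
    print(average_spectra(sweeps))
```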
[0070] Scattering correction (MSC, Kubelka-Munk correction, Saunderson correction, etc.) is then applied to the raw spectra $X_i^{\mathrm{raw}}$ to obtain a global reference, denoted $X_{\mathrm{ref}}^{G}$, and globally corrected spectra $X_i^{GC}$. The global reference $X_{\mathrm{ref}}^{G}$ is stored in the memory. Clustering is then performed using the globally corrected spectra $X_i^{GC}$ to identify N clusters. The number N (e.g., 4, 5, 6, 10, etc.) may either be specified for the clustering operation or, alternatively, the clustering itself may determine the best N. For each $X_i^{GC}$, a corresponding cluster $C_k$, $k \in [1, N]$, is identified and, thereafter, the corresponding raw spectrum $X_i^{\mathrm{raw}}$ is designated to the same cluster. After clustering, the optimal number of clusters, the cluster centroids, and the maximum allowable distance to the cluster centroids are stored in the memory to be used for the sensing function.
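The global correction and clustering flow described above can be sketched in plain Python as follows. Several choices here are assumptions made for illustration rather than details from this description: the global reference is taken as the mean raw spectrum, MSC is implemented as a per-spectrum least-squares fit against that reference, and clustering uses a basic k-means with a fixed number of clusters and deterministic initialization. All function names are hypothetical.

```python
def mean_spectrum(spectra):
    """Wavelength-wise mean of a list of spectra."""
    count = len(spectra)
    return [sum(column) / count for column in zip(*spectra)]


def msc(spectrum, reference):
    """Fit spectrum ~ a + b * reference, then return (spectrum - a) / b.

    Assumes the spectrum is correlated with the reference (b != 0).
    """
    n = len(reference)
    mean_ref = sum(reference) / n
    mean_spec = sum(spectrum) / n
    sxx = sum((r - mean_ref) ** 2 for r in reference)
    sxy = sum((r - mean_ref) * (s - mean_spec) for r, s in zip(reference, spectrum))
    b = sxy / sxx
    a = mean_spec - b * mean_ref
    return [(s - a) / b for s in spectrum]


def sq_dist(u, v):
    """Squared Euclidean distance between two spectra."""
    return sum((p - q) ** 2 for p, q in zip(u, v))


def kmeans(points, k, iterations=50):
    """Basic k-means; initial centroids are evenly spaced input points."""
    centroids = [list(points[i * len(points) // k]) for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        labels = [min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points]
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            updated.append(mean_spectrum(members) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return labels, centroids


def cluster_raw_spectra(raw_spectra, k):
    """Cluster on globally corrected spectra, but store the RAW spectra per cluster."""
    global_ref = mean_spectrum(raw_spectra)
    corrected = [msc(x, global_ref) for x in raw_spectra]
    labels, centroids = kmeans(corrected, k)
    clusters = {c: [] for c in range(k)}
    for raw, lab in zip(raw_spectra, labels):
        clusters[lab].append(raw)
    return global_ref, centroids, clusters
```

Note that the corrected spectra are used only to decide cluster membership; each cluster ends up holding the uncorrected spectra, so a separate local correction can be applied afterwards.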
[0071] Once all the raw spectra are designated to their respective clusters, within each cluster the above-described process is repeated. Specifically, scattering correction is applied to the raw spectra X.sub.i.sup.raw within a particular cluster k, to obtain a local reference denoted X.sub.ref.sup.L.sup.
[0072] Recall that the different raw spectra X.sub.i.sup.raw may correspond to different levels of analyte concentration. These concentration levels, denoted .sub.i, are obtained from the samples using a selected gold-standard technique. Finally, a calibration vector V.sup.k is generated for each cluster k via multivariate linear regression calibration. The calibration vector V.sup.k, local reference X.sub.ref.sup.L.sup.
[0073] One example process for obtaining an optimal data preprocessing set is as follows:
[0074] 1. Within a cluster, apply signal smoothing (noise filtering) to the locally corrected spectra, using an iteratively selected filter and filter degree (e.g., Savitzky-Golay, Fourier transform filter, percentile, moving average). Additionally, 1.sup.st or 2.sup.nd order derivative baseline removal may also be applied.
[0075] 2. Randomly divide the locally corrected and preprocessed spectra and the corresponding concentrations into training and test sets.
[0076] 3. Apply a multivariate regression calibration algorithm to the training set and, after the model is trained, predict concentrations for the test set and evaluate the prediction accuracy.
[0077] 4. Repeat steps 2 and 3 a number of times (e.g., n iterations), in a process called cross-validation, to obtain an average prediction accuracy for the current data preprocessing set.
[0078] Steps 1-4 may be repeated with different sets of parameters selected in step 1. The optimal set of parameters is the set that results in the best average prediction accuracy.
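Steps 1-4 can be sketched as a small parameter search. The following is an illustration under stated assumptions, not the claimed implementation: a moving-average filter stands in for the smoothing step, a ridge-regularized linear regression stands in for the multivariate calibration algorithm, and prediction accuracy is scored as the root mean square error on the test set. All function names, parameter values, and the synthetic data are hypothetical.

```python
import numpy as np

def moving_average(spectra, window):
    """Step 1: smooth each spectrum with a centered moving average."""
    kernel = np.ones(window) / window
    return np.array([np.convolve(row, kernel, mode="same") for row in spectra])

def cv_rmse(X, y, n_iter=20, test_frac=0.25, ridge=1e-3, seed=0):
    """Steps 2-4: average test RMSE over repeated random train/test splits."""
    rng = np.random.default_rng(seed)  # same seed gives the same splits per candidate
    n = len(y)
    n_test = max(1, int(round(test_frac * n)))
    errors = []
    for _ in range(n_iter):
        perm = rng.permutation(n)
        test, train = perm[:n_test], perm[n_test:]
        # Step 3: ridge-regularized linear regression with an intercept column
        A = np.hstack([X[train], np.ones((len(train), 1))])
        w = np.linalg.solve(A.T @ A + ridge * np.eye(A.shape[1]), A.T @ y[train])
        pred = np.hstack([X[test], np.ones((len(test), 1))]) @ w
        errors.append(np.sqrt(np.mean((pred - y[test]) ** 2)))
    return float(np.mean(errors))

def select_preprocessing(X_lc, y, windows=(1, 3, 5, 7, 9)):
    """Repeat steps 1-4 for each candidate window; keep the lowest average error."""
    scores = {w: cv_rmse(moving_average(X_lc, w), y) for w in windows}
    best = min(scores, key=scores.get)
    return best, scores

# Synthetic locally corrected spectra: one hypothetical analyte band plus noise
rng = np.random.default_rng(1)
wavelengths = np.linspace(0.0, 1.0, 40)
pure = np.exp(-((wavelengths - 0.5) ** 2) / 0.01)
y = rng.uniform(50.0, 400.0, size=60)  # synthetic reference concentrations
X_lc = y[:, None] * pure[None, :] * 1e-3 + rng.normal(0.0, 0.02, (60, 40))

best_window, scores = select_preprocessing(X_lc, y)
```

Reusing the same random splits for every candidate keeps the comparison between parameter sets fair. In practice the search would extend over filter type, filter degree, and derivative order rather than a single window length.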
[0079] A multivariate regression algorithm models the relation between predictor and response variables. Thus, a calibration spectral matrix χ, having one row per calibration spectrum and d columns, where d is the number of wavelengths, may be considered as the predictor, and the analyte concentration vector, having one entry per calibration spectrum, is considered as the response. The i.sup.th row of the spectral matrix corresponds to a locally corrected and preprocessed spectrum (e.g., a Savitzky-Golay filter and second derivative applied to the locally corrected absorbance spectrum), and the i.sup.th entry of the response vector corresponds to the analyte concentration measured with the gold standard. Once the relation between predictor and response is determined, an unknown analyte concentration can be predicted from a new locally corrected and preprocessed spectrum. Multivariate regression may include partial least squares regression and its modifications, multiple linear regression, support vector regression, artificial neural networks, and/or principal components regression.
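As one hedged illustration of how a calibration vector might be derived from the predictor matrix and the response vector, the sketch below uses principal components regression, one of the regression options listed above. The function name `fit_pcr`, the number of components, and the synthetic two-component mixture are assumptions made for the example.

```python
import numpy as np

def fit_pcr(X, y, n_components):
    """Principal components regression (illustrative sketch).

    X: (m, d) locally corrected and preprocessed spectra (predictor).
    y: (m,) gold-standard concentrations (response).
    Returns (weights, intercept) with y ~ X @ weights + intercept.
    """
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    # Rows of Vt are the principal directions in wavelength space
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    P = Vt[:n_components].T          # (d, n_components) loadings
    T = Xc @ P                       # (m, n_components) scores
    coef, *_ = np.linalg.lstsq(T, yc, rcond=None)
    weights = P @ coef               # calibration vector over the d wavelengths
    intercept = y_mean - x_mean @ weights
    return weights, intercept

# Synthetic mixture: the analyte of interest plus one interfering component
rng = np.random.default_rng(2)
grid = np.linspace(0.0, 1.0, 30)
analyte = np.exp(-((grid - 0.3) ** 2) / 0.005)
interferent = np.exp(-((grid - 0.6) ** 2) / 0.02)
c_analyte = rng.uniform(1.0, 10.0, size=25)
c_interferent = rng.uniform(1.0, 10.0, size=25)
X = c_analyte[:, None] * analyte + c_interferent[:, None] * interferent

weights, intercept = fit_pcr(X, c_analyte, n_components=2)
predicted = X @ weights + intercept
```

The intercept term arises from mean-centering and is an addition of this sketch; the weighted sum given in the sensing description has no explicit offset.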
Sensing or Analyte Measurement
[0080] Referring to
[0081] Next, the collected absorbance spectra undergo baseline correction, using the global scattering correction (GSC) reference taken from the CPU memory, in step 2221 in order to initiate the clustering procedure. For clustering the collected spectra, the cluster centroids and the maximum allowable distance to the cluster centroids are provided from the CPU memory, and the data is classified accordingly in step 2223. If the distance to the provided cluster centroids exceeds the maximum allowable distance, the CPU issues an error message in step 2224 prompting the user to adjust the sensor position and restart the data collection; this is repeated until the distance no longer exceeds the maximum allowable distance. If the collected data, after baseline correction, is within the allowable distance of the cluster centroids in step 2225, the corresponding raw spectrum is assigned in step 2226 to the cluster whose centroid is nearest.
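The classification and distance check described above can be sketched as a nearest-centroid lookup. The sketch assumes a Euclidean distance and a single maximum allowable distance shared by all clusters; the source specifies neither. The function name `assign_cluster` and the numeric values are hypothetical.

```python
import numpy as np

def assign_cluster(y_gc, centroids, max_distance):
    """Return the index of the nearest centroid, or None if it is too far.

    y_gc: (d,) globally corrected spectrum.
    centroids: (N, d) cluster centroids stored during calibration.
    max_distance: maximum allowable distance stored during calibration.
    """
    distances = np.linalg.norm(np.asarray(centroids) - np.asarray(y_gc), axis=1)
    k = int(np.argmin(distances))
    if distances[k] > max_distance:
        # Caller would report an error and ask the user to reposition the sensor
        return None
    return k

centroids = np.array([[0.0, 0.0], [10.0, 10.0]])
near_first = assign_cluster(np.array([1.0, 1.0]), centroids, max_distance=2.0)
near_second = assign_cluster(np.array([9.0, 10.0]), centroids, max_distance=2.0)
too_far = assign_cluster(np.array([5.0, 5.0]), centroids, max_distance=2.0)
```

Returning `None` instead of raising an exception lets the calling code decide how to prompt for a new measurement.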
[0082] Next, in step 2227, the raw spectrum within the newly assigned cluster undergoes baseline correction using a local scattering correction reference from the CPU memory, and in step 2228 the data is preprocessed using the data preprocessing set from the CPU memory. The spectrum is then ready for the prediction step 2229, where it is multiplied by the cluster-specific calibration vector V.sup.k, which was obtained by multivariate regression calibration and is read from the CPU memory. Multiplying the row vector of the spectrum by the column vector of regression weights yields a single value for the analyte concentration. Each analyte has its own calibration vector and thus its own weights, i.e., a different wavelength specificity. For instance, 2100 nm can be relevant for both lactate and glucose, but the weights will be different. The analyte concentration is c=w.sub.1*A.sub.1+w.sub.2*A.sub.2+ . . . +w.sub.n*A.sub.n. Here, w.sub.n is the calibration weight at the n.sup.th wavelength and A.sub.n is the locally corrected and preprocessed absorbance at the n.sup.th wavelength. The output is then a calibrated concentration level of the analyte of interest.
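The weighted sum for the concentration c is a dot product between the preprocessed absorbance vector and the calibration vector. The short sketch below illustrates this with invented three-wavelength vectors for two analytes; the numeric weights and absorbances are purely illustrative and carry no physical meaning.

```python
import numpy as np

def predict_concentration(absorbance, weights):
    """c = w_1*A_1 + w_2*A_2 + ... + w_n*A_n for one analyte."""
    absorbance = np.asarray(absorbance, dtype=float)
    weights = np.asarray(weights, dtype=float)
    if absorbance.shape != weights.shape:
        raise ValueError("absorbance and weights must cover the same wavelengths")
    return float(absorbance @ weights)

# Hypothetical calibration vectors: same wavelengths, different weights per analyte
calibration = {
    "glucose": [0.5, -1.0, 2.0],
    "lactate": [1.5, 0.5, -0.5],
}
A = [0.2, 0.1, 0.4]  # locally corrected and preprocessed absorbances (illustrative)

c_glucose = predict_concentration(A, calibration["glucose"])  # 0.1 - 0.1 + 0.8
c_lactate = predict_concentration(A, calibration["lactate"])  # 0.3 + 0.05 - 0.2
```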
[0083] In general, the sensing process starts in a similar way to the calibration process. Specifically, EMR is directed to a sample (also called a medium) in which the analyte concentration is to be determined. The EMR is swept through a range of wavelengths. In response, EMR is received from the sample, where the received EMR is diffusely reflected by or transmitted through the sample. The received EMR, having components at different wavelengths, is converted into a raw absorbance spectrum (also called a raw spectrum). This process may be repeated several times to obtain a number of raw spectra, which are then averaged to obtain an averaged raw spectrum denoted Y.sup.raw. Here again, in the discussion below, we omit the term “averaged” for simplicity.
[0084] Scattering correction is then applied to the raw spectrum Y.sup.raw using the global reference X.sub.ref.sup.G (generated during the calibration process) to obtain a globally corrected spectrum Y.sup.GC. Clustering is then performed using the cluster centroid values σ.sup.k and the maximum allowable distance to the centroid, both read from the memory, to identify the cluster to which Y.sup.GC belongs. That cluster may be denoted C.sub.k, where k∈[1,N], and where the number N was either specified for the clustering operation or, alternatively, was determined while performing clustering as part of the calibration process. The corresponding raw spectrum Y.sup.raw is then assigned to the same cluster C.sub.k.
[0085] Thereafter, scattering correction is applied again to the raw spectrum Y.sup.raw within the selected cluster C.sub.k, using the corresponding local reference denoted X.sub.ref.sup.L.sup.
[0086] An example of a transdermal sensor performance with a piglet for three different analytes—i.e., blood glucose, blood lactate and blood ethanol, in accordance with an embodiment of the invention, is provided in
[0087] In
[0088] The representative results demonstrate excellent sensor performance over a wide dynamic range of glucose concentration, from 75 mg/dl (4.16 mmol/l) to 400 mg/dl (22.2 mmol/l), with a determination coefficient of 97.2%, a root mean square error of prediction (RMSEP) of 14.7 mg/dl (or 0.8 mmol/l), and a mean absolute relative difference of 6.7% over the entire range.
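For reference, the figures of merit quoted above can be computed as sketched below. The definitions are the conventional ones and are assumptions here, since the source does not spell them out; in particular, the determination coefficient is taken as one minus the ratio of residual to total sum of squares. The conversion factor assumes a glucose molar mass of about 180.16 g/mol, and the four-point data set is invented for illustration.

```python
import numpy as np

MGDL_PER_MMOLL_GLUCOSE = 18.016  # 1 mmol/l of glucose is about 18.016 mg/dl

def mgdl_to_mmoll(value_mgdl):
    """Convert a glucose concentration from mg/dl to mmol/l."""
    return value_mgdl / MGDL_PER_MMOLL_GLUCOSE

def rmsep(reference, predicted):
    """Root mean square error of prediction."""
    r, p = np.asarray(reference, float), np.asarray(predicted, float)
    return float(np.sqrt(np.mean((p - r) ** 2)))

def mard(reference, predicted):
    """Mean absolute relative difference, in percent."""
    r, p = np.asarray(reference, float), np.asarray(predicted, float)
    return float(np.mean(np.abs(p - r) / r) * 100.0)

def r_squared(reference, predicted):
    """Determination coefficient, taken as 1 - SS_res / SS_tot."""
    r, p = np.asarray(reference, float), np.asarray(predicted, float)
    ss_res = np.sum((r - p) ** 2)
    ss_tot = np.sum((r - r.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)

# Invented example values in mg/dl
reference = [100.0, 200.0, 300.0, 400.0]
predicted = [110.0, 190.0, 310.0, 390.0]

example_rmsep = rmsep(reference, predicted)
example_mard = mard(reference, predicted)
example_r2 = r_squared(reference, predicted)
low_mmoll = round(mgdl_to_mmoll(75.0), 2)    # lower end of the quoted range
high_mmoll = round(mgdl_to_mmoll(400.0), 1)  # upper end of the quoted range
```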
[0089] In
[0090] In
[0091] In
[0092] Due to the nonlinear nature of diffuse reflectance, one of the important steps in data preprocessing is the linearization and scattering correction of the collected spectrum. When correctly applied, this step allows further processing of the data, for instance Beer-Lambert absorbance-based analysis, in which the linearized and corrected spectrum is decomposed into individual components. This subsequent analysis may be performed in combination with other linear regression techniques to obtain a calibrated value of the concentration level of the constituent/analyte of interest.
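A minimal sketch of these two ideas follows. It assumes that the Kubelka-Munk function F(R) = (1 - R)^2 / (2R) is used as the linearizing transform for diffuse reflectance R, and that the decomposition is an ordinary least-squares fit of known component spectra, consistent with the additive Beer-Lambert model. Whether the described system uses exactly these forms is not stated in the source; the function names and the Gaussian component spectra are hypothetical.

```python
import numpy as np

def kubelka_munk(reflectance):
    """Kubelka-Munk function F(R) = (1 - R)^2 / (2 R) for diffuse reflectance R."""
    R = np.asarray(reflectance, dtype=float)
    return (1.0 - R) ** 2 / (2.0 * R)

def decompose(spectrum, component_spectra):
    """Least-squares amounts of each component in a linearized spectrum.

    spectrum: (d,) linearized, scatter-corrected spectrum.
    component_spectra: (n_components, d) spectra of the individual components.
    """
    basis = np.asarray(component_spectra, dtype=float).T  # (d, n_components)
    amounts, *_ = np.linalg.lstsq(basis, np.asarray(spectrum, float), rcond=None)
    return amounts

# Hypothetical component spectra (Gaussian bands) and a known mixture
grid = np.linspace(0.0, 1.0, 60)
component_a = np.exp(-((grid - 0.35) ** 2) / 0.004)
component_b = np.exp(-((grid - 0.65) ** 2) / 0.010)
mixture = 0.3 * component_a + 1.2 * component_b

amounts = decompose(mixture, [component_a, component_b])
km_half = float(kubelka_munk(0.5))  # (0.5)^2 / 1.0
```

The decomposition is only meaningful once the spectrum is linear in concentration, which is why the linearization and scattering correction precede it.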
[0093] In
[0094] Further processing of the isolated spectrum can be performed as shown in
[0095] In
[0096] In
[0097] The described embodiments of the invention are intended to be merely exemplary and numerous variations and modifications are intended to be within the scope of the present invention as defined in the appended claims.