Apparatus and method for estimation concentration of blood compound
11504070 · 2022-11-22
Assignee
Inventors
- Sujit Jos (Bangalore, IN)
- Srikanth Mallavarapu Rama (Bangalore, IN)
- Kiran Bynam (Bangalore, IN)
- So Young Lee (Daejeon, KR)
- Gorish Aggarwal (Bangalore, IN)
Cpc classification
A61B5/14532
HUMAN NECESSITIES
A61B5/7275
HUMAN NECESSITIES
A61B5/0075
HUMAN NECESSITIES
A61B5/7278
HUMAN NECESSITIES
A61B5/1455
HUMAN NECESSITIES
International classification
A61B5/00
HUMAN NECESSITIES
A61B5/1455
HUMAN NECESSITIES
G01N21/27
PHYSICS
Abstract
A method of estimating concentration of a blood compound may include: removing a baseline drift from Near-Infrared (NIR) spectroscopy data to obtain drift-free spectral features; obtaining a set of global features based on the drift-free spectral features; and estimating the concentration of the blood compound by regression using the set of global features.
Claims
1. A method of estimating concentration of a blood compound, the method comprising removing a baseline drift from Near-Infrared (NIR) spectroscopy data to obtain drift-free spectral features, wherein the removing the baseline drift comprises: obtaining a linear drift approximation of the NIR spectroscopy data; scaling the linear drift approximation of the NIR spectroscopy data by a ratio between an amplitude span of a plurality of spectral features of the NIR spectroscopy data and an amplitude span of a plurality of principal components of the NIR spectroscopy data; subtracting the scaled linear drift approximation from the NIR spectroscopy data to obtain the drift-free spectral features; obtaining a set of global features based on the drift-free spectral features; and estimating a concentration of the blood compound by regression using the set of global features, wherein the amplitude span of the plurality of spectral features is a difference between a maximum value and a minimum value of the plurality of spectral features for each wavelength, and the amplitude span of the plurality of principal components is a difference between a maximum value and a minimum value of the plurality of principal components for each wavelength.
2. The method of claim 1, wherein the removing the baseline drift from the NIR spectroscopy data comprises removing the baseline drift from the NIR spectroscopy data using principal component analysis (PCA).
3. The method of claim 1, wherein the obtaining the set of global features comprises: obtaining similarity values between each of the drift-free spectral features and a compound vector consisting of a set of reference values; ranking the drift-free spectral features based on the similarity values between each of the drift-free spectral features and the compound vector consisting of the set of reference values; and based on rankings of the drift-free spectral features, selecting a predefined number of drift-free spectral features as the set of global features.
4. The method of claim 1, wherein the removing the baseline drift further comprises: selecting a principal component that characterizes the baseline drift, from among the plurality of principal components, based on a change in the principal component over time; and obtaining, as the linear drift approximation, a polynomial approximation of a predefined degree of the selected principal component.
5. The method of claim 4, wherein the selecting the principal component that characterizes the baseline drift comprises selecting a first principal component from among the plurality of principal components as the principal component that characterizes the baseline drift.
6. The method of claim 4, wherein the obtaining the polynomial approximation of the predefined degree of the selected principal component comprises obtaining the polynomial approximation that minimizes a least squared error between the polynomial approximation and the baseline drift.
7. The method of claim 1, wherein the scaling the linear drift approximation comprises dividing the linear drift approximation by the ratio between the amplitude span of the plurality of spectral features of the NIR spectroscopy data and the amplitude span of the plurality of principal components of the NIR spectroscopy data.
8. A blood compound concentration prediction apparatus, comprising at least one processor configured to: remove a baseline drift from Near-Infrared (NIR) spectroscopy data to obtain drift-free spectral features by: obtaining a linear drift approximation of the NIR spectroscopy data; scaling the linear drift approximation of the NIR spectroscopy data by a ratio between an amplitude span of a plurality of spectral features of the NIR spectroscopy data and an amplitude span of a plurality of principal components of the NIR spectroscopy data; subtracting the scaled linear drift approximation from the NIR spectroscopy data to obtain the drift-free spectral features; obtain a set of global features based on the drift-free spectral features; and estimate a concentration of a blood compound by regression using the set of global features, wherein the amplitude span of the plurality of spectral features is a difference between a maximum value and a minimum value of the plurality of spectral features for each wavelength, and the amplitude span of the plurality of principal components is a difference between a maximum value and a minimum value of the plurality of principal components for each wavelength.
9. The blood compound concentration prediction apparatus of claim 8, wherein the at least one processor is further configured to remove the baseline drift from the NIR spectroscopy data using principal component analysis (PCA).
10. The blood compound concentration prediction apparatus of claim 8, wherein the at least one processor is further configured to: obtain similarity values between each of the drift-free spectral features and a compound vector consisting of a set of reference values; rank the drift-free spectral features based on the similarity values between each of the drift-free spectral features and the compound vector consisting of the set of reference values; and based on rankings of the drift-free spectral features, selecting a predefined number of drift-free spectral features as the set of global features.
11. The blood compound concentration prediction apparatus of claim 8, wherein the at least one processor is further configured to: select a principal component that characterizes the baseline drift, from among the plurality of principal components, based on a change in the principal component over time; and obtain, as the linear drift approximation, a polynomial approximation of a predefined degree of the selected principal component.
12. The blood compound concentration prediction apparatus of claim 11, wherein the at least one processor is further configured to select a first principal component from among the plurality of principal components as the principal component that characterizes the baseline drift.
13. The blood compound concentration prediction apparatus of claim 11, wherein the at least one processor is further configured to obtain the polynomial approximation that minimizes a least squared error between the polynomial approximation and the baseline drift.
14. The blood compound concentration prediction apparatus of claim 8, wherein the at least one processor is further configured to obtain the linear drift approximation by dividing the linear drift approximation by the ratio between the amplitude span of the plurality of spectral features of the NIR spectroscopy data and the amplitude span of the plurality of principal components of the NIR spectroscopy data.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The above and/or other aspects will be more apparent by describing certain exemplary embodiments, with reference to the accompanying drawings, in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION
(11) Exemplary embodiments are described in greater detail below with reference to the accompanying drawings.
(12) In the following description, like drawing reference numerals are used for like elements, even in different drawings. The matters defined in the description, such as detailed construction and elements, are provided to assist in a comprehensive understanding of the exemplary embodiments. However, it is apparent that the exemplary embodiments can be practiced without those specifically defined matters. Also, well-known functions or constructions are not described in detail since they would obscure the description with unnecessary detail.
(13) The specification may refer to “an”, “one” or “some” embodiment(s) in several locations. This does not necessarily imply that each such reference is to the same embodiment(s), or that the feature only applies to a single embodiment. Single features of different embodiments may also be combined to provide other embodiments.
(14) As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless expressly stated otherwise. It will be further understood that the terms “includes”, “comprises”, “including” and/or “comprising” when used in this specification, specify the presence of stated features, integers, steps, operations, elements and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. As used herein, the term “and/or” includes any and all combinations and arrangements of one or more of the associated listed items.
(15) Expressions such as “at least one of,” when preceding a list of elements, modify the entire list of elements and do not modify the individual elements of the list. For example, the expression, “at least one of a, b, and c,” should be understood as including only a, only b, only c, both a and b, both a and c, both b and c, all of a, b, and c, or any variations of the aforementioned examples.
(16) Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
(17) The embodiments herein and the various features and advantages details thereof are explained more fully with reference to the non-limiting embodiments that are illustrated in the accompanying drawings and detailed in the following description. Descriptions of well-known components and processing techniques are omitted so as to not unnecessarily obscure the embodiments herein. The examples used herein are intended merely to facilitate an understanding of ways in which the embodiments herein can be practiced and to further enable those of skill in the art to practice the embodiments herein. Accordingly, the examples should not be construed as limiting the scope of the embodiments herein.
(18) An example embodiment provides a method for predicting concentration of a blood compound non-invasively using NIR spectroscopy. The embodiment provides a drift removal algorithm which makes use of information from principal components of the NIR spectroscopy data for the drift removal process. The term “drift” may refer to a baseline drift of a bio-signal, such as a photoplethysmogram (PPG) signal, an electromyography (EMG) signal, or an electrocardiography (ECG) signal. The embodiment further provides extraction of a set of global features for prediction of the concentration of the blood compound using regression. The same is illustrated in
(19)
(20) According to one example embodiment, the blood compound concentration prediction apparatus 150 may include a drift removal unit 152, global feature extraction unit 154 and a prediction unit 156. The drift removal unit 152, the global feature extraction unit 154 and the prediction unit 156 may be implemented by one or more processors. The drift removal unit 152 computes and removes drift from the NIR spectroscopy data. In detail, the NIR spectroscopy data is obtained as follows.
(21) At first, the value of a blood compound is obtained using a standard invasive procedure (e.g., a blood pressure measurement using a cuff). Then, a non-invasive spectral scan is performed on a person/test subject using near-Infrared spectrometer to obtain raw NIR spectra. The raw NIR spectra is labelled as a blood compound value which was obtained from the invasive procedure, and is stored in the blood compound concentration prediction apparatus 150. The obtained raw NIR spectra are preprocessed further to obtain compound spectra. The compound spectra and the associated compound values may be arranged into the form of the matrix X using data obtained in consecutive measurements, which would be referred as data matrix in the rest of the document.
(22)
(23) Here, c=[c.sup.1 c.sup.2 . . . c.sup.N].sup.T is a compound vector. The matrix S is the NIR spectroscopy data. The NIR spectroscopy data is affected by the drift which in turn affects the prediction accuracy of the compound of interest. Each column of the matrix S is the absorption spectra associated with the wavelength λ and may be represented by the vector s.sub.λ. It may be noted that the absorption spectra s.sub.λ in some embodiments can be interchangeably referred to as “spectral feature” or “feature”.
s.sub.λ=[s.sub.λ.sup.1s.sub.λ.sup.2s . . . s.sub.λ.sup.N]
(24) The absorption spectra s.sub.λ could be written as
s.sub.λ==s.sub.λ.sup.t+f.sub.λ
(25) Here, s.sub.λ.sup.t is the true absorption spectra and f.sub.λ is the drift affecting the true absorption spectra.
(26) The drift removal unit 152 obtains an estimate of the drift component f′.sub.λ and subtracts it from s.sub.λ to obtain the drift-free spectra (t) which is expressed as:
=s.sub.λ−f′.sub.λ
(27) In one example embodiment, the drift removal unit 152 removes drift using principal component analysis (PCA). The drift removal unit 152 performs the PCA operation for obtaining the i.sup.th principal component P.sub.c.sup.i which is described as
Z=PCA(S)
p.sub.c.sup.i=S*Z(:,i)
(28) Here, the variables follow the standard notations. If the drift component on the data set is significant enough to impact the predictions based on the set, it is likely to manifest in the first principal component of the data set. Else, the drift would manifest in say i.sup.th principal component. Also, since all the principal components are uncorrelated, it is a reasonable assumption that if the drift component is captured in the i.sup.th principal component, it is unlikely that it would significantly manifest in any other principal components. Let the i.sup.th Principal component in which drift is manifested be denoted by p.sub.c. In an example embodiment, the first principal component may be selected from a plurality of principal components to remove the drift component when the change in the value of the first principal component over time is greater than a predetermined value.
(29)
(30)
(31)
(32) The d-decimation of S is defined as
(33)
(34) The set S.sup.d is obtained by including every d.sup.th row of the matrix S. As shown in
(35)
(36)
(37) Here, d.sub.s is the amplitude span of s.sub.λ given by d.sub.s=(max(s.sub.λ)−min(s.sub.λ)) and d.sub.p is the amplitude span of P′.sub.c given by d.sub.p=(max(p′.sub.c)−min(p′.sub.c)).
(38) Finally, in operation 408, the drift removal is performed by subtracting the spectral drift approximation f′.sub.λ from the respective s.sub.λ for every λ. This is represented as=s.sub.λ−f′.sub.λ,λ=λ.sub.0,λ.sub.1, . . . λ.sub.n−1
(39) The is also referred to as drift-free spectral feature or simply drift-free feature.
(40) and the compound's concentration be denoted by c.sup.k. In one example embodiment, the similarity value may be obtained as the correlation of the drift-free spectral feature
with the compound vector c.sup.k, which may be computed as
Ψ.sub.k(λ)=<c.sup.k,>
(41) In operation 504, similarity metric for each drift-free spectral feature is obtained using similarity values obtained across all test subjects. In one example embodiment, the similarity metric may be computed as
R(λ)=Σ.sub.kΨ.sub.k.sup.2(λ)
(42) In operation 506, the drift-free spectral features are ranked as per the similarity metric. In operation 508, a K number of drift-free spectral features are selected in order of the ranking for prediction of the compound concentration using regression. The number K may have a predetermined value, and/or may be decided based on the performance of particular regression method employed for prediction. The K number of drift-free features are referred to as “global features” in rest of the document.
(43) Now, based on the obtained global features, the prediction unit 156 predicts or estimates concentration of the blood compound using regression from the drift free spectroscopy data.
(44)
(45) Referring to
(46) The input interface 620 may receive NIR spectroscopy data, and may receive input of various operation signals from a user. In the embodiment, the input interface 620 may include a keypad, a dome switch, a touch pad (static pressure/capacitance), a jog wheel, a jog switch, a hardware (H/W) button, and the like. Particularly, the touch pad, which forms a layer structure with a display, may be called a touch screen.
(47) The storage 630 may store programs or commands for operation of the blood compound concentration prediction apparatus 600, and may store data input to and output from the blood compound concentration prediction apparatus 600 and data processed by the blood compound concentration prediction apparatus 600, and the like.
(48) The storage 630 may include at least one storage medium of a flash memory type memory, a hard disk type memory, a multimedia card micro type memory, a card type memory (e.g., an SD memory, an XD memory, etc.), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a Read Only Memory (ROM), an Electrically Erasable Programmable Read Only Memory (EEPROM), a Programmable Read Only Memory (PROM), a magnetic memory, a magnetic disk, and an optical disk, and the like. Further, the blood compound concentration prediction apparatus 600 may operate an external storage medium, such as web storage and the like, which performs a storage function of the storage 630 on the Internet.
(49) The communication interface 640 may communicate with an external device. For example, the communication interface 640 may transmit, to the external device, the data input to the blood compound concentration prediction apparatus 600, data stored in and processed by the blood compound concentration prediction apparatus 600, and the like, or may receive, from the external device, various data useful for estimating a blood compound concentration.
(50) In this case, the external device may be medical equipment using the data input to the blood compound concentration prediction apparatus 600, data stored in and processed by the blood compound concentration prediction apparatus 600, and the like, a printer to print out results, or a display device. In addition, the external device may be a digital TV, a desktop computer, a cellular phone, a smartphone, a tablet PC, a laptop computer, a personal digital assistant (PDA), a portable multimedia player (PMP), a navigation device, an MP3 player, a digital camera, a wearable device, and the like, but is not limited thereto.
(51) The communication interface 640 may communicate with external devices by using Bluetooth communication, Bluetooth Low Energy (BLE) communication, Near Field Communication (NFC), WLAN communication, Zigbee communication, Infrared Data Association (IrDA) communication, Wi-Fi Direct (WFD) communication, Ultra Wideband (UWB) communication, Ant+ communication, WIFI communication, Radio Frequency Identification (RFID) communication, 3G communication, 4G communication, 5G communication, and the like. However, this is merely exemplary and communication is not limited thereto.
(52) The output interface 650 may output the data input to the blood compound concentration prediction apparatus 600, data stored in and processed by the blood compound concentration prediction apparatus 600, and the like. In the embodiment, the output interface 650 may output the data input to the blood compound concentration prediction apparatus 600, data stored in and processed by the blood compound concentration prediction apparatus 600, and the like, by using at least one of an acoustic method, a visual method, and a tactile method. To this end, the output interface 650 may include a display, a speaker, a vibrator, and the like.
(53) While not restricted thereto, an exemplary embodiment can be embodied as computer-readable code on a computer-readable recording medium. The computer-readable recording medium is any data storage device that can store data that can be thereafter read by a computer system. Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices. The computer-readable recording medium can also be distributed over network-coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion. Also, an exemplary embodiment may be written as a computer program transmitted over a computer-readable transmission medium, such as a carrier wave, and received and implemented in general-use or special-purpose digital computers that execute the programs. Moreover, it is understood that in exemplary embodiments, one or more units of the above-described apparatuses and devices can include circuitry, a processor, a microprocessor, etc., and may execute a computer program stored in a computer-readable medium.
(54) The foregoing exemplary embodiments are merely exemplary and are not to be construed as limiting. The present teaching can be readily applied to other types of apparatuses. Also, the description of the exemplary embodiments is intended to be illustrative, and not to limit the scope of the claims, and many alternatives, modifications, and variations will be apparent to those skilled in the art.