SYSTEMS AND METHODS FOR DETECTING DISEASES BASED ON THE PRESENCE OF VOLATILE ORGANIC COMPOUNDS IN THE BREATH
20230215569 · 2023-07-06
Assignee
Inventors
- Conrad BESSEMER (Millersville, MD, US)
- Susan Bessemer (Millersville, MD, US)
- Biplab PAL (Ellicot City, MD, US)
Cpc classification
G16H50/20
PHYSICS
G16H50/70
PHYSICS
International classification
Abstract
Systems and methods are provided for detecting potentially fatal, and non-fatal diseases in an non-invasive, low-cost, and reliable manner by detecting the trace presence of volatile organic compounds (VOCs) in the human breath. The systems and methods can be home-based, non-invasive systems and methods for diagnosing CLD (chronic liver disease), CKD (chronic kidney disease), and other diseases using lifestyle-based, repetitive detection of VOCs in the human breath and an adaptive machine learning algorithm.
Claims
1. An edge intelligent device configured to track bio-markers of diseases by measuring VOCs of the patient on different metabolic conditions (after and before fasting & eating etc.) and by building a heterogenous predictive model by combining the data obtained from the VOC levels after specified metabolic conditions with personal life-style information of the patients to reduce the dispersion of the bio-marker data for the purpose of clean and reliable separation between healthy and un-healthy patients, and a home-based platform for early detection of critical diseases like CKD, CLD, etc.
2. The edge intelligent device of claim 1, wherein the device is further configured to receive dynamic voltage-current signal from a VOC detecting bio-sensor which is made out of silk substrate functionalized with PPY (Polypyrrole), RGO (reduced graphene oxide) or CNT (carbon nanotube) or an organic transistor whose gate has been functionalized with RGO reduced with Curcumin or similar reducing agent.
3. A method for creating a diagnostic tool for determining the presence or absence of a disease, comprising: measuring levels of one or more volatile organic compounds in the breath of one or more individuals known to be afflicted with the disease, at a predetermined point in a metabolic cycle of the one of more individuals known to be afflicted with the disease; obtaining lifestyle data about the one or more individuals known to be afflicted with the disease; measuring levels of the one or more volatile organic compounds in the breath of one or more individuals known not to be afflicted with the disease at a predetermined point in a metabolic cycle of the one of more individuals known not to be afflicted with the disease; obtaining lifestyle data about the one or more individuals known not to be afflicted with the disease; assembling a data set comprising: the levels of the one or more volatile organic compounds in the breath of one or more individuals known to be afflicted with the disease; the lifestyle data about the one or more individuals known to be afflicted with the disease; the levels of one or more volatile organic compounds in the breath of one or more individuals known not to be afflicted with the disease; and the lifestyle data about the one or more individuals known not to be afflicted with the disease; reducing a dimensionality of a data set; creating a classification model for determining the presence and absence of the disease based on the reduced-dimensionality data set; and validating the classification model.
4. A method for determining the presence or absence of a disease in an individual, comprising: measuring levels of one or more volatile organic compounds in the breath of the individual, at a predetermined point in a metabolic cycle of the individual; obtaining lifestyle data about the individual; inputting the levels of one or more volatile organic compounds in the breath of the individual and the lifestyle data about the individual into the classification model of claim 3; and using the classification model to determine the presence or absence of the disease in the individual.
5. The method of claim 3, wherein measuring levels of one or more volatile organic compounds in the breath of one or more individuals known to be afflicted with the disease comprises measuring the levels of one or more volatile organic compounds in the breath of the one or more individuals known to be afflicted with the disease using a VOC detecting bio-sensor comprising silk substrate functionalized with PPY (Polypyrrole), RGO (reduced graphene oxide) or CNT (carbon nanotube); or an organic transistor comprising a gate functionalized with RGO reduced with curcumin or a similar reducing agent.
6. The method of claim 5, wherein measuring levels of the one or more volatile organic compounds in the breath of one or more individuals known not to be afflicted with the disease comprises measuring the levels of the one or more volatile organic compounds in the breath of the one or more individuals known not to be afflicted with the disease using the VOC detecting bio-sensor.
Description
DESCRIPTION OF THE DRAWINGS
[0019] The following drawings are illustrative of particular embodiments of the present disclosure and do not limit the scope of the present disclosure. The drawings are not to scale and are intended for use in conjunction with the explanations provided herein. Embodiments of the present disclosure will hereinafter be described in conjunction with the appended drawings.
[0020]
[0021]
[0022]
[0023]
DETAILED DESCRIPTION
[0024] The inventive concepts are described with reference to the attached figures, wherein like reference numerals represent like parts and assemblies throughout the several views. The figures are not drawn to scale and are provided merely to illustrate the instant inventive concepts. The figures do not limit the scope of the present disclosure or the appended claims. Several aspects of the inventive concepts are described below with reference to example applications for illustration. It should be understood that numerous specific details, relationships, and methods are set forth to provide a full understanding of the inventive concepts. One having ordinary skill in the relevant art, however, will readily recognize that the inventive concepts can be practiced without one or more of the specific details or with other methods. In other instances, well-known structures or operation are not shown in detail to avoid obscuring the inventive concepts.
[0025] Many researchers have reported VOCs or biomarkers specific to particular diseases, but vary on the specific concentrations of VOCs present in the breath of diseased and healthy persons. For example, according to a research paper published by Naim Alkhouri (see https://doi.org/10.1097/meg.0b013e3283650669), isoprene, acetone, trimethylamine, acetaldehyde, and pentane at the levels noted in
[0026] A journal article titled “GC-MS Analysis of Breath Oder Compounds in Liver Patients” (https://doi.org/10.1016/j.jchromb.2008.08.031) reports that dimethyl sulfide, acetone, 2-butanone, and 2-pentanone are present in increased levels in the breath of patients with liver disease. In contrast, the levels of indole and dimethyl selenide were decreased in such patients. A disease detecting model was built with a sensitivity and specificity of 100 percent and 70 percent, respectively, based on acquired data.
[0027] Another study titled “Isoprene in the Exhaled Breath is a Novel Biomarker for Advanced Fibrosis in Patients with Chronic Liver Disease: A Pilot Study” (10.1038/ctg.2015.40) reports that, of 61 patients, 33% had advanced fibrosis, 44% had chronic hepatitis C, 30% had non-alcoholic fatty liver disease, and 26% had other CLD. SIFT-MS analysis of exhaled breath revealed that patients with advanced fibrosis had significantly lower values of six compounds in comparison to patients without advanced fibrosis. Isoprene is an endogenous VOC that is a by-product of cholesterol biosynthesis.
[0028] Another study, titled “The Breath Prints in Patients with Liver Disease Identify Novel Breath Biomarkers in Alcoholic Hepatitis” (hllps://doi.org/10.1016/j.cgh.2013.08.048), reports increased levels of 2-propanol, acetaldehyde, acetone, ethanol, pentane, and trimethylamine [TMA] compounds in patients with liver disease in comparison to the levels in control subjects.
[0029] While the above studies each report a baseline for distinguishing healthy from unhealthy individuals, the baselines for particular VOCs in the breath are inconsistent for the same disease across the studies; and the baselines are so widely dispersed that separating healthy and unhealthy people is not possible merely by detecting the presence and/or level of the relevant biomarker.
[0030] Comparing the data from the above studies shows inconsistencies in the concentrations of the VOCs emitted by diseased and healthy individuals, invalidating the disease detection model built in some, or all of the studies. The following are at least some of the limitations seen when comparing the data across the studies.
[0031] The baseline values of different biomarkers reported by different studies were limited by sample data.
[0032] Different studies considered specific age groups and different clinical histories to determine baselines for different biomarkers. As a result, there is no consistent pattern found for both healthy and liver-disease data, and it is not possible to find a unique baseline for the biomarker (VOC) concentration level for each effective biomarker. For example, the acetone concentration data (in ppb) for liver disease reported by two different studies gives the data presented in
[0033] The disclosed technology builds an adaptive model for separating diseased and healthy individuals using a data science method/condition to reduce the dispersion of the baseline by building a VOC profile of each patient. The process for developing the model is depicted in
[0034] By taking VOC readings from an individual over one or two days, the VOC profile of the individual can be built, and can be used to identify diseased vs. healthy individuals.
[0035] The VOC readings can be obtained, for example, by a VOC detecting bio-sensor that is made of silk substrate functionalized with PPY (Polypyrrole), RGO (reduced graphene oxide), or CNT (Carbon Nanotube), or an organic transistor whose gate has been functionalized with RGO reduced with curcumin or similar reducing agent. The output of the sensor can be fed to an edge device or other computing device as a dynamic voltage-current signal.
[0036] Summary Analytical Approach
[0037] Various studies have reported that specific VOCs or biomarkers in the human breath can provide significant clues to detecting the health of the liver and kidney of an individual. Thus, these markers can be used as an indicator for the early detection of CLD and CKD. Several studies, however, show different optimal levels for the concentration of the biomarker (in ppb) used to detect the disease. With the variation in the range of the samples studied in the existing literature, it is not believed to be possible to demarcate a single baseline for a particular biomarker to identify the presence of the disease. The studies also reveal that the variation of the CLD can be of different forms like mild cirrhosis, cirrhosis with AH, fibrosis, and advanced fibrosis. The presence of these diseases is identified from the concentration (in ppb) of the biomarkers in the breath emitted during exhalation. In addition to the biomarkers, knowledge of the age, smoking status, alcohol-use status, BMI, fasting status, food habits, etc. of the patient can help provide a better understanding of the health of a person's liver and kidneys.
[0038] Data Acquisition
[0039] After a fast of four to eight hours, exhaled breath samples are collected from people aged ten to 70 who have different types of liver or kidney diseases, and healthy individuals (activity 10 of
[0040] Features Reduction
[0041] A particular dataset may contain many input features, making the predictive modelling task more complicated for that dataset. Because it is complicated to visualize or make predictions for a training dataset with a high number of features, dimensionality reduction techniques are required for such cases. If a machine learning model is trained on high-dimensional data, it becomes overfitted, resulting in poor performance. The dimensionality reduction technique is a way of converting a higher-dimension dataset into a lesser-dimension dataset, ensuring that it provides similar information. Principal Component Analysis, Forward Feature Selection, Backward Feature Selection are data reduction techniques that can be used before the model development, to reduce the dimensionality of the dataset acquired as described above (activity 14 of
[0042] Classification Model
[0043] A classification for predicting healthy and unhealthy subjects can be built on the dimension-reduced data set using a parametric classification model like Logistic Regression, a machine learning model like Decision Tree, Random Forest, or a deep learning model like Neural Network (activity 16 of
[0044] The resulting adaptive model for separating diseased and healthy individuals based on VOCs in the breath subsequently can be used to determine or predict the health status of individuals (activity 20 in
[0045] The above-noted steps of dimensionality reduction, feature concatenations, classification, validation, and prediction of health status can be made by a suitable computing device, such as but not limited to an edge-cloud server, programmed with computer-executable instructions that, when executed by the computing device, cause the computing device to carry out the logical operations in accordance with the above-noted techniques.