SYSTEM AND METHOD FOR QUANTIFICATION OF BACTERIA IN WATER USING FLUORESCENCE SPECTRA MEASUREMENTS AND MACHINE-LEARNING
20210389243 · 2021-12-16
Inventors
- Shlomo SELA (Modi'in-Macabim-Re'ut, IL)
- Amir NAKAR (Hadera, IL)
- Michael BORISOVER (Herzliya, IL)
- Ze'ev SCHMILOVITCH (Yehud, IL)
Cpc classification
G01N21/6486
PHYSICS
C12Q1/04
CHEMISTRY; METALLURGY
International classification
Abstract
The present invention provides a system and a method for rapid quantification of bacteria in high-quality water using fluorescence spectra measurements and machine- learning. The invention is applicable to drinking water distribution systems, water purification plants, food and beverage industry, and pharma and medical industry.
Claims
1. A method for quantifying bacteria in water comprising: i) obtaining a water sample; ii) generating an excitation-emission matrix (EEM) for said water sample; and iii) determining the concentration of bacteria in said water sample by correlating said EEM with calibrated data.
2. The method according to claim 1, wherein the calibrated data is obtained by determining the EEM of a plurality of test samples having known bacterial concentrations.
3. The method according to claim 1 or 2, wherein the EEM is generated by scanning excitation wavelengths from 200 to 800 nm in 1-5 nm steps, and detecting the emitted fluorescence in 1-5 nm steps between 200 and 800 nm.
4. The method according to claim 3, wherein the EEM is generated by scanning excitation wavelengths from 220 to 410 nm in 5 nm steps, and detecting the emitted fluorescence in 2 nm steps between 220 and 410 nm
5. The method according to claim 1, for monitoring water quality in drinking water distribution systems, water purification plants, food and beverage industry or pharma and medical industry.
6. A system for quantification of bacteria in water, the system comprising: a) device for generating an excitation-emission matrix (EEM) of a water sample; and b) logic circuity suitable for correlating said EEM with the bacterial concentration of said water sample.
7. The system according to claim 6, wherein the logic circuitry comprises or is associated with data managing and processing apparatus.
8. The system according to claim 7, wherein the processing apparatus is a machine-learning model trained using a set of historical data.
9. The system according to claim 6, for monitoring water quality in drinking water distribution systems, water purification plants, food and beverage industry or pharma and medical industry.
10. The system according to claim 6, for incorporation into an online monitoring system.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0015]
[0016]
[0017]
DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION
[0018] The present system and method usefully provide a mathematical modeling approach that utilizes water fluorescence measurements to extract data related to the total number of bacteria in water. The data are processed using algorithms based on methods, such as Partial Least Squares Regression (PLSR), which through machine-learning can analyze complex excitation-emission matrix (EEM) data and correlate these data to the number of bacteria in a high quality water sample.
[0019] It was presently found that measuring excitation-emission matrices (EEM) or maps of fluorescence intensities of water samples provide a way to rapidly and precisely quantify bacteria in high-quality water samples. Specifically, the linear regression between fluorescence intensities and the real bacterial concentration (as CFUs/ml) of a training set of samples (here, approximately 80% of the samples), was used for obtaining a prediction formula. This prediction formula is then be used for determining the bacterial concentration in water samples. The model is tested using a validation set of samples (here, approximately 20% of the samples). Two studies were conducted: 1) determination of E. coli concentration in double-distilled water; and 2) determination of bacterial density in natural ground water serving as drinking water.
[0020] In a first aspect, the invention is directed to a method for quantification of bacteria in water comprising: obtaining a water sample; generating an excitation-emission matrix (EEM) for said water sample; and determining the concentration of bacteria in said water sample by correlating said EEM with calibrated data.
[0021] In some embodiments the calibrated data is obtained by determining the EEM of a plurality of test samples having known bacterial concentrations. The bacterial concentrations may be determined using the heterotrophic plate count (HPC) method which quantifies aerobic mesophilic bacteria.
[0022] The water samples are scanned by spectrofluorophotometer using excitation wavelengths at a range of 200-800 nm, and an emission spectrum at each excitation wavelength at a range of 200-800 nm. Intensities at each excitation/emission wavelength are divided by the Raman scatter intensity of pure non-fluorescent water in order to normalize the data and minimize effects of machine/lamp instability.
[0023] In some embodiments, the EEM is generated by scanning excitation wavelengths from 200 to 800 nm in 1-5 nm steps, and detecting the emitted fluorescence in 1-5 nm steps between 200 and 800 nm. According to some embodiments, the EEM is generated by scanning excitation wavelengths from 220 to 400 nm in 5 nm steps, and detecting the emitted fluorescence in 2 nm steps between 220 and 410 nm.
[0024] Another embodiment relates to a system for quantification of bacteria in water, the system comprises: device for generating an excitation-emission matrix (EEM) of a water sample; and logic circuity suitable for correlating said EEM with the bacterial concentration of said water sample.
[0025] According to some embodiments, the logic circuitry comprises or is associated with data managing and processing apparatus. In some further embodiments the apparatus is a machine-learning model trained using a set of historical data.
[0026] In some embodiments, the method and system may be applied for monitoring water quality in drinking water distribution systems, water purification plants, food and beverage industry or pharma and medical industries.
[0027] The invention provides a system and method for real-time monitoring of bacterial concentration enabling effective water management for assuring the quality and safety of water. According to some embodiments the system may be incorporated into online monitoring systems in water plants or water distribution systems thereby providing accurate assessment of the microbiological quality of water.
[0028] The invention will be further described and illustrated in the following examples.
EXAMPLES
Example 1
[0029] Finding Prediction Formula for Determining Bacterial Concentration
[0030] Water samples, containing known bacterial concentrations (counts) were scanned by spectrofluorophotometer (Shimadzu, RF-5301PC, Kyoto, Japan) using excitation wavelengths range of 220-400 nm, and the emission spectrum at each excitation wavelength is obtained at the 220-410 nm range for generating an excitation-emission matrix (EEM). Intensities at each excitation/emission wavelength were divided by the Raman scatter intensity of pure non-fluorescent water in order to normalize the data and minimize effects of machine/lamp instability.
[0031] Principal Component Analysis (PCA; JMP Pro 13, SAS Institute, Cary, USA) is used to exclude major outliers (usually, <5% of samples). The samples are assigned their known bacterial concentration, described as the dependent variable, while the fluorescence intensities at each of the excitation/emission wavelength pairs are defined as the independent variables, thus producing 3775 different independent variables. The samples are then randomly divided into training (80%) and validation (20%) sets. This division means that the prediction formula will be based on 80% of the data, and tested on the remaining 20% of the data in order to evaluate the model's quality. A JMP Partial Least Squares (PLS) algorithm is used to obtain a prediction loadings formula, which is a list of weights given to fluorescence intensity at each combination of excitation/emission wavelength (i.e., the independent variable). This formula is applied to the validation dataset to predict bacterial counts (i.e. dependent variable). In order to test the model's prediction quality, a linear regression between the actual and predicted values is calculated and characterized with R.sup.2 and Root Mean Square Error as measures of correlation and deviation respectively.
[0032]
[0033] The prediction formula is calculated as a sum of all fluorescence intensities of each excitation-emission pair (X) multiplied by their individual coefficient (a) and an error (b), as shown in the following formula:
Example 2
[0034] Determination of E. Coli Concentration in Double-Distilled Water
[0035] In order to initially test the ability to rapidly and precisely quantify bacteria in water samples by measuring excitation-emission matrices (EEM), 54 samples containing increasing concentrations (0-10.sup.8 CFU/ml) of Escherichia coli in double distilled (non-fluorescent) water were prepared. Dataset containing bacterial concentrations and their cognate EEMs in 43 water samples (80%) were used as a training set, as described in Example 1. Each bacterial concentration was done in 5 replications. The derived model was used to predict the number of bacteria in the 11 samples (20% validation set).
[0036]
[0037] The model was able to detect the number of the E. coli cells at concentrations as low as 100 CFU/ml in the validation data. It correctly predicted bacterial concentration with a root mean square error (RMSE) of 0.61 log.sub.10 CFU per ml and accounted for more than 90% of the variation.
Example 3
[0038] Determination of Bacterial Density in Natural Ground Water Serving as Drinking Water
[0039] Applying of the method of the invention in critical control points of local and national drinking water distribution systems will enable real-time monitoring of the microbial quality of the water and consequently will allow prompt response in case of temporal deterioration of water quality.
[0040] In order to examine the validity of the method, 69 samples of drinking water (ground water) of known microbiological content (HPC determined by international standard methods) were read by spectrofluorometer to obtain EEM, as described in Example 1. The water samples were divided into two groups, containing 51 samples for training (74%) and 18 samples as a validation set (26%). Modeling was performed on the data from the 51 samples and the bacterial counts (HPC) in the 18 validation samples were determined.
[0041]
[0042] The model enabled enumeration of HPC at concentrations as low as 200 CFU/ml in the overall data. The RMSE of the predicted bacterial concentration was 20 CFU/ml and the model accounted for more than 90% of the variation.
[0043] Although embodiments of the invention have been described by way of illustration, it will be understood that the invention may be carried out with many variations, modifications, and adaptations, without exceeding the scope of the claims.