ENVIRONMENTAL ANOMALY DETECTING SYSTEM AND METHOD THEREOF
20250356267 ยท 2025-11-20
Inventors
- Yi-Jyun LIN (Kaohsiung City, TW)
- Yi-Hui LIN (Kaohsiung City, TW)
- Li-Wei CHEN (Kaohsiung City, TW)
- Ming-Che TSAI (Kaohsiung City, TW)
Cpc classification
G08B21/12
PHYSICS
F24F11/64
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
F24F11/32
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
G06F18/241
PHYSICS
F24F2110/20
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
F24F2140/50
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
International classification
F24F11/32
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
F24F11/64
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
Abstract
An environmental anomaly detecting system, applied to an air detection in an environment. The environmental anomaly detecting system includes: an air conditioning equipment, used for providing an air flow to the environment; at least one sensor, used for obtaining a feature data relating to the air flow; and a computing module, communicatively connecting to the at least one sensor, used for inputting the feature data into each of multiple machine learning models to obtain multiple first prediction labels. The computing module is used for inputting the multiple first prediction labels into an ensemble model to obtain a second prediction label, and the second prediction label is used for indicating whether the air flow is abnormal.
Claims
1. An environmental anomaly detecting system, applied to an air detection in an environment, the environmental anomaly detecting system comprising: an air conditioning equipment, used for providing an air flow to the environment; at least one sensor, used for obtaining a feature data relating to the air flow; and a computing module, communicatively connecting to the at least one sensor, used for inputting the feature data into each of multiple machine learning models to obtain multiple first prediction labels, wherein the computing module is used for inputting the multiple first prediction labels into an ensemble model to obtain a second prediction label, and the second prediction label is used for indicating whether the air flow is abnormal.
2. The environmental anomaly detecting system of claim 1, wherein a number of the at least one sensor is bigger than 1, and one of the sensors is deployed in an intake area of the air conditioning equipment, and the air flow enters the environment through the intake area, wherein another one of the sensors is deployed in the environment.
3. The environmental anomaly detecting system of claim 2, wherein the feature data comprises a temperature and a humidity.
4. The environmental anomaly detecting system of claim 3, wherein the computing module is further used for obtaining an outdoor temperature and a load rate of the air conditioning equipment, and the feature data further comprises the outdoor temperature and the load rate.
5. The environmental anomaly detecting system of claim 1, wherein the machine learning models comprise a clustering algorithm, a time series forecasting algorithm, and a decision tree algorithm.
6. The environmental anomaly detecting system of claim 1, wherein the computing module is used for calculating a classification indicator of each of the machine learning models in a training stage, wherein the computing module adds a constant to the classification indicator to calculate a weight of each of the machine learning models, and the ensemble model is used for summing a multiplication of each of the weights of the machine learning models and each of the first prediction labels of the machine learning models to obtain a prediction value, and is used for determining whether the prediction value is higher than a threshold to generate the second prediction label.
7. The environmental anomaly detecting system of claim 6, wherein the threshold is set to be less than or equal to 0.6.
8. The environmental anomaly detecting system of claim 7, wherein the threshold is set to be less than or equal to 0.3.
9. The environmental anomaly detecting system of claim 6, wherein the classification indicator is calculated according to the following equations:
10. The environmental anomaly detecting system of claim 9, wherein the computing module is used for calculating the weights according to the following equation:
11. The environmental anomaly detecting system of claim 6, wherein each of the machine learning models has multiple model parameters, and the computing module is used for re-training the machine learning models and the ensemble model at every preset time interval according to a new feature data to renew the model parameters, the classification indicators, and the weights.
12. An environmental anomaly detecting method, applied to the environmental anomaly detecting system according to claim 1, the environmental anomaly detecting method comprising: a data capture step for obtaining the feature data of the air flow; an input step for inputting the feature data into each of the multiple machine learning models to obtain the multiple first prediction labels; and an ensemble step for inputting the multiple first prediction labels into the ensemble model to obtain the second prediction label, and the second prediction label is used for indicating whether the air flow is abnormal.
13. The environmental anomaly detecting method of claim 12, wherein the feature data comprises a temperature and a humidity.
14. The environmental anomaly detecting method of claim 12, wherein the machine learning models comprise a clustering algorithm, a time series forecasting algorithm, and a decision tree algorithm.
15. The environmental anomaly detecting method of claim 12, further comprising: calculating a classification indicator of each of the machine learning models in a training stage; adding a constant to the classification indicator to calculate a weight of each of the machine learning models; and summing a multiplication of each of the weights of the machine learning models and each of the first prediction labels of the machine learning models to obtain a prediction value, and determining whether the prediction value is higher than a threshold to generate the second prediction label with the ensemble model.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0016] The foregoing aspects and many of the accompanying advantages of this disclosure will become more readily appreciated as the same becomes better understood by reference to the following detailed description, when taken in conjunction with the accompanying drawings.
[0017]
[0018]
[0019]
[0020]
DETAILED DESCRIPTION
[0021] The terms first, second, and the like, as used herein, are not intended to mean a sequence or order, and are merely used to distinguish elements or operations described in the same technical terms.
[0022]
[0023]
[0024] Therefore, a model may be built with the machine learning algorithm and the historical data of the temperature and humidity sensor, and the model may be used for detecting the changes of the air conditioning equipment more accurately. Also, through the repeated abnormal simulation experiments, the accuracy and real-time performance of the model is validated to apply to various time intervals.
[0025] At last, the final anomaly detecting model is built by ensemble learning with blending. This method can combine the advantages of multiple models, not only detecting the anomaly faster but also improving the accuracy.
[0026] The detailed explanation of the procedure to perform the machine learning and the ensemble model in the embodiment is described as following.
[0027]
[0028] The prediction labels 331-333 represent whether the air flow 111 is abnormal. In the training stage, any component or parameter of the air conditioning equipment 110 may be deliberately adjusted to generate the ground truth label for anomaly. For example, the heater, the humidifier, and/or the cold water valve of the air conditioning equipment 110 may be turned off, or the target temperature and the humidity may also be adjusted to the abnormal range. The corresponding ground truth label at this time is abnormal. In this embodiment, the prediction labels 331-333 are discrete labels which are 1 or 0, presenting abnormal or normal. In other embodiments, the prediction labels 331-333 may also be continuous values, presenting the abnormal probability of the air flow. In other words, the machine learning model 321-323 may be used for classification or regression, but is not limited thereto.
[0029] In the step 203, the prediction labels 331-333 are inputted into the ensemble model 340 to obtain the prediction label 341, which is used for indicating whether the air flow is abnormal. The ensemble model 340 is used for combining the outputs of multiple machine learning models 321-323 to generate a more accurate output. For example, each of the machine learning models 321-323 may be assigned a weight, and the ensemble model 340 sums the multiplication of each of the weights and each of the prediction labels 331-333 of the machine learning models 321-323 to obtain a prediction value, which is presented by Equation (1):
where represents the prediction value. x is a positive integer, y.sub.x is the x.sub.th prediction label of the prediction labels 331-333, and w.sub.x is the weight corresponding to the x.sup.th machine learning model. In some embodiments, the weights w.sub.x of the machine learning models 321-323 are assigned according to the accuracy of each of the machine learning models 321-323 that the models with higher accuracy are assigned the higher weights. Specifically, in the training stage, each feature data forms a sample, and these samples are classified into the abnormal or normal class by the machine learning models 321-323 then the classification indicator of each machine learning model may be calculated based on the classification result. This classification indicator is calculated by Equations (2)-(5) as follows:
where S is the classification indicator of one of the machine learning models, i, j, j.sub.k, and k are positive integers, m represents a number of all samples. n represents a number of a class to which the i.sup.th sample belongs and the i.sup.th sample and the j.sup.th sample belong to the same class, n.sub.k represents a number of a k.sup.th class and the i.sup.th sample and the j.sub.k.sup.th sample belong to different classes, and distance( ) is used for calculating a distance between two samples, for example, calculating an Eurasian distance between the vectors generated from the feature data. In other words, Equation (5) is used for calculating the distance between two samples belonging to different classes, and the greater distance indicates a greater separation between the two classes that the bigger the calculated value b (i) indicates a better classification result. Equation (4) is used for calculating the distance between two samples belonging to the same class, and the smaller distance indicates a higher concentration of the samples in the same class that the smaller the calculated value a (i) indicates a better classification result. Equation (3) combines the value a (i) and b (i), and the bigger the value b (i) and the smaller the value a (i), the bigger the calculated value s (i), indicating a better classification result. At last, Equation (2) calculates an average of the values s (i) corresponding to all samples as the classification indicator S of a machine learning model. Each of the machine learning models 321-323 corresponds to an individual classification indicator S, and the bigger the classification indicator S indicates a better classification result.
[0030] The value range of the classification indicator S is [1,1], and each of the classification indicators may be added a constant (for example, 1) to adjust the value range of the classification indicator S to [0,2] in order to calculate the weights of the machine learning models 321-323, and the bigger the classification indicator, the bigger the weight to be assigned. In some embodiments, the weight is calculated by Equation (6):
where S.sub.x is the classification indicator of the x.sup.th machine learning model. In other words, Equation (6) adds 1 to each classification indicator first then divides it by the sum of classification indicators corresponding to all machine learning models 321-323, to perform a normalization and calculate the weight w.sub.x.
[0031] In some embodiments, the weights of the machine learning models 321-323 may be determined by iteration, and the weights of the samples are adjusted at each iteration. Specifically, as described below,
represents the weight of the x.sup.th machine learning model at the i.sup.th iteration. In this embodiment, the above equation (1) may be rewritten as Equation (7):
[0032] As described below, .sub.n represents the weight of the n.sup.th sample, and all the weights are assigned to a same value at the 1.sup.st iteration. Next, the training samples are inputted into the machine learning model to calculate the error rate of each machine learning model, which is defined as the multiplication of the weights of the samples and the classification results, represented by Equation (8):
where f.sub.x(n) represents the classification result of the n.sup.th training sample obtained by the x.sup.th machine learning model. When the classification is correct, f.sub.x(n)=0, while the classification is wrong, f.sub.x(n)=1. Then the weight
is calculated based on the error rate represented by Equation (9):
[0033] At last, the weights .sub.n are renewed according to the classification result of all machine learning model at all iterations represented by Equation (10):
where .sub.n represents the renewed weight. {circumflex over (f)}.sub.x(n) represents the classification result of the weighted sum of the n.sup.th sample obtained by all machine learning models 321-323, which is set to 0 when the classification is correct and set to 1 when the classification is wrong. is an integer. After the weights .sub.n are renewed, all the weights .sub.n are normalized, for example, divided by the sum of all the weights .sub.n, then the next iteration (i=i+1) is performed. In Equation (8), the error rate is calculated based on the weight of the sample, thus the sample with the bigger weight contributes more to the error rate. In Equation (9), the weight
is determined according to the error rate Ex, and when the error rate is smaller the weight
becomes bigger. At last, the weight of the sample is renewed by Equation (10). If the classification is wrong, the weight is increased, and if the classification is correct, the weight is decreased, so that the sample misclassified is previously focused on in the next iteration. Here, a preset number of the iterations may be set, or the iteration may be set to stop when the error rate is lower than a value. In this way, multiple weak classifiers are combined to a strong classifier by the combination of not only the multiple machine learning models but also the classification results of the multiple iterations.
[0034] No matter in Equation (2) or (7), after calculating the prediction value , the prediction label 341 is generated by determining whether the prediction value is higher than a threshold, for example, when the prediction value y is higher than the threshold, the prediction label 341 is set to 1 for anomaly, or the prediction label 341 is set to 0. The value range of the threshold is [0,1], which may be set according to the experiment result in this embodiment.
[0035] When the anomaly of the air flow is determined, the computing module 130 delivers a massage via any human-machine interface or equipment, for example, by showing specific words, numbers, or images on a screen; emitting light through a luminous source; making a specific sound through a speaker; or sending a text massage to a relevant person's mobile phone. In this way, the relevant operators can learn about the anomaly of the air flow 111 then arrange the equipment maintenance.
[0036] Due to the temperature and humidity data may be changed by the season and weather, the trained machine learning models 321-323 and ensemble model 340 mentioned above require adaptive adjustments. Each machine learning model includes multiple model parameters, for example, the weights of neurons in the neural networks, and the thresholds used for determining which branch to take in the random forest, which need to be recalculated to meet the environment conditions after the changes. Similarly, the classification indicators and the weights in the ensemble model 340 also need to be recalculated. In some embodiments, the trained machine learning models 321-323 and the ensemble model 340 mentioned above may be re-trained according to new feature data at every time interval (for example, a few weeks, months, or any time) to renew the model parameters, the classification indicators S.sub.x and the weights w.sub.x. Therefore, the model parameters adjustment is performed when unexpected deviations occur in the data (for example, cold snaps, foehn winds) and the model is re-adjusted according to the new data. In other words, re-training the model regularly enables the model health to be maintained at a high level and ensures the models to adapt the changing environment.
[0037] More specifically, this embodiment uses the historical data 2 weeks before the forecast date as the training set to build the models, and because the difference in the model parameters affects the accuracy of the anomaly determination, at every renewal of the model, the best model parameter is selected by nested cross-validation to improve the accuracy of the model. To ensure the accuracy of the model is not decreased by shifts or changes in data distribution, these machine learning models are also regularly renewed, re-trained, and re-adjusted according to the new data to obtain better performance in the new data.
[0038] The steps in
[0039] In the environmental anomaly detecting system and the environmental anomaly detecting method disclosed in this embodiment, through inputting the first prediction label of each machine learning model into the ensemble model, and obtaining the second prediction label by integrating with blending ensemble method to indicate whether the air flow is abnormal, the prediction is performed to implement an effective detection and to detect and prevent the environmental anomalies earlier.
[0040] Therefore, when the environmental anomaly detecting system is deployed in the manufacturing site or the production line, the following two technical effects can be achieved: [0041] 1. Abnormal event warning: The early warning is performed before the temperature and the humidity anomalies happen in the manufacturing site to help the production line to make relevant preparation and take corresponding measures to reduce the possibility of defective products. [0042] 2. Products and material quality protection: Most of materials keep their qualities under the specific temperature and humidity control, and damage when the condition is under the standard. Therefore, through the anomaly detecting mechanism, timely adjustment will be performed to protect the products and the qualities of the materials.
[0043] It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the disclosure without departing from the scope or spirit of the disclosure cover modifications and variations of this disclosure provided they fall within the scope of the following claims.