METHOD AND DEVICE FOR DETECTING SMOKE
20220230519 · 2022-07-21
Abstract
The invention relates to a machine learning system (10) which is configured, on the basis of a plurality of images captured in succession, to detect smoke (12) within the images. The machine learning system (10) comprises a convolutional recurrent neural network. The invention also relates to a method for detecting smoke by means of this machine learning system.
Claims
1. A machine learning system (10) comprising a convolutional recurrent neural network, wherein the machine learning system is parameterized, on the basis of successive images of a sequence of images, to detect smoke (12) in the images.
2. The machine learning system (10) as claimed in claim 1, which is further parameterized to output an output variable that characterizes whether smoke (12) is represented within the images.
3. The machine learning system (10) as claimed in claim 1 which is parameterized, on the basis of each of the individual images, to output a matrix as the output variable, wherein the elements of the matrix are each assigned to a section of a predetermined plurality of sections of the respective image, and wherein the elements of the matrix characterize whether smoke (12) is represented within this assigned section of the respective image.
4. The machine learning system (10) as claimed in claim 1, which comprises a plurality of concatenated convolutional LSTM modules (20), wherein a number of filters that the respective convolutional LSTM module (20) comprises increases with increasing depth of a position of the respective convolutional LSTM module, wherein a pooling layer is positioned between each of the convolutional LSTM modules (20), wherein the final convolutional LSTM module (20) of the plurality of the concatenated convolutional LSTM modules (20) is connected to a fully connected neural network, which is parameterized to output the output variable of the machine learning system (10).
5. The machine learning system (10) as claimed in claim 1, further comprising an input, wherein the input is configured to provide the plurality of images captured by a camera to the machine learning system (10).
6. A method for detecting smoke (12) within an image by means of a machine learning system (10), which is parameterized, on the basis of a plurality of images captured in direct succession, to detect smoke (12) in the images, wherein the machine learning system (10) comprises a plurality of layers connected in a specified sequence and at least one of the layers comprises a convolutional recurrent neural network, said method comprising the following steps: obtaining a plurality of images captured in direct succession; and propagating the plurality of the captured images through the machine learning system (10) in succession, wherein during the propagation, the images are processed successively by the layers of the machine learning system (10) and the final layer of the sequence of layers outputs the output variable.
7. The method as claimed in claim 6, wherein the machine learning system (10) is trained on the basis of a plurality of training data, comprising training images (x) and respectively assigned training output variables (y_s), wherein the training images (x) are propagated through the machine learning system (10) during the training and, on the basis of the determined output variables of the machine learning system (10) and the respectively assigned training output variables of the training images, a parameterization of the machine learning system (10) is adjusted in such a way that a deviation between the determined output variables and the training output variables becomes a minimum.
8. The method as claimed in claim 6, wherein a smoke detector is activated on the basis of the determined output variable of the machine learning system (10).
9. A non-transitory, computer-readable medium that contains instructions that when executed on a computer cause said computer to detect smoke (12) within an image by obtaining a plurality of images captured in direct succession; and propagating the plurality of the captured images through a machine learning system (10) in succession, wherein the machine learning system (10) comprises a plurality of layers connected in a specified sequence and at least one of the layers comprises a convolutional recurrent neural network, and wherein during the propagation, the images are processed successively by the layers of the machine learning system (10) and the final layer of the sequence of layers outputs an output variable.
10. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0038] In the following, exemplary embodiments are described in more detail by reference to the accompanying drawings. In the drawings:
[0039]
[0040]
[0041]
[0042]
DETAILED DESCRIPTION
[0043]
[0044] In a further embodiment of the device, the machine learning system (10) is connected to a smoke detector, for example. The smoke detector determines whether or not the machine learning system should trigger an alarm, depending on the output variable of the machine learning system.
[0045]
[0046]
[0047]
[0048] The ConvLSTM (20) receives three different input variables (x,h,c). Based on these three input variables (x,h,c), the ConvLSTM (20) determines two output variables (c′,h′). A first input variable (x) is the input variable of the respective ConvLSTM layer, or the image. A second input variable (h) is an output variable determined by the ConvLSTM (20) at an earlier processing/time step, in particular the immediately preceding one. This means that this output variable contains information from the previously processed input variables of this ConvLSTM. A third input variable (c) is an internal state of the ConvLSTM (20), which is updated depending on the first and second input variables (x,h), in particular at each processing/time step.
[0049] In order to determine the two output variables (h′,c′), the first and second input variables are filtered by means of different filters (fh,fx,ch,cx,ih,ix,oh,ox) and summed according to the combinations shown in
[0050] The results of the activation functions are then processed according to the combinations shown in
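By way of illustration only, one processing/time step of a ConvLSTM as described in paragraphs [0048] to [0050] can be sketched as follows. The sketch assumes the widely used ConvLSTM formulation, since the exact combinations are defined by the figure. In particular, the assignment of the filter pairs (fx,fh), (ix,ih), (ox,oh) and (cx,ch) to a forget gate, an input gate, an output gate and a candidate state is an assumption, as are the sigmoid and tanh activation functions, the omission of bias terms, and the restriction to single-channel images. The helper names conv2d_same, gate and convlstm_step are hypothetical.

```python
import math


def conv2d_same(image, kernel):
    """Filter a 2-D list with an odd-sized square kernel (zero padding, same size)."""
    rows, cols = len(image), len(image[0])
    size = len(kernel)
    radius = size // 2
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = 0.0
            for di in range(size):
                for dj in range(size):
                    y, x = i + di - radius, j + dj - radius
                    if 0 <= y < rows and 0 <= x < cols:
                        acc += kernel[di][dj] * image[y][x]
            out[i][j] = acc
    return out


def sigmoid(v):
    """Numerically stable logistic function."""
    if v >= 0:
        return 1.0 / (1.0 + math.exp(-v))
    e = math.exp(v)
    return e / (1.0 + e)


def gate(x, h, filter_x, filter_h, activation):
    """Filter x and h separately, sum the results, apply the activation."""
    a = conv2d_same(x, filter_x)
    b = conv2d_same(h, filter_h)
    return [[activation(p + q) for p, q in zip(row_a, row_b)]
            for row_a, row_b in zip(a, b)]


def convlstm_step(x, h, c, filters):
    """Map the inputs (x, h, c) to the outputs (c', h') for one time step."""
    f = gate(x, h, filters["fx"], filters["fh"], sigmoid)    # assumed: forget gate
    i = gate(x, h, filters["ix"], filters["ih"], sigmoid)    # assumed: input gate
    o = gate(x, h, filters["ox"], filters["oh"], sigmoid)    # assumed: output gate
    g = gate(x, h, filters["cx"], filters["ch"], math.tanh)  # assumed: candidate state
    rows, cols = len(x), len(x[0])
    # New internal state: keep part of the old state, add part of the candidate.
    c_new = [[f[r][k] * c[r][k] + i[r][k] * g[r][k] for k in range(cols)]
             for r in range(rows)]
    # New output: gated, squashed internal state.
    h_new = [[o[r][k] * math.tanh(c_new[r][k]) for k in range(cols)]
             for r in range(rows)]
    return c_new, h_new
```

When a sequence of images is processed, the pair (c′,h′) returned for one image would be passed back in as (c,h) for the next image, which is how information from earlier images reaches later time steps.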
[0051]
[0052] The method starts at step 300. In this step, the machine learning system (10) receives the plurality of images captured in succession by the camera (11).
[0053] In the following step 310, the individual images are propagated through the machine learning system (10) in succession.
[0054] In step 320, the machine learning system (10) outputs an output variable after each of the images has been propagated through the machine learning system.
[0055] In the following step 330, it can then be decided, based on the output variables output by the machine learning system (10), whether smoke is present in the area surrounding the camera. This can be carried out, for example, by comparing the output variable of the machine learning system with a threshold value. If the output variable is greater than the threshold value, the decision can be made that smoke is present. Optionally, step 340 can be performed after completing step 330. In step 340, a smoke detector is activated depending on the result of step 330. For example, the smoke detector can issue a warning signal such as a warning tone if the output variable of the machine learning system has exceeded the threshold value.
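Purely as an illustrative sketch, steps 300 to 340 can be expressed as follows. The function names detect_smoke and maybe_activate_detector, the default threshold value of 0.5, and the representation of the machine learning system as a callable named model that returns one scalar output variable per image are assumptions made for this example. A recurrent model would keep its internal state between calls.

```python
def detect_smoke(images, model, threshold=0.5):
    """Steps 300 to 330: propagate the images in succession and decide per image."""
    decisions = []
    for image in images:                        # steps 300/310: images in succession
        output_variable = model(image)          # step 320: one output per image
        decisions.append(output_variable > threshold)  # step 330: strict comparison
    return decisions


def maybe_activate_detector(decisions, alarm):
    """Optional step 340: activate the smoke detector if smoke was decided."""
    if any(decisions):
        alarm()
        return True
    return False


if __name__ == "__main__":
    # Stand-in model (hypothetical): replays fixed scores instead of a trained network.
    scores = iter([0.1, 0.4, 0.9])
    result = detect_smoke(["img0", "img1", "img2"], lambda image: next(scores))
    maybe_activate_detector(result, lambda: print("warning tone"))
```

The comparison is strict because the description states that smoke is decided when the output variable is greater than the threshold value.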
[0056] In a further embodiment of the method for detecting smoke, the machine learning system (10) can be trained before step 300 is executed.
[0057] When training the machine learning system (10), an optimization method, preferably a gradient descent method with gradients determined by backpropagation-through-time, is used to optimize the parameterization of the machine learning system with respect to a loss function. The loss function characterizes, as a function of the parameterization, a difference between the determined output variables and the supplied training output variables. To optimize the parameterization, gradients of the loss function are determined and the parameterization is then adjusted according to the gradients determined.
[0058] Training data containing training images is provided for training purposes, and the machine learning system determines the output variables based on these training images. Furthermore, the training output variables are assigned to the training images.
[0059] The training step can be executed several times in succession until a predefined abort criterion is met, for example until the difference or a change in this difference is less than a predefined value.
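As a minimal sketch of the repeated training step and the abort criterion, the following example applies plain gradient descent to a toy model with a single parameter w, which stands in for the parameterization of the machine learning system. The toy model (output = w * x), the mean squared error as the loss function, the analytically computed gradient (in place of backpropagation-through-time), and the names train, lr, tol and max_steps are all assumptions made for this illustration.

```python
def train(xs, ys, w=0.0, lr=0.1, tol=1e-9, max_steps=10_000):
    """Repeat the training step until the change in the loss falls below tol."""

    def loss(weight):
        # Mean squared difference between determined and training outputs.
        return sum((weight * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

    previous = loss(w)
    for step in range(1, max_steps + 1):
        # Gradient of the loss with respect to the single parameter w.
        grad = sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        w -= lr * grad                      # adjust the parameter along the gradient
        current = loss(w)
        if abs(previous - current) < tol:   # abort criterion on the change in loss
            return w, current, step
        previous = current
    return w, previous, max_steps


if __name__ == "__main__":
    weight, final_loss, steps = train([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
    print(weight, final_loss, steps)
```

The max_steps bound is an additional safeguard assumed here so that the loop ends even if the abort criterion is never met.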
[0060]
[0061] Furthermore, the device (40) can comprise a computing unit (44) and a storage element (45).