METHOD FOR GENERATING TRAINED PREDICTION MODEL THAT PREDICTS AMOUNT OF DROSS GENERATED IN MELTING FURNACE, METHOD FOR PREDICTING AMOUNT OF DROSS GENERATED IN MELTING FURNACE, AND COMPUTER PROGRAM
20250164952 ยท 2025-05-22
Assignee
Inventors
Cpc classification
G05B17/00
PHYSICS
International classification
Abstract
A method of generating a trained model includes: a step of acquiring a process state parameter for every single charge (S110); a step of performing preprocessing by applying machine learning to a data set of one or more process state parameters acquired through m charges (where m is an integer of 2 or greater) (S130); a step of generating a learning data set (S140); and a step of generating a trained model (S150). The learning data set is generated based on n-dimensional features (where n is an integer of 1 or greater) that have been extracted through the preprocessing, and at least contains one or more process target parameters representing process fundamental information that is set for every single charge.
Claims
1. A method of generating a trained prediction model for predicting an amount of dross occurring in a melting furnace, comprising: a step of acquiring one or more process state parameters of different attributes for every single charge spanning from a loading of raw materials to a completion of melting, wherein each process state parameter is defined by a continuous aggregate of chronological data that is acquired based on an output from one of a variety of sensors provided in the melting furnace; a step of performing preprocessing by applying machine learning to a data set of the one or more process state parameters acquired through m charges (where m is an integer of 2 or greater), the preprocessing comprising extracting n-dimensional features (where n is an integer of 1 or greater) from each process state parameter containing an aggregate of chronological data acquired for every single charge; a step of generating a learning data set based on the extracted n-dimensional features, the learning data set at least containing one or more process target parameters representing process fundamental information that is set for every single charge; and a step of training a prediction model by using the generated learning data set to generate the trained prediction model that predicts the amount of dross generation.
2. The method of claim 1, wherein the learning data set contains one or more disturbance parameters.
3. The method of claim 2, wherein the one or more disturbance parameters include an external environmental factor.
4. The method of claim 1, wherein, the preprocessing further comprises finding a pattern in an aggregate of chronological data defining each process state parameter on the basis of the extracted n-dimensional features to determine a control pattern; and the learning data set further contains the control pattern.
5. The method of claim 4, wherein the preprocessing performs clustering for the extracted n-dimensional features as input data to determine the control pattern, the control pattern containing a label indicating a group that each process of the m charges belongs to.
6. The method of claim 4, wherein, the preprocessing further comprises applying machine learning to an aggregate of chronological data defining at least one of the one or more process state parameters to find a pattern in each process of the m charges and determine a process pattern; and the learning data set further contains the process pattern.
7. The method of claim 6, wherein the preprocessing applies an encoding process and clustering to an aggregate of chronological data defining one of main process state parameters among the one or more process state parameters that directly governs a melting process to determine the process pattern, the process pattern containing a label indicating a group that each process of the m charges belongs to.
8. The method of claim 7, wherein the one of main process state parameters is a combustion gas flowrate.
9. The method of claim 1, wherein, the preprocessing further comprises combining all of the n-dimensional features acquired from each process state parameter for every single charge to generate combined features for every single charge, and applying clustering to the combined features to determine a control pattern containing a label indicating a group that each process of the m charges belongs to; and the learning data set further contains the control pattern.
10. The method of claim 1, wherein, the one or more process state parameters are classified into two or more groups; the preprocessing further comprises combining all of the n-dimensional features that are acquired from each of at least one process state parameter belonging to the same group for every single charge to generate combined features for each group, and applying clustering to the combined features for every group to determine for each group a control pattern containing a label indicating a group that each process of the m charges belongs to; and the learning data set further contains the control pattern for each group.
11. The method of claim 10, wherein, the preprocessing further comprises applying an encoding process and clustering to an aggregate of chronological data defining one of main process state parameters among the one or more process state parameters that directly governs a melting process to determine a process pattern containing a label indicating a group that each process of the m charges belongs to; and the learning data set further contains the process pattern.
12. The method of claim 1, further comprising a step of acquiring one or more other process state parameters that are distinct from the one or more process state parameters, and extracting features from the acquired one or more other process state parameters by a classical method, wherein the step of generating the learning data set comprises generating the learning data set based on the extracted n-dimensional features and the features extracted by the classical method.
13. The method of claim 12, wherein the one or more other process state parameters comprise a component value of a combustion exhaust gas of the melting furnace.
14. The method of claim 1, wherein the trained prediction model predicts an amount of dross generation in a melting furnace used for manufacturing an aluminum alloy.
15. A method of predicting an amount of dross occurring in a melting furnace, comprising: a step of receiving, as inputs at run time, input data containing control pattern candidates, process pattern candidates, and one or more process target parameters indicating process fundamental information to be set for every single charge spanning from a loading of raw materials to a completion of melting; and a step of inputting the input data to a prediction model and outputting a predicted amount of dross generation for every single charge, wherein, the prediction model is a trained model that has been learned by using a learning data set generated by n-dimensional features that are extracted from one or more process state parameters of different attributes; each of the one or more process state parameters is defined by a continuous aggregate of chronological data that is acquired for every single charge based on an output from one of a variety of sensors provided in the melting furnace; and the learning data set contains one or more process target parameters encompassing a data range of the process target parameter or parameters contained in the input data.
16. The method of claim 15, wherein, the input data further contains one or more disturbance parameters; and the learning data set further contains one or more disturbance parameters encompassing a data range of the one or more disturbance parameters contained in the input data.
17. The method of claim 15, further comprising a step of displaying a predicted amount of dross generation for every single charge on a display device.
18. The method of claim 15, further comprising a step of inputting the input data to the prediction model and outputting a control pattern and a process pattern conducive to an amount of dross generation that satisfies a predetermined reference value.
19. A computer program, stored on a non-transitory computer readable storage medium, for causing a computer to execute: a step of acquiring a prediction model to predict an amount of dross occurring in energy efficiency of a melting furnace; a step of receiving input data containing control pattern candidates, process pattern candidates, and one or more process target parameters indicating process fundamental information to be set for every single charge spanning from a loading of raw materials to a completion of melting; and a step of inputting the input data to the prediction model and outputting a predicted amount of dross generation for every single charge, wherein, the prediction model is a trained model that has been learned by using a learning data set generated by n-dimensional features that are extracted from one or more process state parameters of different attributes; each of the one or more process state parameters is defined by a continuous aggregate of chronological data that is acquired for every single charge based on an output from one of a variety of sensors provided in the melting furnace; and the learning data set contains one or more process target parameters encompassing a data range of the process target parameter or parameters contained in the input data.
20. The computer program of claim 19, wherein, the input data further contains one or more disturbance parameters; and the learning data set further contains one or more disturbance parameters encompassing a data range of the one or more disturbance parameters contained in the input data.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
DESCRIPTION OF EMBODIMENTS
[0034] Aluminum alloys and other alloy materials are manufactured through multiple manufacturing processes involving various processes. For example, the manufacturing processes for direct chill (DC) casting of an aluminum alloy may include: a process of melting materials in a melting furnace; a process of holding the melt in a holding furnace to adjust composition and temperature; a process of degassing to remove hydrogen gas by using continuous degassing equipment; a process of removing inclusions by using an RMF (Rigid Media Tube Filter); and a process of casting a slab. The melting process can include, after charging materials into the melting furnace, further processes such as: additional loading of molten metals or raw materials (material recycling); removal of dross; and reheating. This series of processes is in-line processes.
[0035] According to a study by the inventor, optimization of the melting process in the in-line processes is complicated because it is affected by subsequent processes. In addition, there are limits to physical-model based simulations, and thus it is difficult to optimize the processes through simulations.
[0036] Materials manufacturers can accumulate, in a database, a vast amount of chronological process data acquired during the manufacture phase, for example, over a few, ten, twenty, or even more years. Chronological series process data can be associated with design and development information, climate data during manufacturing, test data, etc., and accumulated in a database. Such an aggregate of data is called big data. However, at present, big data has not been effectively utilized by materials manufacturers.
[0037] In view of such problems, the inventor has utilized a data-driven prediction model for amount of dross generation that is constructed by using existing big data, and arrived at a novel technique that can optimize melting process conditions.
[0038] Hereinafter, with reference to the accompanying drawings, a method of generating a trained prediction model for predicting the amount of dross occurring in a melting furnace, a method of predicting the amount of dross occurring in a melting furnace, and an operation support system according to the present disclosure will be described in detail. It should be noted that unnecessarily detailed descriptions may be avoided. For example, to avoid unnecessarily obscuring the present disclosure, well-known features may not be described or substantially the same elements or steps may not be redundantly described, for example. This is also for ease of understanding the present disclosure. In the following description, like elements may be indicated by like reference numerals.
[0039] The embodiments described below are for illustrative purposes. Methods of generating a trained prediction model for predicting the amount of dross occurring in a melting furnace, methods of predicting the amount of dross occurring in a melting furnace, and operation support systems according to the present disclosure are not limited to the embodiments described below. For example, the numerical values, shapes, materials, steps, and the order of the steps shown in the following embodiments are only examples, and various modifications are possible so long as there is no technical contradiction. One embodiment can be used in combination with another so long as there is no technical contradiction.
[0040]
[0041] The sensors measure data at predetermined sampling intervals. Examples of predetermined sampling intervals are 1 second or 1 minute. The data measured by the sensors is stored in a database 100, for example. Communication between the sensors and the database is realized, for example, by wireless communication compliant with the Wi-Fi (registered trademark) standards.
[0042] Now, the terminology used in the present specification will be defined.
[0043] A melting yield in a melting furnace according to the present embodiment means a ratio (AB)/A, relative to a weight A of raw material that is loaded during melting, of a weight resulting by subtracting the weight of dross (amount of dross generation) B occurring from the weight A of raw material. Therefore, if the amount of dross occurring at a given loaded amount at melting can be predicted, then the melting yield can be predicted. Dross is a thick film or lump of metal oxide (or impurity) floating in or on the surface of metal. An example of dross according to embodiments of the present disclosure is aluminum dross. Aluminum dross occurs as an impurity in the aluminum melting process. Aluminum dross is said to contain about 60 to 80% aluminum, and development of technology to recover and recycle aluminum from aluminum dross is underway.
[0044] Chronological data that is acquire based on outputs from the sensors provided in the melting furnace 700 is referred to as process data. Examples of process data are an exhaust gas flowrate (m.sup.3/h), a combustion air flowrate (m.sup.3/h), a combustion gas flowrate (m.sup.3/h), a furnace pressure (kPa), a furnace atmosphere temperature ( C.), and an exhaust gas analysis concentration (%).
[0045] A continuous aggregate of chronological data that is acquired for every single charge, spanning from the loading of raw materials to the completion of melting, is referred to as a process state parameter. In other words, a process state parameter is defined by a continuous chronological aggregate of process data that is acquired for every single charge. Similarly to process data, examples of process state parameters are exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, and furnace atmosphere temperature.
[0046] Data indicating fundamental information of the melting process that is set for every single charge is referred to as a process target parameter. Examples of process target parameters are loaded raw material quantity (ton), loaded molten material quantity (ton), melting time (min), number of charges (ordinal number) as counted from the charge at which a cleaning of the furnace bed is performed, melt component (%) containing an Mg concentration (%), and Mg master alloy addition quantity (ton). A process target parameter is non-chronological data, and is designated as a specific value. In addition to the aforementioned data, process target parameters can include, for example, data concerning the state of the loaded raw material (shape, oxide film thickness, surface deposits) and data concerning the drossing flux used during melting.
[0047] In the present specification, the number of charges as counted from the charge at which a cleaning of furnace bed is performed is referred to as a cleaning factor. As counted from the charge at which a cleaning of furnace bed is performed, a following charge is the first, the next following charge is the second, then continuing onto the third, fourth, etc. In general, the furnace bed is to be cleaned with a frequency of once in ten and several charges, in order to remove any dross adhering to the wall surface or the bed. Therefore, it is foreseeable that the amount of dross is small in a charge immediately after the charge at which a cleaning of furnace bed is performed, and that the amount of dross will increase with an increasing number of charges. In order to take this into account, the cleaning factor is introduced as a process target parameter.
[0048] A parameter that involves an external environmental factor is referred to as a disturbance parameter. An example of a disturbance parameter is climate data, such as hourly average temperature ( C.) and average relative humidity (%). The climate data is chronological data. Other than climate data, disturbance parameters may include data concerning operators and work groups, work time, and so on, for example.
[0049]
[0050] The database 100 is a storage device, such as a semiconductor memory, a magnetic storage device, or an optical storage device.
[0051] The data processing device 200 includes a body 201 of the data processing device and a display device 220. For example, software (or firmware) that is used to generate a prediction model for predicting the amount of dross generation in a melting furnace using data accumulated in the database 100, and software for predicting the amount of dross generation by using a trained prediction model at run time, are implemented in the body 201 of the data processing device. Such software may be commercially available as packaged software stored in a computer-readable storage medium, such as an optical disc, or may be provided through the Internet.
[0052] The display device 220 is, for example, a liquid crystal display or organic EL display. The display device 220 displays a predicted value of amount of dross generation for every charge based on output data that is output from the body 201, for example.
[0053] A typical example of the data processing device 200 is a personal computer. Alternatively, the data processing device 200 may be a dedicated device that functions as an operation support system for a melting furnace.
[0054]
[0055] The input device 210 converts instructions from the user into data, which is in turn input to the computer. The input device 210 is, for example, a keyboard, mouse, or touch panel.
[0056] The communication I/F 230 is an interface for data communication between the data processing device 200 and the database 100. The form and protocol thereof are not limited, as long as data transfer is possible. For example, the communication I/F 230 is capable of wired communication compliant with USB, IEEE1394 (registered trademark), Ethernet (registered trademark), or the like. The communication I/F 230 is capable of wireless communication compliant with the Bluetooth (registered trademark) standard and/or the Wi-Fi standard. These standards include a wireless communication standard that uses the 2.4 GHz or 5.0 GHz frequency band.
[0057] The storage device 240 is, for example, a magnetic storage device, an optical storage device, a semiconductor storage device, or a combination thereof. Examples of the optical storage device include optical disk drive s and magneto-optical disk (MD) drives. Examples of the magnetic storage device include hard disk drives (HDDs), floppy disk (FD) drives, and magnetic tape recorders. Examples of the semiconductor storage device include solid-state drives (SSDs).
[0058] The processor 250 is a semiconductor integrated circuit, and is also referred to as a central processing unit (CPU) or microprocessor. The processor 250 sequentially executes a computer program that is stored in the ROM 260 and includes instructions to train a prediction model and use the trained model, thereby carrying out a desired process. The processor 250 is to be broadly interpreted as a term encompassing an FPGA (Field Programmable Gate Array), a GPU (Graphics Processing Unit), an ASIC (Application Specific Integrated Circuit) or an ASSP (Application Specific Standard Product) with a CPU mounted thereon.
[0059] The ROM 260 is, for example, a writable memory (e.g., a PROM), a rewritable memory (e.g., a flash memory), or a read-only memory. The ROM 260 stores a program that controls operations of the processor. The ROM 260 may not necessarily be a single storage medium, or may be a set of storage media. A portion of the set of storage media may be removable memory.
[0060] The RAM 270 provides a work area into which the control program stored in the ROM 260 will be temporarily laid out during boot-up. The RAM 270 may not necessarily be a single storage medium, and may be a set of storage media.
[0061] Some representative example configurations of the system 1000 according to the present disclosure will be described below.
[0062] In an example configuration, the system 1000 includes the database 100 and the data processing device 200 shown in
[0063]
[0064] In another example configuration, as shown in
[0065] The system 1000 may include one or more data processing devices 200 and the cloud server 300. In that case, in the place of the processor 250 included in the data processing device 200 or in cooperation with the processor 250, the processor 310 included in the cloud server 300 can sequentially execute a computer program including instructions to train a prediction model and use the trained model. Alternatively, for example, a plurality of data processing devices 200 connected to the same LAN 400 may execute the computer program including such instructions in cooperation with one another. Such a distributed process performed by the plurality of processors can reduce calculation load on each processor.
<1. Generation of Trained Prediction Model>
[0066]
[0067] A trained model according to the present embodiment predicts the amount of dross generation in a melting furnace that is used in the manufacture of an aluminum alloy. However, the trained model may also be used to predict the amount of dross generation in a melting furnace that is used for the manufacture of any alloy material other than aluminum alloys. Furthermore, as described above, it is possible to predict the melting yield based on the predicted value of amount of dross generation and the weight of the loaded raw material. Therefore, the trained model according to the present embodiment may also be regarded as a prediction model that predicts the melting yield.
[0068] A method of generating a trained model according to the present embodiment includes: step S110 of acquiring a process state parameter for every single charge; step S120 of determining whether process state parameters from m charges (where m is an integer of 2 or greater) have been acquired or not; step S130 of performing preprocessing; step S140 of generating a learning data set; and step S150 of generating a trained model.
[0069] It is one or more processors that performs each process (or task). One processor may perform one or more processes, or a plurality of processors may work in cooperation to perform one or more processes. The processes are to be described in a computer program as software modules. However, in the case where an FPGA or the like is used, all or some of these processes may be implemented as a hardware accelerator. In the following description, it is the data processing device 200 including the processor 250 that performs each step.
[0070] At step S110, the data processing device 200 accesses the database 100 to acquire or obtain one or more process state parameters of different attributes for every single charge spanning from the loading of raw materials to the completion of melting. In the present embodiment, the data processing device 200 accesses respective aggregates of process data of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, furnace atmosphere temperature, O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas, and exhaust gas temperature that are stored in the database 100, and obtain process state parameters for every single charge. In other words, as process state parameters, the nine of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, furnace atmosphere temperature, O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas, and exhaust gas temperature are acquired for every single charge.
[0071] The data processing device 200 may access the database 100 after aggregates of chronological process data from a plurality of charges have been stored to the database 100, and acquire the process state parameters from the plurality of charges all in once (off-line processing). Alternatively, the data processing device 200 may access the database 100 every time an aggregate of chronological process data from one charge is stored to the database 100, and acquire a process state parameter from one charge (on-line processing).
[0072] At step S120, the data processing device 200 repeatedly performs step S110 until process state parameters from m charges have been acquired. The number m of charges in the present embodiment may be about 1000, for example. Once acquiring a data set containing process state parameters from m charges, the data processing device 200 proceeds to the next step S130. The data set contains process state parameters for the nine of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, furnace atmosphere temperature, O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas, and exhaust gas temperature that have been acquired through m charges.
[0073] At step 130, the data processing device 200 applies machine learning to the data set acquired at step S120, thereby performing preprocessing. With respect to each process state parameter of a different attribute, the preprocessing extracts n-dimensional features (where n is an integer of 1 or greater) from the process state parameter containing an aggregate of chronological data acquired for every single charge. In the present specification, n-dimensional features may be expressed as an n-dimensional feature vector.
[0074] Examples of machine learning to be applied in the preprocessing according to the present embodiment include autoencoders such as convolutional autoencoders (CAE), variational autoencoders (VAE), and clustering such as k-means technique, c-means technique, mixed Gaussian distribution (GMM), dendrogram methods, spectral clustering or probabilistic latent semantic analysis methods (PLSA or PLSI). The preprocessing will be described in detail later.
[0075] At step S140, the data processing device 200 generates a learning data set based on the n-dimensional features extracted from each process state parameter for every single charge. The learning data set at least contains one or more process target parameters representing process fundamental information that is set for every single charge. The learning data set may further contain one or more disturbance parameters, among which external environmental factors, e.g., climate data, may be included. In the present embodiment, the learning data set includes two process target parameters of loaded material quantity and melting time, as well as disturbance parameters such as average temperature and average relative humidity. However, the learning data set may contain other process target parameters and disturbance parameters. Although disturbance parameters are not essential parameters, they may be included in the learning data set to improve the prediction accuracy for amount of dross generation.
[0076] At step S150, the data processing device 200 trains a prediction model by using the generated learning data set, thereby generating a trained model. In the present embodiment, the prediction model, which is a supervised prediction model, is constructed by a neural network. An example of the neural network is a multilayer perceptron (MLP). The MLP is also called a feedforward neural network. The supervised prediction model is not limited to neural networks, and may, for example, be a support-vector machine, random forest, or the like.
[0077] The trained model that predicts the amount of dross generation in a melting furnace according to the present embodiment can be generated in accordance with various processing procedures (i.e., algorithms). Hereinafter, first to fourth example implementations of the algorithm will be described. In each of the first to fourth example implementations, a distinct preprocessing is performed. A computer program containing instructions that describe any such algorithm may be supplied through the Internet, for example. Hereinafter, the distinct preprocessing in each example implementation will mainly be described.
First Example Implementation
[0078]
[0079] A process flow according to the first example implementation includes a step (S110, S120) of acquiring process state parameters, step S130A of performing preprocessing, step S140 of generating a learning data set, and step S150 of generating a previously trained model.
[0080] The data processing device 200 acquires a data set containing process state parameters from m charges. In this example implementation, the data set contains process state parameters for the nine of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, furnace atmosphere temperature, O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas, and exhaust gas temperature that have been acquired through m charges.
[0081] The sampling intervals of the respective sensors vary depending on the attribute of the data to be measured. For example, the process data of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, and component concentrations in the exhaust gas (e.g., O.sub.2 concentration, CO concentration, NO.sub.X concentration) are measured by the flowrate sensor 705 and the pressure sensor 706 with a sampling interval of 1 second, whereas the furnace atmosphere temperature and exhaust gas temperature are measured by the temperature sensor 707 with a sampling interval of 1 minute.
[0082] At step 130, with respect to each process state parameter, the data processing device 200 applies an encoding process S131A to the respective process state parameter containing an aggregate of chronological data acquired for every single charge to extract n-dimensional features (or an n-dimensional feature vector). In the present embodiment, the dimensional number of the features to be extracted differs depending on the sampling interval of the sensor. For any process parameter defined by an aggregate of chronological process data that is measured with a sampling interval of 1 second, the data processing device 200 extracts an n.sub.1-dimensional feature vector. For any process parameter defined by chronological process data that is sampled with a sampling interval of 1 minute, the data processing device 200 extracts an n.sub.2-dimensional feature vector. In this example implementation, the data processing device 200 extracts an 18-dimensional feature vector from the respective process state parameters of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, and O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas, and extracts a 5-dimensional feature vector from the respective process state parameters of furnace atmosphere temperature and exhaust gas temperature.
[0083]
[0084] An autoencoder is a machine learning model that iteratively learns parameters so that the input and the output match through dimensional compression (encoding) on the input side and dimensional expansion (decoding) on the output side. The learning by an autoencoder can be unsupervised or supervised learning. A CAE has a network structure that utilizes convolutional layers, instead of fully-connected layers, for the encoding and decoding portions. A VAE, on the other hand, has intermediate layers each represented as a random variable (latent variable) that follows an N-dimensional normal distribution. The latent variable, which is a dimensional compression of the input data, can be used as a feature.
[0085] In this example implementation, the encoding process S131A is a VAE. As is illustrated in
[0086] By applying a VAE to the process state parameters 500, the data processing device 200 generates an mn-dimensional feature vector for every process state parameter. Given that there are 1 process state parameters, an 1mn-dimensional feature vector 510 is generated as a whole. In
[0087] Using representative values, such as mean values, integral values, and slopes, which can be subjected to scrutiny by operators and skilled workers, may result in oversights, because they can only be calculated to the extent that they allow scrutiny by them. On the other hand, applying an encoding process to the process state parameters 500 makes it possible to extract features with a high accuracy, and may even allow unexpected features to be extracted.
[0088]
[0089] At step 140, the data processing device 200 generates a learning data set that contains the 1mn-dimensional feature vector 510 generated at step S130, a process target parameter (s), and a disturbance parameter(s). In this example implementation, the learning data set contains an [m20]-dimensional feature vector concerning the respective process state parameters of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, and O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas; an [m18]-dimensional feature vector concerning the respective process state parameters of furnace atmosphere temperature and exhaust gas temperature; loaded material quantity; process target parameters such as loaded molten material quantity, melting time, cleaning factor, Mg concentration, and Mg master alloy addition quantity; and disturbance parameters such as hourly average temperature and average relative humidity.
[0090] At step S150, the data processing device 200 uses the learning data set generated at step S140 to train a prediction model, thereby generating a trained model. In this example implementation, the prediction model is an MLP.
[0091]
[0092] In MLPs, information propagates from the input side to the output side in one direction. Each unit receives a plurality of inputs, and calculates a single output. Assuming that the plurality of inputs are [x.sub.1, x.sub.2, x.sub.3, . . . , x.sub.i (i is an integer of two or greater)], the overall input to the unit is obtained by multiplying the respective inputs x by different weights w, adding them up, and adding a bias b to this, which is represented by equation 1. Herein, [w.sub.1, w.sub.2, w.sub.3, . . . , w.sub.i] are weights for the respective inputs. The output z of the unit is given as the output of a function f called an activation function for all inputs u, which is represented by equation 2. The activation function is typically a monotonically increasing nonlinear function. An example of the activation function is a logistic sigmoid function, which is represented by equation 3. In equation 3, e represents Napier's constant.
[0093] Between layers, each unit in one layer is connected to every unit in the other. As a result, an output of a unit in a left layer is an input of a unit in a right layer, which connection allows a signal to propagate from the left layer to the right layer in one direction. By determining the outputs of the layers sequentially while optimizing the parameters, i.e., the weights w and the bias b, the final output of the output layer is obtained.
[0094] As training data, actual values of amount of dross generation are used. The parameters (the weights w and the bias b) are optimized based on a loss function (squared error) such that the output of the output layer of the neural network approaches the actual value. In this example implementation, learning is performed 10000 times, for example.
[0095]
[0096] According to this example implementation, by applying a VAE to the aggregate of chronological process data, it is possible to extract an 18- or 5-dimensional feature vector for each process state parameter. Using a prediction model that is generated by integrating a VAE and a neural network makes it possible to predict amount of dross generation with a high accuracy. Furthermore, data generation based on a VAE, i.e., using a latent variable that has been compressed to a lower dimension, is useful in terms of allowing a chronological process assessment. For example, it become possible to tune the operating conditions of the melting furnace for each process step.
Second Example Implementation
[0097]
[0098] The second example implementation differs from the first example implementation in that a control pattern is generated based on n-dimensional features. Hereinafter, differences will mainly be described.
[0099] The data processing device 200 finds a pattern in an aggregate of chronological process data defining each process state parameter on the basis of the extracted n-dimensional features, thereby determining a control pattern.
[0100] The preprocessing according to this example implementation includes: step S130A of applying an encoding process S131A to the aggregate of chronological process data defining a process state parameter to extract n-dimensional features; and step 130C of applying a clustering S131B to the n-dimensional features to generate a control pattern. The process of step S130A is as has been described with respect to the first example implementation. Examples of clustering are GMM and K-means. In this example implementation, the clustering is GMM. Hereinafter, representative algorithms of GMM and k-means technique will be briefly described. These algorithms can be relatively easily implemented in the data processing device 200.
(Mixed Gaussian Distribution)
[0101] Mixed Gaussian distribution (GMM) is a method of analysis based on probability distributions, and is a model that is expressed as a linear combination of multiple Gaussian distributions. The model is fitted by the maximum likelihood method, for example. In particular, when there are multiple clusters in the data aggregate, the mixed Gaussian distribution can be used for clustering. From given data points, GMM calculates the mean and variance of each of the multiple Gaussian distributions. [0102] (i) Mean value and variance of each Gaussian distribution are initialized. [0103] (ii) Weights to be given to the data points are calculated for each cluster. [0104] (iii) Based on the weights calculated in (ii), the mean value and variance of each Gaussian distribution are updated. [0105] (iv) Until change in the mean value of each Gaussian distribution as updated in (iii) becomes sufficiently small, (ii) and (iii) are repeated.
(k-Means Technique)
[0106] Because k-means technique is relatively simple, and is applicable to relatively large data, k-means technique is broadly used in data analysis. [0107] (i) From among multiple data points, as many arbitrary points are selected as there are clusters, and these are designated as the centroids (or representative points) of the clusters. The data are also referred to as records. [0108] (ii) The distance between each data point and the centroid of each cluster is calculated, and from among the as many centroids as there are clusters, the cluster whose centroid is at the closest distance is defined as the cluster to which that data point belongs. [0109] (iii) For each cluster, a mean value of the multiple data points belonging to that cluster is calculated, and the data point that exhibits the mean value is defined as a new centroid of that cluster. [0110] (iv) Until movements of all data points between clusters subside or the upper limit number of computation steps is reached, (ii) and (iii) are repeated.
[0111] At step S130C, the data processing device 200 performs clustering for the n-dimensional features extracted at step S130A as input data, thereby determining a control pattern containing a label indicating a group that each process of the m charges belongs to. For example, the clustering can classify the input n1-dimensional feature vector into 10 groups, and the input n.sub.2-dimensional feature vector into 5 groups.
[0112] At step S131A, the data processing device 200 extracts e.g. an 18-dimensional feature vector from each of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, and O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas, and extracts e.g. a 5-dimensional feature vector from each of furnace atmosphere temperature and exhaust gas temperature.
[0113] The data processing device 200 applies clustering (step S131B) to the extracted 18-dimensional feature vector, thereby determining a control pattern containing a label indicating a group that each process of the m charges belongs to. Similarly, the data processing device 200 applies clustering (step S131B) to the extracted 5-dimensional feature vector, thereby determining a control pattern containing a label indicating a group that each process of the m charges belongs to. By performing clustering, the data processing device 200 classifies the 18-dimension and 5-dimensional feature vectors for every single charge into 10 groups and 5 groups, respectively, for example. Correspondingly to each of 18-dimension and 5-dimensional feature vectors, the data processing device 200 generates an m-dimensional control pattern vector 520 that is defined by m control patterns for the m charges.
[0114]
[0115]
[0116] Similarly to the first or second example implementation, as a result of training the prediction model, as is illustrated in
Third Example Implementation
[0117]
[0118] The third example implementation differs from the first or second example implementation in that a process pattern is generated based on a main process state parameter. Hereinafter, differences will mainly be described.
[0119] In this example implementation, the preprocessing includes: step S130D of generating a control pattern based on n-dimensional features that have been extracted at step S130A; and step S130E of generating a process pattern based on a main process state parameter.
[0120] The process of step S130A is as has been described in the third Example. In other words, for example, the data processing device 200 extracts an 18-dimensional feature vector from an aggregate of chronological process data defining each of exhaust gas flowrate, combustion air flowrate, combustion gas flowrate, furnace pressure, and O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas, and extracts a 5-dimensional feature vector from an aggregate of chronological process data defining furnace atmosphere temperature and exhaust gas temperature.
[0121] The process of step S130D differs from the process of step S130C of the second example implementation. The difference is that a process state parameter(s) associated with the same sampling interval are classified into two or more groups. At step S130D, the data processing device 200 combines all of the n-dimensional features that are acquired from each of at least one process state parameter belonging to the same group for every single charge, to generate combined features for each group. In the third example implementation, among multiple process state parameters that have been acquired with a sampling interval of 1 second, the two of exhaust gas flowrate and combustion air flowrate are assigned to group A; furnace pressure is assigned to group B; and the three of O.sub.2 concentration, CO concentration, and NO.sub.X concentration in the exhaust gas are assigned to group C. Furnace atmosphere temperature and exhaust gas temperature, which are process state parameters that have been acquired with a sampling interval of 1 minute, are assigned to group D.
[0122] At step S132, the data processing device 200 combines all of the 18-dimensional feature that have been extracted from each of the process state parameters of exhaust gas flowrate and combustion air flowrate belonging to group A, to generate combined features for group A. The combined features in group A have 36 dimensions. The data processing device 200 combines all of the 18-dimensional feature that have been extracted from the process state parameter of furnace pressure belonging to group B, to generate combined features for group B. In this case, because there is only one kind that needs feature combination, the combined features in group B have 18 dimensions, which is the same dimensions as those of the furnace pressure features. The data processing device 200 combines all of the 18-dimensional feature that have been extracted from each of the process state parameters of O.sub.2 concentration, CO concentration, and NO.sub.X in the exhaust gas belonging to group C, to generate combined features for group C. The combined features in group C have 54 dimensions. All of the 5-dimensional feature that have been extracted from the respective process state parameters of furnace atmosphere temperature and exhaust gas temperature belonging to group D are combined to generate combined features for group D. The combined features in group D have 10 dimensions.
[0123] By applying the clustering S131B to the combined features for each group, the data processing device 200 determines for each group a control pattern containing a label indicating a group that each process of the m charges belongs to. In this example implementation, the clustering is GMM. For example, by GMM, the input n-dimensional features may be classified into 10 groups.
[0124] By applying GMM to the 36-dimensional combined features in group A, the data processing device 200 generates an m-dimensional control pattern vector containing a control pattern A for every single charge. By applying GMM to the 18-dimensional combined features in group B, the data processing device 200 generates an m-dimensional control pattern vector containing a control pattern B for every single charge. By applying GMM to the 54-dimensional combined features in group C, the data processing device 200 generates an m-dimensional control pattern vector containing a control pattern C for every single charge. By applying clustering to the 10-dimensional combined features in group D, the data processing device 200 generates an m-dimensional control pattern vector containing a control pattern D for every single charge. Each of the control patterns A to D may include 10 patterns from labels AA to JJ, for example. Control patterns A are control patterns concerning burner control; control patterns B are control patterns concerning the furnace pressure pattern; control patterns C are control patterns concerning component concentrations in the exhaust gas; and patterns D are control patterns concerning temperature.
[0125] At step S130E, the data processing device 200 applies machine learning to an aggregate of chronological process data defining at least one of the one or more process state parameters to find a pattern in each process of the m charges, thereby determining a process pattern. To explain in more detail, the data processing device 200 applies an encoding process and clustering to an aggregate of chronological process data defining one of the main process state parameters, thereby determining a process pattern containing a label indicating a group that each process of the m charges belongs to.
[0126] Main process state parameters refer to those parameters among the one or more process state parameters which directly govern the melting process. For example, the amount of dross generation in the melting furnace is largely governed by the opening and closing of the furnace lid, the turning ON/OFF of the burner, and so on. Therefore, in the present embodiment, the parameters which reflect these are regarded as the main process state parameters. An example of a main process state parameter is the combustion gas flowrate.
[0127]
[0128] At step S130Z, the data processing device 200 applies an encoding process and clustering to an aggregate of chronological process data defining one of the main process state parameters among the one or more process state parameters, thereby determining a process pattern containing a label indicating a group that each process of the m charges belongs to. In this example implementation, the encoding process is VAE, and the clustering is k-means technique.
[0129] The process pattern may include 4 patterns from labels AAA to DDD, for example. The process pattern relates to the work required in the melting process. The process pattern is a pattern expression of an aggregate of chronological process data defining the main process state parameter, focusing on the combination of the presence/absence of work, work sequence, and work timing, where characteristic features are extracted. Similarly to the process pattern, the aforementioned control pattern may contain information concerning work, but is different from the process pattern in that it contains information other than work, e.g., information such as the control state of the melting furnace, for example.
[0130] The data processing device 200 applies a VAE to an aggregate of chronological process data defining combustion gas flowrate, and extracts e.g., a 2-dimensional feature from the process state parameter of combustion gas flowrate for every single charge. By applying k-means technique to the extracted 2-dimensional feature, the data processing device 200 determines a process pattern containing a label indicating a group that each process of the m charges belongs to. The data processing device 200 generates an m-dimensional process pattern vector 530 containing a process pattern for every single charge.
[0131]
[0132] Preferably, hyperparameters are adjusted for the trained model, thereby optimizing the accuracy of the prediction model. This adjustment can be performed by using a grid search, for example.
[0133] A method of generating a trained prediction model according to an embodiment of the present disclosure may further include a step of acquiring one or more other process state parameters that are distinct from the one or more process state parameters, and extracting features from the acquired one or more other process state parameters by a classical method. The other process state parameter(s) is distinct from the aforementioned process state parameters such as exhaust gas flowrate, combustion air flowrate, and combustion gas flowrate. The other process state parameter(s) is a component value of a combustion exhaust gas of the melting furnace, or the combustion exhaust gas temperature, for example. The learning data set may be generated based on the extracted n-dimensional features and the feature(s) extracted by the classical method.
Fourth Example Implementation
[0134]
[0135] The fourth example implementation differs from the first example implementation in that a learning data set is generated based on n-dimensional features extracted by applying machine learning and a feature(s) extracted by a classical method. Hereinafter, differences will mainly be described.
[0136] The other process state parameter in the fourth example implementation is a component value of a combustion exhaust gas of the melting furnace. The process flow according to the fourth example implementation further includes: step (8171) of continuously analyzing a component value of the combustion exhaust gas of the melting furnace to acquire analysis data of the exhaust gas component value; and a step (S172) of extracting, from the acquired analysis data, the feature(s) of the exhaust gas component value during burner combustion by a classical method. Examples of classical methods may be theoretically- or empirically-based.
[0137] At step S171, the data processing device 200 acquires continuous aggregates of data of component values of various combustion exhaust gases, e.g., O.sub.2, CO, CO.sub.2, and NO.sub.X, based on an output value output from a combustion exhaust gas analysis device that includes the gas sensor 708, for example. For example, continuous aggregates of data may be acquired for every single charge. The data processing device 200 analyzes the continuous aggregates of data to acquire analysis data of each exhaust gas component value. An example of a gas component value is the concentration of a gas component. The data processing device 200 in this example implementation analyzes the continuous data aggregate of component values of the combustion exhaust gases O.sub.2, CO, and NO.sub.X to acquire analysis data of O.sub.2 concentration, CO concentration, and NO.sub.X concentration.
[0138] At step S172, from analysis data acquired for each exhaust gas component, the data processing device 200 extracts the feature(s) of an exhaust gas component value during burner combustion for each exhaust gas component. The feature(s) of an exhaust gas component value may be expressed as a 1-dimensional feature vector, for example. As the feature of an exhaust gas component value, a median of an analysis value that is acquired by analyzing data which is obtained during burner combustion may be used, for example.
[0139] At step S140, the data processing device 200 generates a learning data set based on n-dimensional features extracted by applying machine learning and a feature(s) of exhaust gas component values extracted by a classical method. In this example implementation, the data processing device 200 generates a learning data set containing an 1mn-dimensional feature vector 510 generated at step S130, a process target parameter(s), a disturbance parameter(s), and the feature of the exhaust gas component value extracted at step S172.
[0140] Because an exhaust gas component value is special process data, it is preferable to extract its features by a classical method rather than by machine learning. Therefore, this example implementation treats component values of combustion exhaust gases distinctly from the aforementioned process state parameters. However, exhaust gas component values may be treated as a kind of process state parameter, and their features may be extracted by applying machine learning to the component values of combustion exhaust gases as has been described in the first example implementation.
[0141] At step S150, the data processing device 200 uses the learning data set generated at step S140 to train a prediction model, thereby generating a trained model.
<2. Run Time>
[0142] By using inputting input data containing control pattern candidates, process pattern candidates, and the like as the aforementioned trained model, it becomes possible to predict the amount of dross generation in a melting furnace, or output a control pattern and a process pattern conducive to an amount of dross generation that satisfies a predetermined reference value. The predetermined reference value may be set as a target value of amount of dross generation.
[0143]
[0144] A method of predicting the amount of dross generation in a melting furnace according to the present embodiment includes: a step of receiving, as inputs at run time, input data containing control pattern candidates, process pattern candidates, one or more process target parameters indicating process fundamental information to be set for every single charge spanning from the loading of raw materials to the completion of melting, and one or more disturbance parameters; and a step of inputting the input data to a trained model and outputting a predicted amount of dross generation for every single charge. However, if the learning data set used when causing the prediction model to learn does not contain any disturbance parameters, the input data at run time does not contain any disturbance parameters. In the present embodiment, it is assumed that the input data contains disturbance parameters.
[0145] The trained model can be generated according to the aforementioned first to fourth example implementations, for example. The learning data set to be used in training the prediction model contains one or more process target parameters encompassing the data range of the process target parameter(s) contained in the input data, and one or more disturbance parameters encompassing the data range of the disturbance parameter(s) contained in the input data. Stated otherwise, the one or more process target parameters in the input data are selected from within the data range of one or more process target parameters contained in the learning data set. Similarly, the one or more disturbance parameters in the input data are selected from within the data range of one or more disturbance parameters contained in the learning data set.
[0146] Now, the control pattern candidates and the process pattern candidates will be described.
[0147] The control pattern candidates include all control patterns that were generated through the preprocessing when generating the prediction model. When four kinds (patterns AA, BB, CC and DD) of control patterns are generated through the preprocessing, all of the four patterns qualify as control pattern candidates. The control pattern that is conducive to the highest amount of dross generation may vary depending on the process target parameters, process patterns, and disturbance parameters contained in the input data. Therefore, the present embodiment adopts a method where, a desirable control pattern is selected from among control pattern candidates, in order to optimize the control pattern in accordance with changes in the process target parameters, process patterns, and disturbance parameters. The desirable control pattern means a control pattern which is conducive to an amount of dross generation that satisfies a predetermined reference value, i.e., a target value.
[0148] Process pattern candidates are process patterns which have been selected by an operator as selectable candidate patterns in the melting process, from among process patterns which were generated through the preprocessing when generating the prediction model. Process pattern candidates are used in the sense of constraints in selecting a desirable control pattern. The operator is able to select one or more process pattern candidates in accordance with the work schedule, for example. For instance, given that the process patterns which were generated through the preprocessing include the four pattern of pattern AAA (number of times material is loaded: once, cleaning of furnace interior: NO), pattern BBB (number of times material is loaded: once, cleaning of furnace interior: YES), pattern CCC (number of times material is loaded: twice, cleaning of furnace interior: NO), and pattern DDD (number of times material is loaded: twice, cleaning of furnace interior: YES), consider a case where the number of times material is loaded in the melting process may be arbitrary, and no cleaning of furnace bed is required. In that case, the operator may select the two of pattern AAA and pattern CCC as the selectable candidate patterns via the input device 210 of the data processing device 200, for example.
[0149]
[0150] The output data associates all combinations of control pattern candidates and process pattern candidates with predicted values of amount of dross generation. These predicted values of amount of dross generation are charge-by-charge predicted values. In the illustrated example, correspondence between eight combinations and predicted values of amount of dross generation is shown. From among the eight combinations, the data processing device 200 selects a combination of a control pattern candidate and a process pattern candidate conducive to an amount of dross generation that satisfies the target value as the desirable control pattern and process pattern. The data processing device 200 may output the selected control pattern and process pattern to the display device 220 for displaying, or output to a log file, for example. In the illustrated example, a result is shown where control pattern candidate BB and process pattern candidate CCC are selected as the desirable control pattern and process pattern satisfying the target value.
EXAMPLES
[0151] Through comparison with Comparative Example, the inventor has examined the prediction accuracies for amount of dross generation in the first to fourth example implementations. In Comparative Example, mean values were calculated from chronological process data defining process state parameters, and these were used for the input data as representative values. In Comparative Example, amount of dross generation was predicted through multiple regression, and the prediction accuracy was calculated.
[0152]
[0153] In Comparative Example, the coefficient of determination R.sup.2 was 0.00. The mean absolute error (MAE) was 0.85, and the mean square error (MSE) was 1.14. In the first to fourth example implementations, the coefficient of determination R.sup.2 was 0.17, 0.15, 0.18, and 0.24, respectively. In the first to fourth example implementations, the mean absolute error (MAE) was 0.71, 0.78, 0.76, and 0.66, respectively, while the mean square error (MSE) was 0.85, 1.00, 0.96, and 0.73, respectively. The coefficients of determination R.sup.2 in the first to fourth example implementations were all above the coefficient of determination R.sup.2 in Comparative Example. The MAEs in the first to fourth example implementations were all below the MAE in Comparative Example, and also the MSEs in the first to fourth example implementations were all below the MSE in Comparative Example. Among the first to fourth example implementations, the fourth example implementation in particular is considered to be one of the best models for accurately predicting an amount of dross generation. By adding features of exhaust gas component values that were extracted by classical methods, it becomes possible to perform an analysis of exhaust gas based on component values.
[0154] According to the present embodiment, a prediction model that is generated by integrating an encoding process such as CAE or VAE, clustering such as GMM or k-means, and a supervised prediction model such as a neural network is used to enable prediction of amount of dross generation or melting yield with a high accuracy. Moreover, there is provided an operation support system for a melting furnace which, under a desired furnace operating schedule and amounts of material inputs, allows for recommending a control pattern and a process pattern that maximizes amount of dross generation or melting yield by using a trained model.
INDUSTRIAL APPLICABILITY
[0155] The technique according to the present disclosure may be widely used in support systems which, in addition to generating a prediction model to predict the amount of dross generation in a melting furnace used for the manufacture of an alloy material, selects operating conditions for the melting furnace by using a trained model.
REFERENCE SIGNS LIST
[0156] 100, 340: storage device (database); 200: data processing device; 201: body of data processing device; 210: input device; 220: display device; 230, 330: communication I/F; 240: storage device; 250, 310: processor; 260: ROM; 270: RAM; 280: bus; 300: cloud server; 320: memory; 350: Internet; 400: local area network; 700: melting furnace; 701: high-speed burner; 702: flame; 703: material; 704: flue; 705A, 705B, 705C: flowrate sensor; 706: pressure sensor; 707: temperature sensor; 708: gas sensor; 1000: operation support system