Optimization decision-making method of industrial process fusing domain knowledge and multi-source data
11409270 · 2022-08-09
Assignee
Inventors
Cpc classification
G05B19/41885
PHYSICS
G06N5/01
PHYSICS
International classification
Abstract
Disclosed is an optimization decision-making method of an industrial process fusing domain knowledge and multi-source data. The method comprises the steps of: acquiring the domain knowledge of the industrial process by using probability soft logic, and building an domain rule knowledge base of the industrial process; fusing multi-source data semantics and multi-source data features to form a new semantic knowledge representation of the industrial process, and constructing a semantic knowledge base of the industrial process; under a posteriori regularization framework, utilizing the domain rule knowledge base of the industrial process and the semantic knowledge base of the industrial process to obtain an optimization decision-making model embedded with the domain rule knowledge and obtain a posteriori distribution model; and migrating knowledge in the optimization decision-making model embedded with the domain rule knowledge into the posteriori distribution model through the knowledge distillation technology.
Claims
1. An optimization decision-making method of an industrial process fusing domain knowledge and multi-source data, comprising: S1: acquiring the domain knowledge of the industrial process by using probability soft logic, and building a domain rule knowledge base of the industrial process, wherein a specific expression form of the domain rule knowledge base of the industrial process is K.sub.R{(r.sub.i,λ.sub.i)}.sup.m.sub.i=1, wherein m represents a size of the domain rule knowledge base, r.sub.i represents an i-th first-order logic rule, and λ.sub.i is weight of a corresponding first-order logic rule; S2: fusing multi-source data semantics and multi-source data features to form a new semantic knowledge representation of the industrial process, and constructing a semantic knowledge base of the industrial process, wherein a step of acquiring the multi-source data semantics comprises: S21: clustering the multi-source data by an unsupervised clustering method to extract the multi-source data semantics; and wherein a step of acquiring the multi-source data features comprises: S22: preforming feature extraction on the multi-source data by using a convolution self-encoder; S3: under a posteriori regularization framework, utilizing the domain rule knowledge base of the industrial process and the semantic knowledge base of the industrial process to obtain an optimization decision-making model embedded with domain rule knowledge and obtain a posteriori distribution model, wherein specific steps of obtaining the optimization decision-making model embedded with the domain rule knowledge and obtaining the posteriori distribution model comprise: S31: defining a mapping relation ϕ.sub.i:{r.sub.il(X,Y)}.sub.l=1.sup.L.fwdarw.R by using the domain rule knowledge base K.sub.R={(r.sub.i,λ.sub.i)}.sub.i=1.sup.m of the industrial process to encode the domain knowledge of the industrial process; S32: under a posteriori regularization framework, introducing ϕ.sub.i:{r.sub.il(X,Y)}.sub.l=1.sup.L.fwdarw.R in form of constraints into a learning process of the model, converting an optimization decision-making problem thereof into a constraint-optimization problem,
2. The optimization decision-making method of the industrial process fusing the domain knowledge and the multi-source data of claim 1, wherein specific steps of acquiring the domain knowledge of the industrial process by using the probability soft logic comprises: S11: expressing properties, states, attributes and correlations there between different production factors in a manufacturing process by using an N-ary predicate so as to construct a first-order logic rule; S12: defining a weighted first-order logic rule to express a relationship between the optimization decision-making problem and a cause of the optimization decision-making problem; and S13: performing weight learning by using the probability soft logic to acquire the domain rule knowledge of the industrial process.
3. The optimization decision-making method of the industrial process fusing the domain knowledge and the multi-source data of claim 2, wherein a specific formula of the weighted first-order logic rule is ∀D.sub.1,D.sub.2, . . . ,D.sub.l,R.P.sub.1(D.sub.1, . . . )∧P.sub.2(D.sub.2, . . . )∧ . . . ∧P.sub.l(D.sub.l, . . . ).Math.P.sub.R(Q,R):λ wherein P.sub.1, P.sub.2, P.sub.l, P.sub.R are predicates; D.sub.1, D.sub.2, D.sub.l, R are variables; λ represents the weight which indicates an importance of the first-order logic rule, and larger the weight, more important the first-order logic rule is; a specific value of the weight is a non-negative real number; such rule indicates that states of industrial process targets D.sub.1, D.sub.2, . . . , D.sub.l in a certain condition cause a result of the optimization decision-making problem Q is R.
4. The optimization decision-making method of the industrial process fusing the domain knowledge and the multi-source data of claim 1, wherein a specific form of the new semantic knowledge representation of the industrial process is wherein a first half is a fusing feature h.sub.A and h.sub.A=(H.sub.1,H.sub.2, . . . ,H.sub.N): {A.sub.1,A.sub.2, . . . A.sub.N}; and R is decision-making semantics corresponding to the fusing feature h.sub.A, and R represents a specific decision-making result of the optimization decision-making problem; (H.sub.1,H.sub.2, . . . ,H.sub.N) is a feature of compact optimization for the multi-source data (X.sub.1,X.sub.2, . . . ,X.sub.N); and {A.sub.1,A.sub.2, . . . ,A.sub.N} is semantics of the multi-source data (X.sub.1,X.sub.2, . . . ,X.sub.N).
5. The optimization decision-making method of the industrial process fusing the domain knowledge and the multi-source data of claim 1, wherein a specific expression form of the semantic knowledge base of the industrial process is K.sub.S={k.sub.i}.sub.i=1.sup.n, wherein n represents a size of the semantic knowledge base, and k.sub.i represents an i-th knowledge element.
6. The optimization decision-making method of the industrial process fusing the domain knowledge and the multi-source data of claim 1, wherein specific steps of migrating knowledge in the optimization decision-making model embedded with the domain rule knowledge into the posteriori distribution model comprises: S41: defining the optimization decision-making model q(Y|X) embedded with the domain rule knowledge as a teacher network, and defining the posteriori distribution model p.sub.74 (Y|X) established by the multi-source data semantics as a student network; S42: training the teacher network and the student network simultaneously by using the domain rule knowledge base K.sub.R of the industrial process and the semantic knowledge base K.sub.S of the industrial process, wherein a parameter θ is updated as follows:
7. The optimization decision-making method of the industrial process fusing the domain knowledge and the multi-source data of claim 6, wherein a specific form of the loss function loss(⋅) is as follows: when solving a classification problem, the loss function of a cross-entropy
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The accompanying drawings herein, which are incorporated in and constitute a part of this specification, illustrate embodiments according to the present disclosure and together with the specification serve to explain the inventive principles.
(2) For the purpose of more clearly illustrating the embodiments of the present disclosure or the technical solution in the prior art, a brief description of the accompanying drawings to be used in describing the embodiments or the prior art is given below. It is obvious for ordinary persons skilled in the art to obtain other accompanying drawings from these accompanying drawings without any inventive effort.
(3)
(4)
(5)
(6)
(7)
(8)
(9)
DETAILED DESCRIPTION OF THE EMBODIMENTS
(10) For the purpose of making the objects, technical solutions and advantages of the present disclosure clearer, the technical solutions in the embodiments of the present disclosure will be described clearly and completely below in conjunction with the accompanying drawings of embodiment of the invention. Obviously, the described embodiments are a part of the embodiments of the invention, and not all of the embodiments of the invention. Based on the embodiments of the invention, all other embodiments obtained by ordinary persons skilled in the art without inventive work fall within the protective scope of the invention.
(11) As shown in
(12)
(13) S1: acquiring the domain knowledge of the industrial process by using probability soft logic, and building a domain rule knowledge base of the industrial process;
(14) S2: fusing multi-source data semantics and multi-source data features to form a new semantic knowledge representation of the industrial process, and constructing a semantic knowledge base of the industrial process;
(15) S3: under a posteriori regularization framework, utilizing the domain rule knowledge base of the industrial process and the semantic knowledge base of the industrial process to obtain an optimization decision-making model embedded with domain rule knowledge and obtain a posteriori distribution model; and
(16) S4: migrating knowledge in the optimization decision-making model embedded with the domain rule knowledge into the posteriori distribution model through the knowledge distillation technology.
(17) Further, as shown in
(18) S11: expressing properties, states, attributes and correlations therebetween of different production factors in a manufacturing process by using an N-ary predicate so as to construct a first-order logic rule;
(19) S12: defining a weighted first-order logic rules to express the relationship between an optimization decision-making problem and the cause of the optimization decision-making problem; and
(20) S13: performing weight learning by using the probability soft logic to acquire the domain rule knowledge of the industrial process.
(21) Further, based on the above solution, a specific formula of the weighted first-order logic rule is
(22) ∀D.sub.1,D.sub.2, . . . ,D.sub.l,R.P.sub.1(D.sub.1, . . . )∧P.sub.2(D.sub.2, . . . )∧ . . . ∧P.sub.l(D.sub.l, . . . ).Math.P.sub.R(Q,R):λ
(23) where P.sub.1, P.sub.2, P.sub.l, P.sub.R are predicates; D.sub.1, D.sub.2, D.sub.l, R are variables; λ represents the weight. The weight indicates an importance of the first-order logic rule. That is, the larger the weight, the more important the first-order logic rule is. The specific value of the weight is a non-negative real number. Such rule indicates that the states of industrial process targets D.sub.1, D.sub.2, . . . , D.sub.l in a certain condition cause the result of the optimization decision-making problem Q is R.
(24) Further, based on the above solution, a specific expression form of the domain rule knowledge base of the industrial process is K.sub.R={(r.sub.i,λ.sub.i)}.sub.i=1.sup.m, where m represents a size of the domain rule knowledge base, r.sub.i represents an i-th first-order logic rule, and λ.sub.i is the weight of a corresponding first-order logic rule.
(25) Further, as shown in
(26) S21: clustering the multi-source data by an unsupervised clustering method to extract the multi-source data semantics; and
(27) the step of acquiring the multi-source data features includes:
(28) S22: preforming feature extraction on the multi-source data by using a convolution self-encoder.
(29) Further, based on the above solution, a specific form of the new semantic knowledge representation of the industrial process is k=h.sub.A:R ,
(30) where the first half is a fusing feature h.sub.A h.sub.A=(H.sub.1,H.sub.2, . . . ,H.sub.N):{A.sub.1,A.sub.2, . . . ,A.sub.N}; and R is decision-making semantics corresponding to the fusing feature h.sub.A, and R generally represents a specific decision-making result of the decision-making problem. Further, (H.sub.1,H.sub.2, . . . ,H.sub.N) is a feature of compact optimization for the multi-source data (X.sub.1,X.sub.2, . . . ,X.sub.N) ; and {A.sub.1,A.sub.2, . . . ,A.sub.N} is the semantics of the multi-source data (X.sub.1,X.sub.2, . . . ,X.sub.N).
(31) Further, based on the above solution, a specific expression form of the semantic knowledge base of the industrial process is K.sub.S={k.sub.i}.sub.i=1.sup.n, where n represents a size of the semantic knowledge base, and k.sub.i represents an i-th knowledge element.
(32) Generally, the domain knowledge of the industrial process reflects natural variation laws of the optimization decision-making process, while the knowledge implied by the multi-source data reflects dynamic variation laws of the manufacturing process. The internal mechanism and evolutionary characteristics of the optimization decision-making problem of the industrial process may be accurately revealed by effectively combining the domain knowledge and the innovative knowledge implied by the multi-source data.
(33) Further, as shown in
(34) S31: defining a mapping relation by ϕ.sub.i: {r.sub.il(X,Y)}.sub.l=1.sup.L.fwdarw. by using the domain rule knowledge base K.sub.R={(r.sub.i,λ.sub.i)}.sub.i=1.sup.m of the industrial process to encode the domain knowledge of the industrial process;
(35) S32: under the posteriori regularization framework, introducing ϕ.sub.i:{r.sub.il(X,Y)}.sub.l=1.sup.L.fwdarw. in the form of constraints into a learning process of the model, converting the optimization decision-making problem thereof into a constraint-optimization problem,
(36)
(37) where C, λ.sub.i and ξ.sub.i are constants; KL(q(Y|X)∥p.sub.θ(Y|X)) is a form of (X,Y) under the posteriori regularization framework; and KL(⋅) is a divergence calculation; and
(38) S33: obtaining an optimization decision-making knowledge inference model embedded with the domain rule knowledge of the industrial process by solving the constraint-optimization problem;
(39)
(40) where q(Y|X) represents the optimization decision-making model embedded with the domain rule knowledge, and p.sub.θ(Y|X) is the posteriori distribution model.
(41) Further, as shown in
(42) S41: defining the optimization decision-making model q(Y|X) embedded with the domain rule knowledge as a teacher network, and defining the posteriori distribution model p.sub.θ(Y|X) established by the multi-source data semantics as a student network;
(43) S42: training the teacher network and the student network simultaneously with the domain rule knowledge base K.sub.R of the industrial process and the semantic knowledge base K.sub.S of the industrial process, where a parameter θ is updated as follows:
(44)
(45) where loss(⋅) is a loss function; σ.sub.θ is a soft output of the student network p.sub.θ(Y|X); S.sub.i.sup.t is a soft output of the teacher network q(Y|X); and C.sub.R.sup.i is a real decision-making semantics; and
(46) performing iteration training of a teacher-student network by using the formula
(47)
and the formula
(48)
so as to migrate the knowledge from the teacher network to the student network.
(49) Further, based on the above solution, a specific form of the loss function loss(⋅) may be chosen as follows:
(50) when solving a classification problem, the loss function of a cross-entropy
(51)
may be chosen,
(52) where y is a real value, and ŷ is a prediction value; and
(53) when solving a regression problem, the loss function of square
(54)
may be chosen,
(55) where y is a real value, and ŷ is a prediction value.
(56) Embodiments
(57) The energy flow network of iron and steel industry is a complex and huge system. The energy flow network is in a dynamic state in the production process and is in coupling correlation with the material flow. The built model of the material flow and the energy flow of a whole process is required to be dynamically self-adaptive to meet actual needs. The operational big data of the whole process of the iron and steel contains the dynamic variation laws of the material flow and the energy flow, while the dynamic simulation of virtual data of the whole process may reflect the evolution characteristics of physical entities. The method of accurately modeling the material flow and the energy flow of the whole process of the iron and steel by fusing knowledge learning is provided herein, which mines the innovative knowledge implied by the operational big data of the whole process of the iron and steel, forms the domain rule knowledge base and the semantic knowledge base, builds the optimization decision-making knowledge inference model embedded with the domain rule knowledge based on the domain rule knowledge base and the semantic knowledge base, and intelligently corrects the built dual drive model of the material flow and the energy flow of the whole process of the iron and steel, so as to realize the dynamic self-adaptation of the model and improve the accuracy of the model.
(58)
(59) Hereinafter, the 3 parts mentioned above will be described in detail, respectively.
(60) (1) The dual drive model of the material flow and the energy flow of the whole process of the iron and steel
(61) The dual drive model of the material flow and the energy flow of the whole process of the iron and steel mainly performs fusing of mechanism model and data model of the material flow and the energy flow of the whole process of the iron and steel by means of a certain method so as to form complementary advantages of the mechanism model and the data model, thereby enhance the accuracy of modeling. Since the material flow and the energy flow have complex structures during the production process of the iron and steel and there are characteristics such as multi-dimensional correlation, multi-field cooperation, multi-phase coupling, and the like between the material flow and the energy flow, the accurately built fusing model is the foundation of achieving collaborative optimization of production operation of the iron and steel. On one hand, in the invention, a network modeling method of “node-connector” is utilized to construct equipment, units, processes, and the like involved by the material flow into “nodes” of different scales, and to construct pipings, hot-metal tanks, ladles, and the like involved by the energy flow into “connectors” so as to form a mechanism model with coupling network of the material flow and the energy flow. On the other hand, semantic features of multi-source heterogeneous space-and-time data of the production process are extracted and mapping mechanism of the semantic features of the material flow and the energy flow is explored to build the data model. Finally, the mechanism model and the data model of the material flow and the energy flow of the whole process of the iron and steel are fused to form the dual drive model .sub.mechanism-data-driven (X;θ), where θ is the model parameter.
(62) (2) The operation optimization method based on knowledge inference model parameters
(63) The manufacturing process environment of the iron and steel is complex, and the built model is continuously changing. With the method provided by the invention, the parameter θ of the dual drive model .sub.mechanism-data-driven (X;θ) of the material flow and the energy flow of the whole process of the iron and steel is intelligently corrected. Firstly, the operational big data of the whole process of the iron and steel and the simulation data of the dynamic simulation module of the material flow and the energy flow of the whole process of the iron and steel are mined for knowledge so as to form the domain rule knowledge base and the semantic knowledge base.
(64) The representation of the domain rule is as shown follows:
(65) ∀D.sub.1,D.sub.2, . . . ,D.sub.l,R.P.sub.1(D.sub.1, . . . )∧P.sub.2(D.sub.2, . . . )∧ . . . ∧P.sub.l(D.sub.l, . . . ).Math.P.sub.R(Q,R):λ,
(66) where P.sub.1, P.sub.2, P.sub.l, P.sub.R are predicates; D.sub.1, D.sub.2, D.sub.l, R are variables; λ represents the weight. The weight indicates an importance of the first-order logic rule. That is, the larger the weight, the more important the first-order logic rule is. The specific value of the weight is a non-negative real number. Such rule indicates that the states of industrial process targets D.sub.1, D.sub.2, . . . , D.sub.l in a certain condition cause the result of the optimization decision-making problem Q is R.
(67) The representation of the semantic knowledge is shown as follows:
k=(H.sub.1,H.sub.2, . . . ,H.sub.N):{A.sub.1,A.sub.2, . . . ,A.sub.N}:R,
(68) where the first half is the fusing feature h.sub.A h.sub.A=(H.sub.1,H.sub.2, . . . ,H.sub.N):{A.sub.1,A.sub.2, . . . ,A.sub.N}; and R is decision making semantics corresponding to the fusing feature h.sub.A, and R generally represents a specific decision-making result of a certain decision-making problem.
(69) The domain rule knowledge base KR and the semantic knowledge base KS are built from the above, and optimization decision-making knowledge inference model embedded with the domain rule knowledge is shown as follows:
(70)
(71) q.sup.*(R|X) represents the optimization decision-making mode embedded with the domain rule knowledge which is used as the teacher network; p.sub.θ(R|H,X) is the optimization decision-making model driven by the multi-source data semantic knowledge, which is used as the student network; and ϕ.sub.i: {r.sub.il(X,Y)}.sub.l=1.sup.L.fwdarw. is a mapping function of the constraints of the domain rule.
(72) The parameter of the dual drive model of the material flow and the energy flow of the whole process of the iron and steel is the optimization decision-making problem of the invention, and the nature of this problem is a regression problem. The student network p.sub.θ(R|H,X) chooses LSTM model. From this, the optimization decision-making knowledge inference model for the parameter of the dual drive model of the material flow and the energy flow of the whole process of the iron and steel is obtained so as to intelligently correct the model parameter in real time.
(73) (3) The dynamic simulation of the material flow and the energy flow of the whole process of the iron and steel
(74) The whole process, such as sintering, coking, iron-making, steel-making, hot-rolling and cold-rolling, of the iron and steel is modelled by utilizing the built dual drive model of the material flow and the energy flow of the whole process of the iron and steel to establish the dynamic simulation system and performs dynamic interactive iteration mapping with the actual production process, and a digital twin system of the material flow and the energy flow of the whole process of the iron and steel is formed. The dynamic simulation data generated by the digital twin system supports the operation optimization of the knowledge inference model parameters as a data source.
(75) In the invention, it should be noted that relational terms such as “first” and “second” are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any such actual correlation or sequence between these entities or operations. Further, terms “include”, “comprise” or any other variation thereof are intended to cover non-exclusive inclusion, such that a process, method, article, or device that includes a series of elements includes not only those elements, but also other elements that are not explicitly listed, or elements inherent for such process, method, article, or device. Without further limitations, the elements defined by the statement “including a...” do not exclude the existence of other same elements in the process, method, article or device including the elements.
(76) The above is only the specific embodiments of the invention, which enables persons skilled in the art to understand or realize the invention. Various modifications to these embodiments will be apparent to persons skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the spirit or scope of the invention. Hence, the present disclosure will not be limited to these embodiments shown herein, but will conform to the widest scope consistent with the principles and novel features applied herein.