Distributed control system
11307549 · 2022-04-19
Assignee
Inventors
- Satoshi Kikuchi (Susono, JP)
- Michihiro Kawanishi (Nagoya, JP)
- Tatsuo Narikiyo (Nagoya, JP)
- Ngoc Huynh Tran (Nagoya, JP)
Cpc classification
G05B2219/1204
PHYSICS
G05B19/045
PHYSICS
International classification
G05B19/045
PHYSICS
Abstract
A system includes a plurality of control devices that respectively control the states of a plurality of apparatuses and are connected to each other via communication lines. When each of the control devices determines a state target value of its own apparatus using the current state indicator value of the own apparatus, and the distributed controller input which is a function of the state indicator value of an apparatus adjacent to the own apparatus and the state indicator value of the own apparatus, the control gain which adjusts contribution of the distributed controller input to the state target value is determined based on a communication delay time between the control devices.
Claims
1. A distributed control system comprising: a plurality of controllers configured to respectively control states of a plurality of apparatuses; and a communication network composed of a plurality of communication lines that connect the plurality of controllers, wherein: a state indicator value indicating a selected state is measured in each of the apparatuses, and is transmitted from a controller of a corresponding one of the apparatuses via the communication lines to a controller of an apparatus adjacent to the corresponding apparatus, from among the plurality of apparatuses connected to the communication lines; the controller of each of the apparatuses is configured to control the state of an own apparatus from among the plurality of apparatuses by referring to the state indicator value of the own apparatus and the state indicator value of the adjacent apparatus, such that the state indicator value of the own apparatus matches a state target value that is determined according to a control protocol of a multi-agent system, the own apparatus controlling the selected state by oneself; each of the plurality of controllers determines, according to the control protocol, the state target value of the own apparatus, using a current state indicator value of the own apparatus, and a distributed controller input that is a function of the state indicator value of the adjacent apparatus and the state indicator value of the own apparatus; each of the plurality of controllers is configured to determine a control gain which adjusts contribution of the distributed controller input to the state target value based on at least one of a communication delay time when the state indicator value of the adjacent apparatus is transmitted from the controller of the adjacent apparatus to each of the plurality of controllers and the communication delay time when the state indicator value of the own apparatus is transmitted from each of the plurality of controllers to the controller of the adjacent apparatus; each of the plurality of controllers is configured to set, when the distributed controller input is a sum of functions of the state indicator values of a plurality of apparatuses adjacent to the own apparatus and the state indicator value of the own apparatus, the control gain for each of functions corresponding to the controllers of the adjacent apparatuses connected to each of the plurality of controllers; and the control gain set for each of the functions corresponding to the controllers of the adjacent apparatuses connected to each of the plurality of controllers is indicated as G.sub.ij, G.sub.ij is given as G.sub.ij=Γ.sup.max(Δij,Δji), where an integer Γ smaller than 1, the communication delay time Δ.sub.ij when the state indicator value of the adjacent apparatus is transmitted from the controller of the adjacent apparatus to each of the plurality of controllers, and the communication delay time Δ.sub.ji when the state indicator value of the own apparatus is transmitted from each of the plurality of controllers to the controller of the adjacent apparatus.
2. The system according to claim 1, wherein each of the plurality of controllers is configured to determine the control gain based on a longer time from among the communication delay time when the state indicator value of the adjacent apparatus is transmitted from the controller of the adjacent apparatus to each of the plurality of controllers and the communication delay time when the state indicator value of the own apparatus is transmitted from each of the plurality of controllers to the controller of the adjacent apparatus.
3. The system according to claim 1, wherein each of the plurality of controllers is configured to reduce the control gain when the communication delay time is a longer delay time compared to when the communication delay time is a shorter delay time, wherein the longer delay time is longer than the shorter delay time.
4. The system according to claim 1, wherein: each of the plurality of controllers transmits the state indicator value of the own apparatus to the controller of the adjacent apparatus; the controller is configured to transmit a latest state indicator value of the own apparatus after the state indicator value of the own apparatus is transmitted and arrives at the controller of the adjacent apparatus; and the function used by each of the plurality of controllers is a function of the latest state indicator value of the adjacent apparatus received from the controller of the adjacent apparatus and the latest state indicator value of the own apparatus received by the controller of the adjacent apparatus.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Features, advantages, and technical and industrial significance of exemplary embodiments of the disclosure will be described below with reference to the accompanying drawings, in which like signs denote like elements, and wherein:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
DETAILED DESCRIPTION OF EMBODIMENTS
(17) Embodiments of the present disclosure will be described in detail with reference to the drawings hereinafter. In the drawings, the same reference numerals indicate the same parts. In addition, in the following embodiments, a case in which the state indicator value of each agent of a distributed control system is controlled by the average consensus control of a multi-agent system will be described as an example. However, configurations of “an intermittent transmission correction” and “a control gain correction” to be described below are also applied to other control forms, such as consensus control other than average consensus control, covering control, and distributed optimization control, such that convergence of the state indicator value of each agent can be further improved. It should be understood that such cases also fall within the scope of the present disclosure.
(18) With reference to
(19) In the above system, the control device (i, j, and the like, are numerals of the agents) of each agent i may be typically composed of a primary controller that controls an object to be controlled, that is, a selected state and an output of an apparatus, and a secondary controller that determines a target value of the selected state of the apparatus, as schematically illustrated in
(20) In the distributed control system as illustrated in
(21) However, as described in the “Summary”, in arithmetic processing of the control protocol of an existing multi-agent system using a differential equation of the above equation (1), the communication delay time occurring in the communication network is not taken into consideration. In addition, when the differential equation of the above equation (1) is used as it is for control in an environment in which the communication delay time occurs in the communication network, phenomena such as the state indicator value of each agent failing to converge to the consensus value, and the like, an error occurring in the consensus value, and the like, and a fluctuation in the consensus value, and the like are observed. In other words, when the arithmetic processing of the control protocol of the existing multi-agent system is applied as it is to an actual distributed control system as described above, a situation in which stable control cannot be achieved may occur. Therefore, in the present embodiment, as to be described below in detail, the configuration of the secondary controller of the control device of each agent is improved such that the signal communication and the arithmetic processing are performed using a new control protocol that can stably converge the state indicator value of each agent to the consensus value, and the like even in an environment in which the communication delay time occurs in the communication network, especially even when the communication delays between agents are not symmetric.
(22) Arithmetic Processing of Consensus Control in Existing Multi-Agent System
(23) Before description of the control configuration according to the present embodiment, a phenomenon that occurs in the existing control configuration will be briefly described. With reference to
(24) However, as described above, since a finite time is required for the signal communication of the state indicator value in the actual communication network, as illustrated in
x.sub.i[k+1]=x.sub.i[k]+Σ.sub.j∈N.sub.
(25) Here, k−δk is the measurement time immediately before the point in time traced back by the amount of the communication delay time Δ from the current time k. δk is the number of sampling time intervals corresponding to the sum of the communication delay time Δ and the standby time Δ.sub.w after reception (see
(26) In addition, according to another aspect, in measurement of the state indicator value of each agent, the measurement time of the state indicator value of each agent is simultaneously recorded, and data of the measurement time is transmitted together with the state indicator value to the adjacent agent. In a case in which the state target value is calculated using an equation which is modified such that the measurement time of the state indicator value of the transmission-side agent matches that of the reception agent, in the distributed controller (the second term on the right side of the equation (1)), as represented by an arrow b in
x.sub.i[k+1]=x.sub.i[k]+Σ.sub.j∈N.sub.
when the communication delay time is equal to or less than the sampling time interval (not zero), the state indicator values of all the agents converge to the expected consensus value, but when the communication delay time slightly exceeds the sampling time interval, it is observed that the state indicator values of all the agents do not even show a tendency to converge (see Japanese Patent Application No. 2019-010040).
(27) Improvement of Arithmetic Processing of Consensus Control of Multi-Agent System
(28) (A) Intermittent Transmission Correction
(29) As described above, in an environment in which a finite delay time occurs in the signal communication between the agents in the distributed control system, the consensus control might not be able to be stably achieved depending on the condition of the delay of the signal communication in the control protocol using the widely known existing equation (2) (or equations (8) and (9)). Therefore, as described in the “Summary”, the inventors of the present disclosure have proposed in Japanese Patent Application No. 2019-010040, regarding processing in which each agent transmits the state indicator value to the adjacent agent, a configuration for improving the convergence of the state indicator value of each agent by changing the control protocol so as to intermittently transmit the state indicator values instead of transmitting all the state indicator values measured in each agent (intermittent transmission correction), as described below.
(30) With reference to
(31) As described above, when each agent, as a transmission-side agent, changes the type of transmission of its state indicator value, each agent, as a reception-side agent, uses the latest state indicator value that has arrived from the transmission source as the state indicator value of the adjacent agent in the distributed controller (corresponding to the second term on the right side of the equation (1)) for calculation of the state target value (see
x.sub.i[k+1]=x.sub.i[k]+Σ.sub.j∈N.sub.
(32) Here, k.sub.aj is the measurement time of the state indicator value transmitted from the transmission-side agent j, and is expressed as k.sub.aj=l.sub.aj−δk . . . (10a), δk=Δ.sub.s+Δ.sub.ij+Δ.sub.r . . . (10b), using a first measurement time l.sub.aj (<k [current time]) after the reception of the state indicator value by the reception-side agent i. Here, Δ.sub.s is the standby time from the measurement time k.sub.a immediately before the transmission time of the transmission-side agent to the transmission time, and Δ.sub.r is the standby time from when the state indicator value of the transmission-side agent arrives at the reception-side agent to the calculation time. In addition, Δ.sub.ij is the communication delay time, that is, the time required for signal transmission from the transmission-side agent j to the reception-side agent i. In a configuration in which the reception notification is transmitted from the reception-side agent to the transmission-side agent, Δ.sub.ij may include the time until the reception notification arrives. According to this protocol, once receiving the state indicator value of the transmission-side agent (adjacent agent), the reception-side agent continuously uses the received state indicator value in the distributed controller until receiving the next state indicator value of the transmission-side agent. Moreover, the state indicator value of the adjacent agent used in the distributed controller of each agent may be updated for each adjacent agent. For example, measurement times k.sub.aj of the state indicator values of agents 2, 5, and 7 used in the distributed controller of an agent 6 in
(33) Thus, in a case in which the state target value is calculated by applying the intermittent transmission correction using the above equation (10), as illustrated in the result of the calculation simulation in the upper part of
(34) (B) Reference Correction
(35) As described above, when the state indicator value is controlled using the above equations (1), (2), or (8) to (10) in an environment in which a finite delay time in signal communication between the agents in the distributed control system occurs, the average value of the state indicator values of all agents which are connected to form the undirected graph, is not maintained. As a result, a phenomenon occurs in which even when the state indicator values converge to the consensus value by applying, for example, the intermittent transmission correction, the consensus value deviates from the expected average value of the initial values of the state indicator values of all agents. On the other hand, when the communication delay time in signal transmission between the agents in the system does not occur, for any two agents i, j, a difference between terms on the two agents in the respective distributed controllers u.sub.i, u.sub.j are (x.sub.j[k]−x.sub.i[k]), (x.sub.i[k]−x.sub.j[k]), respectively. In other words, the state indicator values referred to in the terms are the same x.sub.i[k], x.sub.j[k]. In short, since the state indicator value referred to in the distributed controller of each agent in the system is common to that of the adjacent agent, the average value of the state indicator values of all agents is maintained. However, in the existing control protocol of the state indicator value of the agent, when the communication delay time occurs in the signal transmission between the agents in the system, the state indicator value of the own apparatus referred to in the distributed controller of each agent may be different from that transmitted to the adjacent agent. Accordingly, the state indicator value of the own apparatus does not necessarily match the state indicator value of the own apparatus referred to as the state indicator value of an adjacent agent in the distributed controller of the adjacent agent, and as a result, the average value of the state indicator values of all the agents is not maintained. Therefore, according to the present embodiment, the control protocol is modified such that the state indicator value of the own apparatus referred to in the distributed controller of each agent is the same as that transmitted to the adjacent agent. As such, the average value of the state indicator values of all agents can be maintained. Hereinafter, this modification of the control protocol is referred to as a “reference correction”.
(36) Theoretically, the distributed controller u.sub.i in the equation (2) is modified as follows:
u.sub.i[k]=Σ.sub.j∈N.sub.
(37) Here, Δ.sub.ij [k] is the communication delay time required for signal transmission from the agent j, which is adjacent to the agent i, to the agent i, and x.sub.j[k−Δ.sub.ij[k]] is the state indicator value of the agent j at a time traced back by the amount of Δ.sub.ij[k] from the current time k, which is received from the agent i, and Δ.sub.ji[k] is the communication delay time required for signal transmission from the agent i to the agent j, and x.sub.i[k−Δ.sub.ji[k]] is the state indicator value of the agent i at a time traced back by the amount of Δ.sub.ji from the current time k, which is received by the agent j. Alternatively, Δ.sub.ij[k] and Δ.sub.ji[k] do not have to be constant but may change every moment.
(38) According to the above equation (11), it is proved that the average value of the state indicator values of all the agents which are connected to form the undirected graph is maintained as below. First, the equation (11) is expressed as follows by performing z-transform:
(39)
(40) Here, U.sub.i[z], Q.sub.i[z], and Q.sub.j[z] are z-transforms of u.sub.i[k], x.sub.i[k], and x.sub.j[k], respectively. Therefore, the distributed controller U of all agents is expressed as U[z]=−L.sup.d.sub.a[k]Q[z] . . . (13), using Graph Laplacian L.sup.d.sub.a[k] and the vector Q[z] having z-transforms of the state indicator values of all agents as a component. Here, the Graph Laplacian L.sup.d.sub.a[k] is as follows:
(41)
(42) Then, when the Graph Laplacian L.sup.d.sub.a[k] is multiplied by a row vector 1.sup.T.sub.n in which all components are one, from the left, 1.sup.T.sub.nL.sup.d.sub.a[k]=0.sup.T . . . (15) is obtained (0.sup.T is a row vector in which all components are zero). As such, the change in the sum of the state indicator values of all agents can be zero, and the average value of the state indicator values of all agents is maintained.
(43) In the above modification of the control protocol, when the above-described intermittent transmission correction is applied to the state indicator value of each agent, the distributed controller u.sub.i (of a time area) is expressed as follows:
(44)
(45) Here, k.sub.aj is the measurement time of the latest state indicator value transmitted from the adjacent agent j and received by the agent i, and k.sub.bi is the measurement time of the latest state indicator value transmitted from the adjacent agent i and received by the agent j. In addition, when the intermittent transmission correction is applied in the reference correction in which the value referred to as the state indicator value of own apparatus in the distributed controller of each agent is set to that transmitted to the adjacent agent, it should be understood that the state indicator value x.sub.i of the own apparatus referred to in the distributed controller u.sub.i of the agent i is also a value intermittently transmitted to and received by the adjacent agent from among the state indicator values measured in time series. Moreover, in this regard, each agent cannot notice whether the state indicator value transmitted to the adjacent agent has arrived at the adjacent agent just by transmitting the state indicator value. Therefore, according to the present embodiment, each agent may use the transmitted state indicator value in the distributed controller after receiving, from the adjacent agent of the transmission destination, the notification that the transmitted state indicator value has arrived at the adjacent agent. In other words, the state indicator value of the own apparatus referred to by each agent in the distributed controller may be the state indicator value of the own apparatus, which is confirmed to have been received by the adjacent agent. In addition, for this purpose, each agent is appropriately configured to, upon receiving the state indicator value from the adjacent agent, notify the adjacent agent of the transmission source of that fact.
(46) In the above reference correction, the respective measurement times of the state indicator value of the own apparatus and the state indicator value of the adjacent apparatus referred to in the distributed controller of each agent do not have to match each other. Therefore, it should be understood that an advantageous effect in which the average value of the state indicator values of all agents in the system is maintained by the reference correction can be achieved even when the time delay of the signal communication between any two agents is not symmetric.
(47) (C) Control Gain Correction
(48) As described above, in an environment in which a finite delay time occurs in the signal communication between the agents in the distributed control system, it is possible to improve the convergence of the state indicator value of each agent to a certain degree, using the intermittent transmission correction. Moreover, as disclosed in Japanese Patent Application No. 2019-010040, when the communication delay time between the agents becomes longer, oscillations occur in the state indicator value due to oscillations of the calculated value by the distributed controller, and it is difficult for the state indicator values to converge. Therefore, it has been found that the convergence of the state indicator value can be further improved by multiplying the distributed controller by a gain γ.sub.i (0<γ<1) as the following equation (17), so as to reduce the contribution of the distributed controller to the target value of the state indicator value:
x.sub.i[k+1]=x.sub.i[k]+T.sub.s.Math.γ.sub.i.Math.u.sub.i[k] (17)
(49) In this regard, as described above, since the oscillations of the calculated value by the distributed controller depends on the length of the communication delay time, in a general system in which the communication delay time may randomly fluctuate between the agents, it is considered that the oscillations of components (for example, for agents i and j, (x.sub.j[k.sub.aj]−x.sub.i[k.sub.bi]) and (x.sub.i[k.sub.bi]−x.sub.j[k.sub.aj])) of the distributed controller, which are associated with agents having a long communication delay time, become larger. Therefore, the convergence of the state indicator value can be further improved using the control gain which is determined based on the communication delay time so as to adjust the contribution of the components of the distributed controller associated with the agents to the state target value according to the length of the communication delay time between the agents.
(50) Specifically, the distributed controller u.sub.i may be modified as follows:
u.sub.i[k]=Σ.sub.j∈N.sub.
(51) Here, G.sub.ij is the control gain set for each difference corresponding to the control device of the adjacent apparatus connected to each control device, and it may be given by the following equation:
G.sub.ij=g(Δ.sub.ij,Δ.sub.ji) (19)
(52) Here, g(Δ.sub.ij, Δ.sub.ji) may be a function of a first communication delay time Δ.sub.ij in the transmission of the state indicator value from the agent j to the agent i and a second communication time Δ.sub.ji in the transmission of the state indicator value from the agent i to the agent j. The communication delay time Δ.sub.ij and the communication delay time Δ.sub.ji are generally time variables as described above. Moreover, as described above, generally, since the longer the communication delay time Δ.sub.ij or the communication delay time Δ.sub.ji is, the larger the oscillations of the components of the distributed controller becomes, function g may be a function of which the size is decreased as the communication delay time Δ.sub.ij, or the communication delay time Δ.sub.ji becomes longer, or a monotonically decreasing function. In addition, when the communication delay time between the agents is not symmetric, function g may be determined according to the longer one of the two-way communication delay times, in which case, function g may be a function of max(Δ.sub.ij, Δ.sub.ji). Further, as described in the above description of the reference correction, when there is a request to maintain the average value of the state indicator values of all agents in the system, the contribution of the distributed controller to the state indicator value of the agent i and the contribution of the distributed controller to the state indicator value of the agent j need to be equal to each other. Therefore, the control gain may be set as follows:
G.sub.ij=G.sub.ji (20)
(53) Assuming that the above requirement is satisfied, the control gain G.sub.ij may be given as, for example, G.sub.ij=Γ.sup.max(Δij,Δji) . . . (21), using an integer Γ smaller than 1, the first communication delay time Δ.sub.ij, and the second communication delay time Δ.sub.ji. Alternatively, the control gain G.sub.ij may be G.sub.ij=1/{c.Math.max(Δ.sub.ij, Δ.sub.ji)} . . . (22). Here, c is a positive coefficient.
(54) In the above configuration, the two-way communication delay time (Δ.sub.ij, Δ.sub.ji) between the agents may be acquired in each agent by any method. According to one embodiment, each agent records a measurement time tm at the time of measuring the state indicator value, and transmits the state indicator value to the transmission destination agent together with the measurement time tm. In addition, the reception time tr when the transmission destination agent receives the state indicator value is recorded, and the communication delay time to the transmission destination agent may be calculated by subtracting the measurement time tm from the reception time tr. Here, the calculated communication delay time may be used for determination of the control gain in the agent of the transmission destination. Then, the communication delay time may be transmitted from the agent which receives the state indicator value to each agent which transmits the state indicator value, together with the reception notification of the state indicator value, and may be used for determination of the control gain in each agent.
(55) (D) Communication Sequence
(56)
(57) With reference to
(58) For example, assuming that the state indicator value x.sub.j4, which are measured at k=4 in agent j and transmitted from the agent j, is received by the agent i after k=7 in the agent i, the state indicator value x.sub.j4 is referred to as the x.sub.j term of the distributed controller u.sub.i from that point of time, the communication delay time Δ.sub.ij4, which is a difference between the measurement time and the reception time of the state indicator value x.sub.j4, is calculated and used for determination of the control gain G.sub.ij. Then, the state indicator value x.sub.j4 and the communication delay time Δ.sub.ij4 are used until the next state indicator value arrives from the agent j. Then, the reception notification of the state indicator value x.sub.j4 together with the communication delay time Δ.sub.ij4 is transmitted from the agent i to the agent j, and when the agent j receives the notification, the state indicator value x.sub.j4 is referred to as the x.sub.j term of the distributed controller u.sub.j of the agent j from that point of time, and the communication delay time Δ.sub.ij4 is used for determination of the control gain G.sub.ji. Subsequently, the latest state indicator value x.sub.j9 in the agent j is transmitted to the agent i. In the agent j, the state indicator value x.sub.j4 and the communication delay time Δ.sub.ij4 are used in the calculation of the distributed controller until the reception notification of the state indicator value x.sub.j9 arrives.
(59) (E) Calculation Simulation
(60) In the system illustrated in
(61) First,
(62) On the other hand, in the example of
(63) From the results of the above calculation simulation, it is confirmed that the average value of the state indicator values of the agents is maintained by the control protocol to which the above reference correction is applied and the state indicator value of each agent can converge to the expected consensus value. In addition, it is confirmed that the convergence of the state indicator value of each agent can be accelerated by performing the correction for applying the control gain determined according to the communication delay time between the agents. Further, it should be understood that the above advantageous effect is achieved even when the delay in signal transmission between the agents is not symmetric.
(64) In addition, the above control gain correction is not limited to the illustrated average consensus control, but may be also applied when a communication delay time occurs in other control forms, for example, consensus control, covering control, and distributed optimization control. Further, it should be understood that the effect of compensating for a worsening of the convergence of the state indicator value of each agent caused by the communication delay time can be obtained.
(65) Although the above description has been made related to the embodiments of the present disclosure, many modifications and changes can be easily made by those skilled in the art. It will be clear that the present disclosure is not limited only to the above-exemplified embodiments, but can be applied to various devices without departing from the concept of the disclosure.