VALIDATING AND COMPUTING STABILITY LIMITS OF HUMAN-IN-THE-LOOP ADAPTIVE CONTROL SYSTEMS
20180148069 · 2018-05-31
Inventors
Cpc classification
B60W50/08
PERFORMING OPERATIONS; TRANSPORTING
B60W2400/00
PERFORMING OPERATIONS; TRANSPORTING
G05B13/042
PHYSICS
B60W2050/0022
PERFORMING OPERATIONS; TRANSPORTING
G05D1/0088
PHYSICS
B60W2050/0017
PERFORMING OPERATIONS; TRANSPORTING
International classification
Abstract
Systems and methods for implementing and/or validating a model reference adaptive control (MRAC) for human-in-the-loop control of a vehicle system. A first operator model is applied to a first feedback-loop-based MRAC scheme, wherein the first operator model is configured to adjust a control command provided as an input to the MRAC scheme based at least in part on an actual action of the vehicle system and a reference action for the vehicle system with a time-delay. A stability limit of a first operating parameter is determined for the MRAC scheme based on the application of the first operator model to the first feedback-loop-based MRAC scheme. The MRAC scheme is validated in response to determining that expected operating conditions of the first operating parameter are within the determined stability limit of the first operating parameter.
Claims
1. A method of implementing a model reference adaptive control (MRAC) for a vehicle system, the method comprising: defining a first feedback-loop-based MRAC scheme, wherein the first feedback-loop-based MRAC scheme is configured to receive a control command, apply a reference model to determine a desired action for the vehicle system based on the control command, determine an actuator command based on the control command, transmit the actuator command to at least one actuator of the vehicle system, monitor a sensor to determine an actual action of the vehicle system in response to application of the actuator command by the at least one actuator, determine a system error based on a difference between the desired action determined by the reference model and the actual action, and adjust at least one adaptive parameter used to determine the actuator control command based on the determined system error; applying a first operator model to the first feedback-loop-based MRAC scheme, wherein the first operator model is configured to adjust the control command based at least in part on the actual action of the vehicle system and a reference action for the vehicle system with a time-delay; determining a stability limit of a first operating parameter of the first feedback-loop-based MRAC scheme based on the application of the first operator model to the first feedback-loop-based MRAC scheme; and validating the first feedback-loop-based MRAC scheme in response to determining that expected operating conditions of the first operating parameter are within the determined stability limit of the first operating parameter.
2. The method of claim 1, further comprising: receiving, by an electronic processor, the control command from a user control; and controlling the vehicle system by an electronic processor configured to apply the first feedback-loop-based MRAC scheme to generate the actuator command in response to a control command received from a user control.
3. The method of claim 2, wherein receiving the control command from a user control includes receiving a control command from a steering wheel, wherein the control command is indicative of a rotational position of the steering wheel.
4. The method of claim 2, wherein determining the stability limit of the first operating parameter includes determining whether the first feedback-loop-based MRAC scheme will cause the system error to approach zero regardless of variations in the first operating parameter due to human operator-based manipulations of the user control.
5. The method of claim 1, wherein controlling the vehicle system by the electronic processor further includes: determining, by the electronic processor, the actuator command based on the control command received from the user control and a previous actuator command value to ensure that the first operating parameter remains within the determined stability limit of the first operating parameter.
6. The method of claim 1, further comprising: determining that the expected operating conditions of the first operating parameters are not within the determined stability limit of the first operating parameter and, in response, adjusting at least one parameter of the first feedback-loop-based MRAC scheme.
7. The method of claim 1, further comprising: determining that the expected operating conditions of the first operating parameters are not within the determined stability limit of the first operating parameter and, in response, defining a second feedback-loop-based MRAC scheme and applying the first operator model to the second feedback-loop-based MRAC scheme.
8. The method of claim 1, wherein the first operating parameter of the first feedback-loop-based MRAC scheme includes a time-delay indicative of a period of time between the occurrence of the actual action and a corresponding corrective action applied by an operator to a user control.
9. The method of claim 8, wherein determining the stability limit of the first operating parameter of the first feedback-loop-based MRAC scheme includes determining whether the feedback-loop-based MRAC scheme will ensure that operation of the vehicle system remains stable regardless of a value of the time-delay parameter.
10. The method of claim 7, wherein determining the stability limit of the first operating parameter of the first feedback-loop-based MRAC scheme includes determining a range of time-delay values for which the first feedback-loop-based MRAC scheme will ensure that operation of the vehicle system remains stable, and wherein validating the first feedback-loop-based MRAC scheme includes determining that a range of expected time-delay values for the operator is within the determined range of time-delay values.
11. The method of claim 1, wherein the vehicle system includes an airplane control system and wherein the first feedback-loop-based MRAC scheme is configured to adjust the actuator to counteract an external force acting on the airplane and to maintain a desired path of travel.
12. The method of claim 11, wherein the external force acting on the airplane includes turbulence.
13. The method of claim 1, wherein the vehicle system includes an automobile system and wherein the first feedback-loop-based MRAC scheme is configured to regulate operation of at least one selected from a group consisting of an automobile steering system and an automobile braking system.
14. The method of claim 1, wherein applying the first operator model to the first feedback-loop-based MRAC scheme includes determining a mathematical model representative of the first operator model and applying the mathematical model of the first operator model to a mathematical model representative of the first feedback-loop-based MRAC scheme to determine an overall mathematical model representative of system operation under parallel control of both a human operator and the first feedback-loop-based MRAC scheme.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
DETAILED DESCRIPTION
[0028] Before any embodiments of the invention are explained in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the following drawings. The invention is capable of other embodiments and of being practiced or of being carried out in various ways.
[0029]
[0030] For example, the system illustrated in
[0031]
[0032] The control architecture illustrated in
[0033] However, before the controller 101 is able to adjust the actuators 107 in such a way that the actual performance is corrected to match the expected performance, the pilot of the airplane may also notice that the path of travel of the airplane is deviating from its intended straight path. In response, the pilot may adjust the position of the user control 111 in a way intended to offset/correct for the deviation in the path of travel. Accordingly, the controller 101 and the human operator (via the user control 111) both attempt to correct for the system error. However, the human-induced correction may inadvertently affect the ability of the controller 101 to correct the system error and, in some cases, the interference between the human-induced correction and the MRAC implemented by the controller 101 may not only prevent the controller 101 from correcting the system error but may also cause the steering of the airplane to become unstable.
[0034] To study the effect of human interactions with the MRAC control architecture, the system may be adjusted to apply an additional modeled feedback loop mechanism. For example, a human dynamics model 301, as discussed in further detail below, may be provided as a control model designed to represent an expected human response to detecting an actual performance that does not match the expected performance. In this way, the control architecture provided by the system of
[0035] Furthermore, in still other examples, the performance capabilities of the MRAC can be evaluated through modeling instead of through observation of actual system performance. For example, we start with the block diagram configuration given by
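$\dot{\xi}(t) = A_h \xi(t) + B_h \theta(t-\tau), \quad \xi(0) = \xi_0$ (1)
$c(t) = C_h \xi(t) + D_h \theta(t-\tau)$ (2)
where $\xi(t) \in \mathbb{R}^{n_\xi}$ is the internal human state vector, $\tau \in \mathbb{R}_+$ is the internal human time-delay, $A_h \in \mathbb{R}^{n_\xi \times n_\xi}$, $B_h \in \mathbb{R}^{n_\xi \times n_r}$, $C_h \in \mathbb{R}^{n_c \times n_\xi}$, $D_h \in \mathbb{R}^{n_c \times n_r}$, and $c(t) \in \mathbb{R}^{n_c}$ is the command produced by the human. The input to the human dynamics is given by
$\theta(t) \triangleq r(t) - E_h x(t)$ (3)
where $\theta(t) \in \mathbb{R}^{n_r}$, $r(t) \in \mathbb{R}^{n_r}$ is the bounded reference, $x(t) \in \mathbb{R}^{n}$ is the state vector (further details below) and $E_h \in \mathbb{R}^{n_r \times n}$ selects the states to be compared with $r(t)$.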
[0036] Next, at the inner loop architecture, we consider the uncertain dynamical system given by
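$\dot{x}_p(t) = A_p x_p(t) + B_p \Lambda u(t) + B_p \delta_p(x_p(t)), \quad x_p(0) = x_{p_0}$ (4)
where $x_p(t) \in \mathbb{R}^{n_p}$ is the state vector, $u(t) \in \mathbb{R}^{m}$ is the control input, $\delta_p: \mathbb{R}^{n_p} \to \mathbb{R}^{m}$ is an uncertainty, $A_p \in \mathbb{R}^{n_p \times n_p}$ is a known system matrix, $B_p \in \mathbb{R}^{n_p \times m}$ is a known control input matrix, and $\Lambda \in \mathbb{R}_+^{m \times m} \cap \mathbb{D}^{m \times m}$ is an unknown control effectiveness matrix. Furthermore, we assume that the pair $(A_p, B_p)$ is controllable and the uncertainty is parameterized as
$\delta_p(x_p) = W_p^T \sigma_p(x_p), \quad x_p \in \mathbb{R}^{n_p}$ (5)
where $W_p \in \mathbb{R}^{s \times m}$ is an unknown weight matrix and $\sigma_p: \mathbb{R}^{n_p} \to \mathbb{R}^{s}$ is a known basis function of the form $\sigma_p(x_p) = [\sigma_{p_1}(x_p), \sigma_{p_2}(x_p), \ldots, \sigma_{p_s}(x_p)]^T$. When the basis function is not known, the parameterization in (5) can be relaxed to
$\delta_p(x_p) = W_p^T \sigma_p^{nn}(V_p^T x_p) + \varepsilon_p^{nn}(x_p), \quad x_p \in D_{x_p}$ (6)
where $W_p \in \mathbb{R}^{s \times m}$ and $V_p \in \mathbb{R}^{n_p \times s}$ are unknown weight matrices, $\sigma_p^{nn}: D_{x_p} \to \mathbb{R}^{s}$ is a known basis composed of neural network function approximators, $\varepsilon_p^{nn}: D_{x_p} \to \mathbb{R}^{m}$ is an unknown residual error, and $D_{x_p}$ is a compact subset of $\mathbb{R}^{n_p}$.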
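[0037] To address command following at the inner loop architecture, let $x_c(t) \in \mathbb{R}^{n_c}$ be the integrator state satisfying
$\dot{x}_c(t) = E_p x_p(t) - c(t), \quad x_c(0) = x_{c_0}$ (7)
where $E_p \in \mathbb{R}^{n_c \times n_p}$ selects the subset of $x_p(t)$ that is to follow $c(t)$. Augmenting (4) with (7) and using (5) yields
$\dot{x}(t) = A x(t) + B \Lambda u(t) + B W_p^T \sigma_p(x_p(t)) + B_r c(t), \quad x(0) = x_0$ (8)
where
$A \triangleq \begin{bmatrix} A_p & 0_{n_p \times n_c} \\ E_p & 0_{n_c \times n_c} \end{bmatrix} \in \mathbb{R}^{n \times n}$ (9)
$B \triangleq \begin{bmatrix} B_p \\ 0_{n_c \times m} \end{bmatrix} \in \mathbb{R}^{n \times m}$ (10)
$B_r \triangleq \begin{bmatrix} 0_{n_p \times n_c} \\ -I_{n_c \times n_c} \end{bmatrix} \in \mathbb{R}^{n \times n_c}$ (11)
and $x(t) \triangleq [x_p^T(t), x_c^T(t)]^T \in \mathbb{R}^{n}$ is the augmented state vector, $x_0 \triangleq [x_{p_0}^T, x_{c_0}^T]^T \in \mathbb{R}^{n}$, and $n = n_p + n_c$. In this inner loop architecture setting, it is practically reasonable to set $E_h = [E_{h_p}, 0_{n_r \times n_c}]$ with $E_{h_p} \in \mathbb{R}^{n_r \times n_p}$.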
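The augmentation leading to (8) can be sketched numerically. The following minimal Python sketch assumes a hypothetical two-state, single-input plant with one tracked output; the helper name `augment` and all numeric values are illustrative and are not taken from this disclosure. It stacks the plant with the integrator state of (7), so that A gains a row containing E_p, B is padded with zeros, and B_r carries the negative identity block through which the command c(t) enters.

```python
def augment(A_p, B_p, E_p):
    """Build A, B, B_r of the integrator-augmented system from plant data.

    A_p is n_p x n_p, B_p is n_p x m, E_p is n_c x n_p (lists of lists).
    """
    n_p, m, n_c = len(A_p), len(B_p[0]), len(E_p)
    # A = [[A_p, 0], [E_p, 0]]
    A = [row + [0.0] * n_c for row in A_p] + [row + [0.0] * n_c for row in E_p]
    # B = [[B_p], [0]]
    B = [list(row) for row in B_p] + [[0.0] * m for _ in range(n_c)]
    # B_r = [[0], [-I]]: the command enters only through the integrator state
    B_r = [[0.0] * n_c for _ in range(n_p)]
    B_r += [[-1.0 if i == j else 0.0 for j in range(n_c)] for i in range(n_c)]
    return A, B, B_r


# Hypothetical plant (illustrative values only)
A_p = [[0.0, 1.0], [-2.0, -3.0]]
B_p = [[0.0], [1.0]]
E_p = [[1.0, 0.0]]  # the first plant state is the one that follows the command
A, B, B_r = augment(A_p, B_p, E_p)
```

With n_p = 2 and n_c = 1 the augmented state has dimension n = 3, matching n = n_p + n_c.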
[0038] Finally, consider the feedback control law at the inner loop architecture given by
$u(t) = u_n(t) + u_a(t)$ (12)
where $u_n(t) \in \mathbb{R}^{m}$ and $u_a(t) \in \mathbb{R}^{m}$ are the nominal and adaptive control laws, respectively. Furthermore, let the nominal control law be
$u_n(t) = -K x(t)$ (13)
with $K \in \mathbb{R}^{m \times n}$, such that $A_r \triangleq A - BK$ is Hurwitz. Such a $K$ exists, for instance, if $(A, B)$ is a controllable pair. Using (12) and (13) in (8) next yields
$\dot{x}(t) = A_r x(t) + B_r c(t) + B \Lambda [u_a(t) + W^T \sigma(x(t))]$ (14)
where $W \triangleq [\Lambda^{-1} W_p^T, (\Lambda^{-1} - I_{m \times m}) K]^T \in \mathbb{R}^{(s+n) \times m}$ is an unknown aggregated weight matrix and $\sigma(x(t)) \triangleq [\sigma_p^T(x_p(t)), x^T(t)]^T \in \mathbb{R}^{s+n}$ is a known aggregated basis function. Considering (14), let the adaptive control law be
$u_a(t) = -\hat{W}^T(t) \sigma(x(t))$ (15)
where $\hat{W}(t) \in \mathbb{R}^{(s+n) \times m}$ is the estimate of $W$ satisfying the parameter adjustment mechanism
$\dot{\hat{W}}(t) = \gamma \sigma(x(t)) e^T(t) P B, \quad \hat{W}(0) = \hat{W}_0$ (16)
where $\gamma \in \mathbb{R}_+$ is the learning rate, and the system error is
$e(t) \triangleq x(t) - x_r(t)$ (17)
with $x_r(t) \in \mathbb{R}^{n}$ being the reference state vector satisfying the reference system
$\dot{x}_r(t) = A_r x_r(t) + B_r c(t), \quad x_r(0) = x_{r_0}$ (18)
and $P \in \mathbb{R}_+^{n \times n} \cap \mathbb{S}^{n \times n}$ is a solution of the Lyapunov equation
$0 = A_r^T P + P A_r + R$ (19)
with $R \in \mathbb{R}_+^{n \times n} \cap \mathbb{S}^{n \times n}$. Since $A_r$ is Hurwitz, there exists a unique $P \in \mathbb{R}_+^{n \times n} \cap \mathbb{S}^{n \times n}$ satisfying (19) for a given $R \in \mathbb{R}_+^{n \times n} \cap \mathbb{S}^{n \times n}$. Although we consider a specific yet widely studied parameter adjustment mechanism given by (16), one can also consider other types of parameter adjustment mechanisms without changing the essence of this invention.
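The interplay of the nominal law (13), the adaptive law (15), the update law (16), and the reference system (18) can be illustrated with a scalar simulation. This is a minimal sketch under stated assumptions, not the fifth-order aircraft design of this disclosure: the plant, gains, forward-Euler integration, and the function name `simulate_scalar_mrac` are all hypothetical, the control effectiveness is taken as one, and the command c is held constant (no human model in the loop).

```python
def simulate_scalar_mrac(t_end=20.0, dt=1e-3):
    """Forward-Euler simulation of a scalar MRAC loop with constant command."""
    # Hypothetical plant: x' = a*x + b*(u + w*x) + b_r*c, with w unknown
    a, b, w = 1.0, 1.0, 2.0
    k = 3.0                       # nominal gain, so a_r = a - b*k = -2 (Hurwitz)
    a_r, b_r = a - b * k, 2.0
    gamma, R = 10.0, 1.0          # learning rate and Lyapunov weight
    P = -R / (2.0 * a_r)          # scalar solution of 0 = 2*a_r*P + R
    x = x_r = w_hat = 0.0
    c = 1.0                       # constant command (assumption)
    for _ in range(int(round(t_end / dt))):
        e = x - x_r                               # system error, cf. (17)
        u = -k * x - w_hat * x                    # nominal plus adaptive law
        x_dot = a * x + b * (u + w * x) + b_r * c
        x_r_dot = a_r * x_r + b_r * c             # reference model, cf. (18)
        w_hat_dot = gamma * x * e * P * b         # update law, cf. (16)
        x += dt * x_dot
        x_r += dt * x_r_dot
        w_hat += dt * w_hat_dot
    return x - x_r, w_hat


final_error, final_weight = simulate_scalar_mrac()
```

Because the command is constant here, the reference state stays bounded and the error is expected to decay; the analysis that follows addresses the harder case in which the command is generated by a delayed human model.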
[0039] Based on the given problem formulation, the next section analyzes the stability of the coupled inner and outer loop architectures depicted in
[0040] Fundamental Stability Limit Calculation
[0041] To analyze the stability of the coupled inner and outer loop architectures introduced in the previous section, we first write the system error dynamics using (14), (15), and (18) as
$\dot{e}(t) = A_r e(t) - B \Lambda \tilde{W}^T(t) \sigma(x(t)), \quad e(0) = e_0$ (20)
where
$\tilde{W}(t) \triangleq \hat{W}(t) - W \in \mathbb{R}^{(s+n) \times m}$ (21)
is the weight error and $e_0 \triangleq x_0 - x_{r_0}$. Since $W$ is constant, it follows from (16) that
$\dot{\tilde{W}}(t) = \gamma \sigma(x(t)) e^T(t) P B, \quad \tilde{W}(0) = \tilde{W}_0$ (22)
where $\tilde{W}_0 \triangleq \hat{W}(0) - W$. The following lemma is now immediate.
[0042] Lemma 1.
[0043] Consider the uncertain dynamical system given by (4) subject to (5), the reference model given by (18), and the feedback control law given by (12), (13), (15), and (16). Then, the solution $(e(t), \tilde{W}(t))$ is Lyapunov stable for all $(e_0, \tilde{W}_0) \in \mathbb{R}^{n} \times \mathbb{R}^{(s+n) \times m}$ and $t \in \mathbb{R}_+$.
[0044] Proof.
[0045] To show Lyapunov stability of the solution $(e(t), \tilde{W}(t))$ given by (20) and (22) for all $(e_0, \tilde{W}_0) \in \mathbb{R}^{n} \times \mathbb{R}^{(s+n) \times m}$ and $t \in \mathbb{R}_+$, consider the Lyapunov function candidate
$V(e, \tilde{W}) = e^T P e + \gamma^{-1} \mathrm{tr}\, (\tilde{W} \Lambda^{1/2})^T (\tilde{W} \Lambda^{1/2})$ (23)
[0046] Note that $V(0,0) = 0$, $V(e, \tilde{W}) > 0$ for all $(e, \tilde{W}) \neq (0,0)$, and $V(e, \tilde{W})$ is radially unbounded. Differentiating (23) along the trajectories of (20) and (22) yields
$\dot{V}(e(t), \tilde{W}(t)) = -e^T(t) R e(t) \le 0$ (24)
where the result is now immediate.
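The step from (23) to (24) can be written out as follows. This is a sketch of the standard computation, assuming the error dynamics (20), the weight error dynamics (22), and the Lyapunov equation (19); it relies on the cyclic property of the trace, which turns the weight-update term into the exact negative of the cross term.

```latex
\begin{aligned}
\dot V
&= e^T (A_r^T P + P A_r) e
   - 2 e^T P B \Lambda \tilde W^T \sigma(x)
   + 2\gamma^{-1}\,\mathrm{tr}\big(\Lambda \tilde W^T \dot{\tilde W}\big) \\
&= -e^T R e
   - 2 e^T P B \Lambda \tilde W^T \sigma(x)
   + 2\,\mathrm{tr}\big(\Lambda \tilde W^T \sigma(x)\, e^T P B\big) \\
&= -e^T R e
   - 2 e^T P B \Lambda \tilde W^T \sigma(x)
   + 2 e^T P B \Lambda \tilde W^T \sigma(x) \\
&= -e^T R e \;\le\; 0 .
\end{aligned}
```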
[0047] Since the solution $(e(t), \tilde{W}(t))$ is Lyapunov stable for all $(e_0, \tilde{W}_0) \in \mathbb{R}^{n} \times \mathbb{R}^{(s+n) \times m}$ and $t \in \mathbb{R}_+$ from Lemma 1, this implies that $e(t) \in \mathcal{L}_\infty$ and $\tilde{W}(t) \in \mathcal{L}_\infty$. At this stage in our analysis, it should be noted that one cannot use Barbalat's lemma to conclude $\lim_{t \to \infty} e(t) = 0$. To elucidate this point, one can write
$\ddot{V}(e(t), \tilde{W}(t)) = -2 e^T(t) R [A_r e(t) - B \Lambda \tilde{W}^T(t) \sigma(e(t) + x_r(t))]$ (25)
where, since $x_r(t)$ can be unbounded due to the coupling between the inner and outer loop architectures, one cannot conclude the boundedness of (25), which is necessary for utilizing Barbalat's lemma in (24). Motivated by this, we next provide conditions that ensure the boundedness of the reference model states $x_r(t)$, which also reveal the fundamental stability limit (FSL) for guaranteeing closed-loop system stability. Two FSLs are provided below: a delay-independent FSL and a delay-dependent FSL.
[0048] Delay-Independent FSL
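[0049] A linear time invariant system subject to time delay can in some cases be stable regardless of how large the time delay is. We present the mathematical conditions under which the system at hand can be delay-independent stable. For this, start with using (2) in (18), together with (3) and (17), and first write
$\dot{x}_r(t) = A_r x_r(t) + B_r C_h \xi(t) - B_r D_h E_h x_r(t-\tau) - B_r D_h E_h e(t-\tau) + B_r D_h r(t-\tau)$ (26)
[0050] Next, it follows from (1) that
$\dot{\xi}(t) = A_h \xi(t) - B_h E_h x_r(t-\tau) - B_h E_h e(t-\tau) + B_h r(t-\tau)$ (27)
[0051] Finally, by letting $\phi(t) \triangleq [x_r^T(t), \xi^T(t)]^T$, and using (26) and (27), one can write
$\dot{\phi}(t) = A_0 \phi(t) + A_\tau \phi(t-\tau) + \varphi(\cdot), \quad \phi(0) = \phi_0$ (28)
where
$A_0 \triangleq \begin{bmatrix} A_r & B_r C_h \\ 0 & A_h \end{bmatrix}$ (29)
$A_\tau \triangleq \begin{bmatrix} -B_r D_h E_h & 0 \\ -B_h E_h & 0 \end{bmatrix}$ (30)
$\varphi(\cdot) \triangleq \begin{bmatrix} -B_r D_h E_h e(t-\tau) + B_r D_h r(t-\tau) \\ -B_h E_h e(t-\tau) + B_h r(t-\tau) \end{bmatrix}$ (31)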
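[0052] As a consequence of Lemma 1 and the boundedness of the reference $r(t)$, one can conclude that $\varphi(\cdot) \in \mathcal{L}_\infty$. We now state the following lemma that is necessary for the main result of this invention.
[0053] LEMMA 2. Let $P \in \mathbb{R}_+^{(n+n_\xi) \times (n+n_\xi)} \cap \mathbb{S}^{(n+n_\xi) \times (n+n_\xi)}$ and $S \in \mathbb{R}_+^{(n+n_\xi) \times (n+n_\xi)} \cap \mathbb{S}^{(n+n_\xi) \times (n+n_\xi)}$ be such that
$F \triangleq \begin{bmatrix} -(A_0^T P + P A_0 + S) & -P A_\tau \\ -A_\tau^T P & S \end{bmatrix} > 0$ (32)
holds. Then, $\phi(t)$ of the dynamical system given by (28) is bounded for any $\tau \in \mathbb{R}_+$ and for all $\phi(0) \in \mathbb{R}^{n+n_\xi}$ and $t \in \mathbb{R}_+$.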
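[0054] PROOF. Consider the Lyapunov-Krasovskii functional candidate given by
$V(\phi_t) = \phi^T(t) P \phi(t) + \int_{-\tau}^{0} \phi^T(t+\nu) S \phi(t+\nu)\, d\nu$ (33)
and, since $\varphi(\cdot) \in \mathcal{L}_\infty$, let $\varphi^* \in \mathbb{R}_+$ be such that $\|\varphi(\cdot)\|_2 \le \varphi^*$. Differentiating (33) along the trajectory of (28) yields
$\dot{V}(\phi_t) \le -\eta^T(t) F \eta(t) + 2 \lambda_{\max}(P) \varphi^* \|\phi(t)\|_2$ (34)
where $\eta(t) \triangleq [\phi^T(t), \phi^T(t-\tau)]^T$. Since (32) holds, let $k \in \mathbb{R}_+$ be such that $k \le \lambda_{\min}(F)$. Now, since $\|\phi(t)\|_2 \le \|\eta(t)\|_2$, it follows from (34) that
$\dot{V}(\phi_t) \le -k \|\eta(t)\|_2 \left( \|\eta(t)\|_2 - 2 k^{-1} \lambda_{\max}(P) \varphi^* \right)$ (35)
and hence, there exists a compact set $\mathcal{R} \triangleq \{\eta(t) \in \mathbb{R}^{2(n+n_\xi)} : \|\eta(t)\|_2 \le 2 k^{-1} \lambda_{\max}(P) \varphi^*\}$ such that $\dot{V}(\phi_t) < 0$ outside of this set. This proves the boundedness of $\phi(t)$ for any $\tau \in \mathbb{R}_+$ and for all $\phi(0) \in \mathbb{R}^{n+n_\xi}$ and $t \in \mathbb{R}_+$.
[0055] Lemma 2 establishes the boundedness of not only the reference model states, the dynamics of which are given by (18), but also the internal human dynamics given by (1), and hence, $x_r(t) \in \mathcal{L}_\infty$ and $\xi(t) \in \mathcal{L}_\infty$.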
[0056] Theorem 1.
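[0057] Consider the uncertain dynamical system given by (4) subject to (5), the reference model given by (18), the feedback control law given by (12), (13), (15), and (16), and the human dynamics given by (1), (2), and (3). Then, $e(t) \in \mathcal{L}_\infty$ and $\tilde{W}(t) \in \mathcal{L}_\infty$. If, in addition, there exist $P \in \mathbb{R}_+^{(n+n_\xi) \times (n+n_\xi)} \cap \mathbb{S}^{(n+n_\xi) \times (n+n_\xi)}$ and $S \in \mathbb{R}_+^{(n+n_\xi) \times (n+n_\xi)} \cap \mathbb{S}^{(n+n_\xi) \times (n+n_\xi)}$ such that (32) holds, then $x_r(t) \in \mathcal{L}_\infty$, $\xi(t) \in \mathcal{L}_\infty$, and $\lim_{t \to \infty} e(t) = 0$.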
[0058] Proof.
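[0059] As a consequence of Lemma 1, recall that $e(t) \in \mathcal{L}_\infty$ and $\tilde{W}(t) \in \mathcal{L}_\infty$. In addition, note that $\varphi(\cdot) \in \mathcal{L}_\infty$ in (28). Next, if there exist $P \in \mathbb{R}_+^{(n+n_\xi) \times (n+n_\xi)} \cap \mathbb{S}^{(n+n_\xi) \times (n+n_\xi)}$ and $S \in \mathbb{R}_+^{(n+n_\xi) \times (n+n_\xi)} \cap \mathbb{S}^{(n+n_\xi) \times (n+n_\xi)}$ such that (32) holds, then $x_r(t) \in \mathcal{L}_\infty$ and $\xi(t) \in \mathcal{L}_\infty$ follow from Lemma 2. Finally, since $e(t) \in \mathcal{L}_\infty$, $x_r(t) \in \mathcal{L}_\infty$, and $\tilde{W}(t) \in \mathcal{L}_\infty$ ensure the boundedness of (25), it follows from Barbalat's lemma that $\lim_{t \to \infty} e(t) = 0$.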
[0060] For the boundedness of all closed-loop system signals and $\lim_{t \to \infty} e(t) = 0$, Theorem 1 requires the fundamental stability limit given by the LMI (32) to hold. Note that this fundamental stability limit can be equivalently written in an equality form as
$0 = A_0^T P + P A_0 + P A_\tau S^{-1} A_\tau^T P + S + Q$ (36)
where $P$, $S$, and $Q$ each belong to $\mathbb{R}_+^{(n+n_\xi) \times (n+n_\xi)} \cap \mathbb{S}^{(n+n_\xi) \times (n+n_\xi)}$.
[0061] Notice above that we have employed a time-domain technique based on a Lyapunov-Krasovskii functional to prove delay-independent stability. A large body of literature has been devoted to this effort, where one main focus has been to reduce the inherent conservatism imposed by the choice of candidate functionals. Another method is to employ frequency-domain tools, where one instead studies the eigenvalues of the corresponding linear time invariant system with time delay. For example, consider the nominal part of (28); i.e., $\varphi(\cdot) = 0$, with $\tau \to \infty$. In this case, the system will behave like an open loop system whose stability is determined by the eigenvalues of $A_0$. For the system to be stable in this setting, $A_0$ must be Hurwitz, which also makes it invertible. Next, we note that the characteristic function of the dynamical system
$f := \det[sI - A_0 - A_\tau e^{-\tau s}]$ (37)
can be rearranged as
$\det[I - (sI - A_0)^{-1} A_\tau e^{-\tau s}] \cdot \det[sI - A_0]$ (38)
[0062] Note that for the class of time-delay systems being considered here, as a parameter of interest (e.g., the delay $\tau$) changes, the system can switch from a stable to an unstable regime (or vice versa) only if the system has imaginary eigenvalues $s = j\omega$. Investigating whether such a switch could arise then requires studying the zeros of the system characteristic function (38) at $s = j\omega$, where $\omega \ge 0$ without loss of generality. On the imaginary axis, however, only the first determinant can be zero, since the second determinant is always non-zero owing to $A_0$ being Hurwitz. Denoting by $\rho(\cdot)$ the spectral radius and noticing that $|e^{-j\tau\omega}| = 1$, we have the following theorem.
[0063] Theorem 2.
[0064] The dynamical system given by (28) with $\varphi(\cdot) = 0$ is asymptotically stable independent of delay if and only if
[0065] i) $A_0$ is asymptotically stable;
[0066] ii) $\rho((j\omega I - A_0)^{-1} A_\tau) < 1$, $\forall \omega > 0$; and
[0067] iii) either a) $\rho(A_0^{-1} A_\tau) < 1$ or b) $\rho(A_0^{-1} A_\tau) = 1$ and $\det(A_0 + A_\tau) \neq 0$.
[0068] Implementing the steps in the above theorem is straightforward. Condition i) can be checked by a standard eigenvalue computation, while condition ii) requires sweeping the frequency $\omega > 0$. Here one generates the matrix $(j\omega I - A_0)^{-1} A_\tau$ and, for a given $\omega$, computes its eigenvalues. If all these eigenvalues fall inside the unit circle, then condition ii) is satisfied for this $\omega$. This process is repeated for all $\omega$. Note that the inverse matrix operation here guarantees that, for sufficiently large $\omega$, condition ii) will always be satisfied, as the spectral radius keeps shrinking. Checking condition iii) is much simpler, as it does not require parametric scanning but only a computation of eigenvalues. Note that condition iii) is the special case of condition ii) computed at $\omega = 0$.
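The three checks of Theorem 2 can be sketched in code. The following Python sketch is restricted to 2-by-2 matrices so that inverses and eigenvalues have closed forms and only the standard library is needed; the function names, the finite frequency grid (`omega_max`, `n_grid`), and the tolerance are assumptions made for illustration. A finite grid only approximates the requirement "for all omega > 0", so a passing result is indicative rather than a proof.

```python
import cmath


def mat_mul(X, Y):
    return [[X[i][0] * Y[0][j] + X[i][1] * Y[1][j] for j in range(2)]
            for i in range(2)]


def mat_inv(M):
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return [[M[1][1] / det, -M[0][1] / det], [-M[1][0] / det, M[0][0] / det]]


def eigenvalues(M):
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return (tr + root) / 2, (tr - root) / 2


def spectral_radius(M):
    return max(abs(lam) for lam in eigenvalues(M))


def delay_independent_stable(A0, A_tau, omega_max=100.0, n_grid=20000, tol=1e-9):
    # Condition i): A0 must be Hurwitz
    if any(lam.real >= 0 for lam in eigenvalues(A0)):
        return False
    # Condition ii): sweep omega over a finite grid in (0, omega_max]
    for k in range(1, n_grid + 1):
        w = omega_max * k / n_grid
        M = [[1j * w - A0[0][0], -A0[0][1]], [-A0[1][0], 1j * w - A0[1][1]]]
        if spectral_radius(mat_mul(mat_inv(M), A_tau)) >= 1.0:
            return False
    # Condition iii): the omega = 0 case
    rho0 = spectral_radius(mat_mul(mat_inv(A0), A_tau))
    if rho0 < 1.0 - tol:
        return True
    S = [[A0[i][j] + A_tau[i][j] for j in range(2)] for i in range(2)]
    det_sum = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    return abs(rho0 - 1.0) <= tol and abs(det_sum) > tol


# Hypothetical test matrices (illustrative values only)
A0 = [[-2.0, 0.0], [0.0, -3.0]]
weak_delay_term = [[1.0, 0.0], [0.0, 1.0]]
strong_delay_term = [[3.0, 0.0], [0.0, 0.0]]
```

For the diagonal example, the spectral radius is at most 1/2 for the weak delayed term, while the strong delayed term gives a value near 3/2 at low frequency, so the first case passes and the second fails.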
[0069] Corollary 1.
[0070] Let the human dynamics given by (1), (2), and (3) be a single-input single-output (SISO) system with gain $k_p$. Then, for (28) with $\varphi(\cdot) = 0$ to be delay-independent stable, it is necessary that
$|k_p|\, \rho(A_r^{-1} B_r E_h) < 1$ (39)
holds.
[0071] Proof.
[0072] Start with (29) and (30) and rewrite the characteristic function (37) explicitly as
$f := \det[sI - A_r + B_r (C_h (sI - A_h)^{-1} B_h + D_h) E_h e^{-\tau s}]$ (40)
which simplifies to
$f := \det[sI - A_r + B_r E_h G(s) e^{-\tau s}]$ (41)
where $G(s)$ is the scalar transfer function corresponding to the SISO system given by (1) and (2). Note that the above expression is in the exact form of (37); hence, for (28) with $\varphi(\cdot) = 0$ to be delay-independent stable, it is necessary that condition i) of Theorem 2 holds, which in this case requires that $A_r$ be Hurwitz. As per the construction in (13), this always holds. Then, invoking condition ii) in Theorem 2 at $\omega = 0$, and recalling that $k_p = G(0)$, we have
$\rho((-A_r)^{-1} (-B_r E_h) G(0)) < 1$ (42)
which gives (39), and hence, the proof is now complete.
[0073] It is worth noting that the results in Corollary 1 can be further improved in many practical situations. For example, observe that the reference input to the human model and the human command are of dimension one in the SISO case. In addition, since generally the outer loop and inner loop command following objectives are the same, note that $E_{h_p} = E_p$.
[0074] Corollary 2.
[0075] Given $E_{h_p} = E_p$, for (28) with $\varphi(\cdot) = 0$ to be delay-independent stable, it is necessary that
$k_p < 1$ (43)
[0076] Proof.
[0077] Note that $A_r^{-1} B_r$ and $E_h$ in (39) are a column vector and a row vector, respectively. Therefore, we have $\rho(A_r^{-1} B_r E_h) = |E_h A_r^{-1} B_r|$. Since in the scalar case $E_h A_r^{-1} B_r = -1$, (43) follows.
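One way to see why $E_h A_r^{-1} B_r$ has unit magnitude in the scalar case is a steady-state argument on the reference system (18). The following is a sketch under the assumptions of a constant command $\bar{c}$ and $E_{h_p} = E_p$; the bar notation for steady-state values is introduced here for illustration only.

```latex
\begin{aligned}
0 &= A_r \bar{x}_r + B_r \bar{c}
  \;\;\Rightarrow\;\; \bar{x}_r = -A_r^{-1} B_r\, \bar{c}, \\
0 &= E_p \bar{x}_{r_p} - \bar{c}
  \quad \text{(integrator row of the reference system at rest)}, \\
E_h \bar{x}_r &= E_{h_p} \bar{x}_{r_p} = E_p \bar{x}_{r_p} = \bar{c}
  \;\;\Rightarrow\;\; -E_h A_r^{-1} B_r\, \bar{c} = \bar{c}
  \;\;\Rightarrow\;\; E_h A_r^{-1} B_r = -1 .
\end{aligned}
```

With $|E_h A_r^{-1} B_r| = 1$, condition (42) reduces to $|G(0)| = |k_p| < 1$, which is (43) for a positive pilot gain.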
[0078] In the above corollary, we prove that the human gain must be less than one for (28) with $\varphi(\cdot) = 0$ to have a chance of being delay-independent stable. Sufficiency can be checked numerically by studying condition ii) of Theorem 2 (see the next section). What is interesting in the above analysis is that the human's aggressiveness, as measured by $k_p$, can be a strong limiting factor that ruins delay-independent stability. When MRAC deals with highly aggressive human behavior with $k_p > 1$, it is impossible to avoid instability for some delay values $\tau$. Moreover, since by the design of a stable MRAC we have zero steady-state tracking error, the necessary condition $k_p < 1$ is inherent solely to the human's gain and holds irrespective of the controller gain $K$. While in many cases it is reasonable to treat the human model as SISO dynamics (e.g., when the human produces a single output to steer a manipulator), when an auto-human model is utilized in multi-input multi-output (MIMO) form, the necessary condition (42) can be revised as
$\rho(A_r^{-1} B_r [G(0)] E_h) < 1$ (44)
where $[G(0)]$ denotes the matrix transfer function of the MIMO auto-human model with $s = 0$ in all its entries.
[0079] It is important to note that while guaranteeing delay-independent stability in a dynamical system is attractive as this makes the system completely immune to destabilizing effects of delays, in some cases by the nature of the problem, delay-independent stability cannot be possible as is the case above for k.sub.p>1. Moreover, a trade-off in delay-independent stable cases is system's performance, which may deteriorate for large delays although stability is preserved. In light of this, we now turn our attention to the case when delay-independent stability is not possible, or not desired, and hence, system stability is affected by the numerical value of the delay in the dynamical system.
[0080] Delay Dependent FSL
[0081] The delay-independent FSL given in the previous section guarantees the boundedness of all closed loop system signals and $\lim_{t \to \infty} e(t) = 0$ for any $\tau \in \mathbb{R}_+$. Since the time delay in human dynamics can in general be known in practice for certain applications, at least within a certain range, it is possible to relax these conditions by utilizing the delay information in the stability analysis. Towards this goal, we first provide the following lemma.
[0082] Lemma 3.
[0083] Consider the system dynamics given by
$\dot{z}(t) = F z(t) + G z(t-\tau) + h(t, z(t)), \quad z(0) = z_0$ (45)
where $z(t) \in \mathbb{R}^{n}$ is the state vector, $F \in \mathbb{R}^{n \times n}$ and $G \in \mathbb{R}^{n \times n}$ are constant matrices, $\tau$ is the time delay, and $h(t, z(t))$ is a piecewise continuous and bounded nonlinear forcing term, which is in general a function of the state $z$. If the homogeneous dynamical system given by
$\dot{z}(t) = F z(t) + G z(t-\tau)$ (46)
is asymptotically stable, then the states of the original inhomogeneous dynamical system given by (45) remain bounded for all times.
[0084] Proof.
[0085] Since $h(t, z(t))$ is piecewise continuous and bounded, this signal can be considered as an exogenous input to the system with the transfer function
$G(s) = (sI - (F + G e^{-\tau s}))^{-1}$ (47)
[0086] Under the assumption that the homogeneous system (46) is asymptotically stable, all of the infinitely many roots of the characteristic equation
$\det(sI - (F + G e^{-\tau s})) = 0$ (48)
of the system (47) have strictly negative real parts. Therefore, the output $z(t)$ of the dynamical system remains bounded.
[0087] Having established Lemma 3, we are now ready to state the second main result of this invention, which provides a more relaxed delay-dependent stability condition for the overall human-in-the-loop system and convergence of the system error, e(t), to zero.
[0088] Theorem 3.
[0089] Consider the uncertain dynamical system given by (4) subject to (5), the reference model given by (18), the feedback control law given by (12), (13), (15), and (16), and the human dynamics given by (1), (2), and (3). Then, $e(t) \in \mathcal{L}_\infty$ and $\tilde{W}(t) \in \mathcal{L}_\infty$. If, in addition, all of the infinitely many roots of the following characteristic equation
$\det(sI - (A_0 + A_\tau e^{-\tau s})) = 0$ (49)
have strictly negative real parts, then $x_r(t) \in \mathcal{L}_\infty$, $\xi(t) \in \mathcal{L}_\infty$, and $\lim_{t \to \infty} e(t) = 0$.
[0090] Proof.
[0091] As a consequence of Lemma 1, recall that $e(t) \in \mathcal{L}_\infty$ and $\tilde{W}(t) \in \mathcal{L}_\infty$. In addition, note that $\varphi(\cdot) \in \mathcal{L}_\infty$ in (28). Therefore, if all of the roots of the characteristic equation given by (49) have strictly negative real parts, making the homogeneous equation
$\dot{\phi}(t) = A_0 \phi(t) + A_\tau \phi(t-\tau)$ (50)
asymptotically stable, then, per Lemma 3, $\phi(t) \triangleq [x_r^T(t), \xi^T(t)]^T \in \mathcal{L}_\infty$. Finally, since $e(t) \in \mathcal{L}_\infty$, $x_r(t) \in \mathcal{L}_\infty$, and $\tilde{W}(t) \in \mathcal{L}_\infty$ ensure the boundedness of (25), it now follows from Barbalat's lemma that $\lim_{t \to \infty} e(t) = 0$.
[0092] Note that there are several methods in the literature for the analysis of the root locations of (49). The four most-used methods are TRACE-DDE, DDE-BIFTOOL, QPMR, and the Lambert-W function. In essence, one provides the matrices $A_0$ and $A_\tau$ as well as the delay $\tau$ to these methods, which then return the numerical values of the rightmost root locations of (49). In some sense, these methods perform a nontrivial approximation with which they are able to identify the most relevant roots: the rightmost roots. In the illustrative numerical example provided below, we employ TRACE-DDE, readily available for download at https://users.dimi.uniud.it/dimitri.breda/research/software/.
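A time-domain simulation offers an independent sanity check on a rightmost-root prediction. The sketch below is not TRACE-DDE or any of the other tools named above; it is a plain forward-Euler integration of a hypothetical scalar delay system z'(t) = a0*z(t) + a_tau*z(t - tau) with a constant initial history, and the function name `late_peak` is illustrative. For a0 = 0 and a_tau = -1 this system is known to be asymptotically stable exactly when tau < pi/2, so the response should decay for tau = 1 and grow for tau = 3.

```python
from collections import deque


def late_peak(a0, a_tau, tau, t_end=60.0, dt=1e-3, z0=1.0):
    """Return max |z(t)| over the last quarter of the simulation window."""
    n_delay = int(round(tau / dt))
    # history[0] always holds z(t - tau); constant initial history z = z0
    history = deque([z0] * (n_delay + 1), maxlen=n_delay + 1)
    steps = int(round(t_end / dt))
    z, peak = z0, 0.0
    for k in range(steps):
        z += dt * (a0 * z + a_tau * history[0])  # forward-Euler step
        history.append(z)                        # oldest sample drops out
        if k >= 3 * steps // 4:
            peak = max(peak, abs(z))
    return peak


stable_peak = late_peak(0.0, -1.0, tau=1.0)    # tau below pi/2
unstable_peak = late_peak(0.0, -1.0, tau=3.0)  # tau above pi/2
```

A decaying late-window peak is consistent with all roots lying in the open left half plane, while a growing one indicates at least one root with positive real part; the simulation cannot by itself locate the roots.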
Illustrative Example
[0093] Consider the longitudinal motion of a Boeing 747 airplane linearized at an altitude of 40 kft and a velocity of 774 ft/sec with the dynamics given by
$\dot{x}(t) = A_p x(t) + B_p (u(t) + W^T \sigma(x(t))), \quad x(0) = x_0$ (51)
where $x(t) = [x_1(t), x_2(t), x_3(t), x_4(t)]^T$ is the state vector. Note that (51) can be equivalently written as (4) with $\Lambda = I$. Here, $x_1(t)$, $x_2(t)$, and $x_3(t)$ respectively represent the components of the velocity along the x, z and y axes of the aircraft with respect to the reference axes (in crad/sec), and $x_4(t)$ represents the pitch Euler angle of the aircraft body axis with respect to the reference axes (in crad). Recall that 0.01 radian = 1 crad (centiradian). In addition, $u(t)$ represents the elevator control input (in crad). Finally, $W \in \mathbb{R}^{3}$ is an unknown weighting matrix and $\sigma(x(t)) = [1, x_1(t), x_2(t)]^T$ is a known basis function. In the following simulations, we set W=[0.1 0.3 0.3]. The dynamical system given in (51) is assumed to be controlled using a model reference adaptive controller. In addition, the aircraft is assumed to be operated by a pilot whose Neal-Schmidt Model is given by
$\frac{c(s)}{\theta(s)} = k_p \frac{T_z s + 1}{T_p s + 1} e^{-\tau s}$ (52)
where $k_p$ is the positive scalar pilot gain, $T_p$ and $T_z$ are positive scalar time constants, and $\tau$ is the pilot reaction time delay. The values of the parameters used in the simulations are provided in Table 1.
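To connect a lead-lag pilot model to the state-space form (1)-(2), the sketch below assumes that the delay-free part of the pilot model is the transfer function k_p*(T_z*s + 1)/(T_p*s + 1); this form is an assumption, chosen because it matches the roles given to k_p, T_p, and T_z. It builds one possible scalar realization (A_h, B_h, C_h, D_h) and checks it against the transfer function. The function names and the gain value k_p = 0.5 are illustrative; T_p = 1 and T_z = 5 follow Table 1.

```python
def lead_lag_realization(k_p, T_p, T_z):
    """One scalar realization (A_h, B_h, C_h, D_h) of k_p*(T_z*s+1)/(T_p*s+1)."""
    A_h = -1.0 / T_p
    B_h = 1.0 / T_p
    C_h = k_p * (1.0 - T_z / T_p)
    D_h = k_p * T_z / T_p
    return A_h, B_h, C_h, D_h


def tf_from_realization(realization, s):
    """Evaluate C_h*(s - A_h)^(-1)*B_h + D_h at a complex frequency s."""
    A_h, B_h, C_h, D_h = realization
    return C_h * B_h / (s - A_h) + D_h


def tf_lead_lag(k_p, T_p, T_z, s):
    return k_p * (T_z * s + 1.0) / (T_p * s + 1.0)


realization = lead_lag_realization(k_p=0.5, T_p=1.0, T_z=5.0)
dc_gain = tf_from_realization(realization, 0.0)  # equals k_p, i.e. G(0)
```

The pole of this realization sits at -1/T_p, and the direct feedthrough D_h = k_p*T_z/T_p is what makes the delayed input appear in the output equation (2) as well as in the state equation (1).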
[0094] To obtain the nominal controller $K$, a linear quadratic regulator (LQR) approach is utilized with the following objective function to be minimized
$J(\cdot) = \int_{0}^{\infty} (x^T(t) Q x(t) + \mu u^2(t))\, dt$ (53)
where $Q$ is a positive-definite weighting matrix of appropriate dimension and $\mu$ is a positive weighting scalar. Notice that the framework developed above is not limited to a particular design method for the nominal controller; this task can be handled in a number of different ways, and LQR is utilized here for convenience. In this setting, the selection of the weighting matrices, as expected, will affect the resulting nominal controller gain $K$ in (13), which in turn will determine the reference model dynamics (18). In the following simulation studies, the effect of the weighting matrices, and thus the effect of the reference model parameters on system stability, is investigated for various values of the pilot model parameters. To facilitate the analysis, reference model parameter variations are achieved mainly by manipulating the control penalty variable $\mu$.
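The way the control penalty shapes the nominal gain, and through it the reference model, can be seen in closed form for a scalar plant. The sketch below assumes a hypothetical first-order plant x' = a*x + b*u with cost integral of (q*x^2 + mu*u^2); it is not the fifth-order design of this example and does not use MATLAB. The scalar algebraic Riccati equation 2*a*p - (b^2/mu)*p^2 + q = 0 has a positive root p, and the gain is k = b*p/mu. The function name `scalar_lqr` and all numeric values are illustrative.

```python
import math


def scalar_lqr(a, b, q, mu):
    """Return (k, a_r): LQR gain and closed-loop pole for x' = a*x + b*u."""
    # Positive root of 2*a*p - (b**2 / mu) * p**2 + q = 0
    p = mu * (a + math.sqrt(a * a + b * b * q / mu)) / (b * b)
    k = b * p / mu      # control law u = -k*x
    a_r = a - b * k     # closed-loop (reference model) pole
    return k, a_r


# Sweep the control penalty for a hypothetical unstable plant
for mu in (0.1, 1.0, 10.0, 50.0):
    k, a_r = scalar_lqr(a=1.0, b=1.0, q=1.0, mu=mu)
    print(f"mu={mu:5.1f}  k={k:.4f}  a_r={a_r:.4f}")
```

In this scalar case the closed-loop pole is -sqrt(a^2 + b^2*q/mu), so a larger penalty mu gives a smaller gain and a slower reference model, which is the kind of dependence the sweeps over the control penalty are meant to expose.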
TABLE 1
Parameter    Value
$T_p$        1
$T_z$        5
$\tau$       0.5
$A_p$        [-0.003 0.039 0 -0.322; -0.065 -0.319 7.740 0; 0.020 -0.101 -0.429 0; 0 0 1 0]
$B_p$        [0.010 -0.180 -1.160 0]^T
$E_p$        [0 0 0 1]
$E_h$        [0 0 0 1 0]
$B_r$        [0 0 0 0 -1]^T
$Q$          diag([0 0 0 1 2.5])
[0095] Note that the purpose of the numerical examples provided in this section is to verify the theoretical stability predictions of the proposed framework. Therefore, the simulation results are created to present the stability/instability of the closed loop system without paying attention to enhanced transient response characteristics.
[0096] Delay-Independent Stability: LMI Approach:
[0097] We set k.sub.p= without loss of generality and investigate whether or not the closed loop is delay-independent stable. Specifically, we first use the LQR control designer in MATLAB with =1.0 to design K, which returns K=[0.0185, 0.0815, 1.5809, 2.7560, 1.5811]. Next the matrices A.sub.0 and A.sub. are constructed based on the information provided on Table 1. Assigning P and S as positive definite variables greater than 0.5I.sup.(n+n.sup.
.sup.(n+n.sup.
[0098] Delay-Independent Stability: Frequency-Domain Approach:
[0099] To be consistent with the previous subsection, we keep the same k.sub.p and set the control penalty to 1.0 in the LQR optimization. Based on Corollary 2, since k.sub.p<1 and A.sub.r is Hurwitz, the necessary conditions for delay-independent stability are satisfied. Next, the sufficient conditions in Theorem 2 are checked by computing the metric in conditions ii) and iii) of the theorem over nonnegative frequencies. We find that the metric equals k.sub.p at zero frequency (condition iii)) and decreases at higher frequencies (condition ii)), always remaining less than 1. That is, the closed-loop system remains stable for any choice of the delay. Keeping the control penalty at 1.0 but letting k.sub.p=0.95 has only negligible effects on K, and the system again remains delay-independent stable under the conditions of Theorem 2. On the other hand, selecting k.sub.p=1.05 violates the theorem, and the system loses its delay-independent stability.
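The frequency sweep described above can be mimicked on a much simpler system. The sketch below is not the metric of Theorem 2; it assumes a hypothetical scalar delay system dx/dt = -a*x(t) + b*x(t - tau), for which a gain |b/(j*omega + a)| below 1 at every frequency, together with stability of the delay-free system, is a sufficient condition for stability at any delay. The function names and the frequency grid are illustrative choices.

```python
import math

def small_gain_metric(a, b, omega):
    """|b / (j*omega + a)| for dx/dt = -a*x(t) + b*x(t - tau)."""
    return abs(b) / math.hypot(a, omega)

def is_delay_independent_stable(a, b, omegas):
    """Sufficient check: delay-free stability (a > b) and gain < 1 on the grid."""
    return a > b and all(small_gain_metric(a, b, w) < 1.0 for w in omegas)

# Frequency grid from 0 to 1000 rad/s (an arbitrary illustrative range).
omegas = [0.01 * i for i in range(100001)]

# With a = 1 the metric starts at b for omega = 0 and decreases afterwards.
for gain in (0.95, 1.05):
    print(gain, is_delay_independent_stable(1.0, gain, omegas))
```

With a = 1, the metric of this toy system starts at the gain b at zero frequency and decreases monotonically, so a gain of 0.95 passes the check and a gain of 1.05 fails it.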
[0100] Delay-Dependent Stability: Effect of Control Penalty on System Stability for Different Pilot Reaction Time Delays:
[0101] To investigate the effects of the reference model parameter variations on the stability of the closed loop system, the control weight is manipulated by assigning values in the range [0, 50]. Then, the rightmost pole (RMP) of the system, whose characteristic equation is given by (49), is plotted against these values. This procedure is repeated for various pilot reaction time delays and the results are presented in
[0102]
[0103] It is predicted in
[0104] Delay-Dependent Stability: Effect of Control Penalty on System Stability for Different Values of Pilot Model Poles:
[0105] The poles of the pilot model (52) represent how fast the pilot responds to changes in the aircraft pitch angle, which can also be interpreted as pilot aggressiveness. In this section, the effect of pilot aggressiveness on system stability is investigated while assigning values to the control penalty from 0 to 50.
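The rightmost-pole sweeps used in these delay-dependent studies can be sketched numerically. The example below assumes a generic linear system with one delay, dx/dt = A0 x(t) + A1 x(t - tau), and approximates its characteristic roots with a Chebyshev pseudospectral discretization. This is one common numerical technique and is not stated in the text to be the method used for characteristic equation (49). The scalar test system and the names `cheb` and `rightmost_pole` are illustrative assumptions.

```python
import numpy as np

def cheb(N):
    """Chebyshev differentiation matrix on [-1, 1]; nodes run from 1 down to -1."""
    x = np.cos(np.pi * np.arange(N + 1) / N)
    c = np.hstack([2.0, np.ones(N - 1), 2.0]) * (-1.0) ** np.arange(N + 1)
    X = np.tile(x, (N + 1, 1)).T
    dX = X - X.T
    D = np.outer(c, 1.0 / c) / (dX + np.eye(N + 1))
    D -= np.diag(D.sum(axis=1))
    return D, x

def rightmost_pole(A0, A1, tau, N=20):
    """Approximate rightmost root of det(sI - A0 - A1*exp(-s*tau)) = 0, tau > 0."""
    n = A0.shape[0]
    D, _ = cheb(N)
    # Differentiation on the delay interval [-tau, 0]; node 0 is theta = 0.
    M = np.kron((2.0 / tau) * D, np.eye(n))
    # Replace the rows at theta = 0 with the delay equation itself.
    M[:n, :] = 0.0
    M[:n, :n] = A0
    M[:n, -n:] = A1
    return max(np.linalg.eigvals(M), key=lambda s: s.real)

# Hypothetical scalar example: dx/dt = -x(t - tau).
A0 = np.array([[0.0]])
A1 = np.array([[-1.0]])
for tau in (0.5, 1.0, 1.5, 2.0):
    print(f"tau={tau:3.1f}  rightmost pole real part={rightmost_pole(A0, A1, tau).real:+.4f}")
```

For this scalar example the real part of the rightmost pole changes sign near tau = pi/2, the known stability boundary of dx/dt = -x(t - tau). Repeating the computation over a grid of a design parameter gives the kind of rightmost-pole curve discussed in this section.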
[0106]
[0107]
[0108] Delay-Dependent Stability: Effect of Control Penalty on System Stability for Different Values of Pilot Model Zeros:
[0109] In this section, the effect of the zeros of the pilot transfer function (52) on system stability is investigated when the control penalty takes values in the range [0, 50]. The pole location and the time delay of the pilot transfer function are kept at their nominal values of 0.2 and 0.5, respectively. Changes in the zero location of the model can be interpreted as an adjustment to the lead nature of the pilot, which is related to the pilot's anticipation capabilities.
[0110] As seen in
[0111]
[0112] Delay-Dependent Stability: Effect of Control Penalty on System Stability for Different Values of Pilot Model Gains:
[0113] The pilot gain k.sub.p in (52) determines the intensity of the pilot's response to pitch angle deviations of the aircraft. In some sense, this gain also represents the aggressiveness of the pilot.
[0114] Stability properties of the pilot-in-the-loop system as a function of the nominal control penalty and the pilot gain k.sub.p are presented in
[0115] It is predicted in
[0116] To summarize, the presented invention analyzes human-in-the-loop model reference adaptive control architectures and explicitly derives a fundamental stability limit for both the delay-independent and the delay-dependent stability cases. Specifically, this stability limit results from the coupling between the outer and inner loop architectures, where the outer loop portion includes the human dynamics modeled as a linear dynamical system with time delay and the inner loop portion includes the uncertain dynamical system, the reference model, the parameter adjustment mechanism, and the controller. When a given set of human model and reference model parameters satisfies this stability limit, the closed-loop system trajectories are guaranteed to be stable. The theoretical stability predictions of the proposed approach were verified via the simulation studies presented above. While the main focus of this invention is to reveal and compute the stability limit of human-in-the-loop model reference adaptive control architectures, the effect of the controller design parameters on the transient response is another important direction for future research.
[0117] The techniques described above can be applied and adapted in various ways. For example,
[0118] The method begins by applying the operator model to the MRAC (step 1301), for example, as described above in reference to
[0119] If the selected MRAC is confirmed to provide control-variable-independent stability for a selected control variable (e.g., time-delay-independent stability), then the MRAC is validated and the MRAC is used to control the vehicle system as illustrated in the example of
[0120] In some implementations, the method of
[0121] The techniques and framework described above can also be adapted to govern the operation of a vehicle using the controller 101.
[0122] For example, in reference to
[0123] Thus, the invention provides, among other things, systems and methods for validating and ensuring the stability of a control architecture. Various features and advantages of the invention are set forth in the following claims.