NEURAL NETWORK ADAPTIVE TRACKING CONTROL METHOD FOR JOINT ROBOTS

Abstract

The present disclosure provides a neural network adaptive tracking control method for joint robots, which proposes two schemes, robust adaptive control and neural adaptive control, and comprises the following steps: 1) establishing a joint robot system model; 2) establishing a state space expression and an error definition that account for both drive failure and actuator saturation of the joint robot system; 3) designing a PID controller and updating algorithms for the joint robot system; and 4) using the designed PID controller and updating algorithms to control the trajectory motion of the joint robot. The present disclosure may address the following technical problems at the same time: drive saturation and coupling effects in the joint system, parametric and non-parametric uncertainty, actuator failure during system operation, and compensation for non-vanishing interference.

Claims

1. A neural network adaptive tracking control method for joint robots, the neural network adaptive tracking control method comprising:

1) establishing a joint robot system model:

$D_q(q)\ddot{q} + C_q(q,\dot{q})\dot{q} + G_q(q) + \tau(\dot{q},t) = u_a$

in the joint robot system model mentioned above, $q$ represents a position vector of the joint robot, $\dot{q}$ represents a velocity vector of the joint robot, $\ddot{q}$ represents an acceleration vector of the joint robot, $u_a$ represents a control input of the joint robot system, the system parameter $D_q(q)$ represents an inertia matrix of the joint robot system, the system parameter $C_q(q,\dot{q})$ represents a Coriolis and centrifugal matrix of the joint robot system, the system parameter $G_q(q)$ represents a gravity vector of the joint robot system, and the system parameter $\tau(\dot{q},t)$ represents uncertainty and interference factors of the joint robot system;

2) establishing a state space expression and an error definition when taking into consideration both the drive failure and actuator saturation of the joint robot system:

$u_a(t) = \rho(t)[\Gamma(0) + L(\xi)\nu + \varepsilon(\nu)] + \varepsilon(t) = \rho(t)L(\xi)\nu + [\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)]$

$e = x_1 - q^*$

$\ddot{e} = \ddot{x}_1 - \ddot{q}^* = D_q^{-1}(q)\rho(t)L(\xi)\nu + D_q^{-1}(q)[\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)] + F(\cdot) + Q(x_1,t) - \ddot{q}^*$

in the above formulas, $u_a(t)$ represents a system control input signal considering both drive failure and actuator saturation; $\Gamma(0) + L(\xi)\nu + \varepsilon(\nu)$ represents a control signal in the case of actuator saturation, wherein $\nu$ represents an actual controller design quantity of the system, $\Gamma(0) + L(\xi)\nu$ represents a smooth function obtained by applying the mean value theorem with respect to $\nu$, $\Gamma(0)$ is a bounded matrix, $L(\xi)$ is a positive definite matrix, and $\varepsilon(\nu)$ is a bounded approximation error and represents an uncertain factor of the controller; $\rho(t)$ represents a health coefficient of the driver, and $\varepsilon(t)$ represents an interference factor of the driver; $e$ (or $e(\cdot)$) represents a dynamic error of the system ($e(\cdot)$ is written as $e$ for simplification in subsequent derivation), and $\ddot{e}$ represents the second derivative of the dynamic error, wherein $x_1 = q$ represents a motion trajectory of the joint robot, $\ddot{x}_1$ represents an acceleration of the joint robot motion, $q^*$ represents a given joint tracking trajectory, and $\ddot{q}^*$ represents an acceleration of the given joint tracking trajectory; $F(\cdot) = D_q^{-1}(q)(C_q(q,\dot{q})\dot{q} + G_q(q))$, and $Q(x_1,t) = D_q^{-1}(q)\tau(\dot{q},t)$;

3) designing a PID controller and updating algorithms of the joint robot system: the PID controller $\nu$ is expressed as

$\nu = -(k_{D0} + \Delta k_D(t))\left(2\gamma e(\cdot) + \gamma^2 \int_0^t e(\cdot)\,d\tau + \frac{de(\cdot)}{dt}\right)$

wherein $\gamma$ is a design parameter chosen freely by the designer, and $k_{D0}$ is a constant chosen by the designer; wherein the updating algorithms consist of two algorithms as follows:

(1) algorithm based on the robust adaptive control: the robust adaptive algorithm is designed for automatically updating the controller parameters at an updating rate of:

$\Delta k_D(t) = \hat{c}\,\varphi_0^2(\cdot)$

$\dot{\hat{c}} = -\sigma_0 \hat{c} + \sigma_1 \varphi_0^2(\cdot)\|E\|^2, \quad \hat{c}(0) \geq 0$

wherein $\sigma_0$ and $\sigma_1$ are positive constants chosen by the designer;

$c = \max\{a_1, \tfrac{1}{2}\gamma_d\}, \quad \varphi_0(\cdot) = \varphi_1(\cdot) + \|\dot{q}\|\,\|E\|$

wherein $\hat{c}$ is an estimated value of $c$; $a_1 = \max\{\gamma_d a_f, \gamma_d\gamma^2, 2\gamma_d\gamma, \gamma_d\bar{x}_2\}$, $\varphi_1(\cdot) = \varphi_f(\cdot) + \|e\| + \|\dot{e}\| + 1$, wherein $a_f\varphi_f(\cdot)$ is a product of the constant $a_f$ and the scalar function $\varphi_f(\cdot)$, representing the upper bound of the system uncertainty factor $D_q^{-1}(q)[\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)] + F(\cdot) + Q(x_1,t) - \dot{q}^*$, $\bar{x}_2$ is the upper bound of the second derivative $\ddot{q}^*$ of a given joint motion trajectory, $\gamma_d$ is the upper bound of the system parameter $D_q(q)$, and it is set that

$E = 2\gamma e(\cdot) + \gamma^2 \int_0^t e(\cdot)\,d\tau + \frac{de(\cdot)}{dt};$

(2) algorithm based on the neural adaptive control: the neural adaptive algorithm is designed for automatically updating the controller parameters at an updating rate of:

$\dot{\hat{b}} = -\theta_0 \hat{b} + \theta_1 \psi^2(\cdot)\|E\|^2, \quad \hat{b}(0) \geq 0$

$\Delta k_D(t) = \hat{b}\,\psi^2(\cdot)$

wherein $\theta_0$ and $\theta_1$ are positive constants chosen by the designer; $\psi(\cdot) = \|S(\cdot)\| + 1$, wherein $S(\cdot)$ is a basis function of a neural network, and $S(\cdot)$ and the number of neurons are chosen by the designer; $b = \max\{\|W^T\|, m\}$, wherein $\hat{b}$ is an estimated value of $b$, $W^T$ is an ideal unknown weight, and $m$ is the upper limit of a reconstruction error $\|\eta(\cdot)\|$ of the model;

$E = 2\gamma e(\cdot) + \gamma^2 \int_0^t e(\cdot)\,d\tau + \frac{de(\cdot)}{dt};$

4) using the PID controller and the updating algorithms designed in step 3) for the joint robot system to control the trajectory motion of the joint robot.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

[0022] FIG. 1 is a block diagram of the control algorithm design of the system. According to the present disclosure, the controller gain is adaptively adjusted by using the robust adaptive algorithm and the neural adaptive algorithm respectively, so that the joint motion trajectory of the controlled robot follows the ideal trajectory.

[0023] FIG. 2 is a schematic diagram of actuator saturation, which includes an asymmetric, non-smooth saturation function and a smooth approximation function; when $\nu$ reaches a certain value, the control input $u$ saturates.

[0024] FIGS. 3 and 4 are, respectively, a simulation diagram of the controller gain adjustment and a joint robot position tracking curve, both obtained by simulation with the robust adaptive control method of the embodiment, wherein $\Delta k_P$, $\Delta k_I$, and $\Delta k_D$ respectively represent the changes of the three time-varying gains of the PID controller, and $e_1$, $e_2$, and $e_3$ respectively represent the trajectory errors of the three joint motions of the robot.

[0025] FIGS. 5 and 6 are, respectively, a simulation diagram of the controller gain adjustment and a joint robot position tracking curve, both obtained by simulation with the neural adaptive control method of the embodiment, wherein $\Delta k_P$, $\Delta k_I$, and $\Delta k_D$ respectively represent the changes of the three time-varying gains of the PID controller, and $e_1$, $e_2$, and $e_3$ respectively represent the trajectory errors of the three joint motions of the robot.

[0026] FIG. 7 is a diagram of the joint robot model.

DETAILED DESCRIPTION

[0027] The present disclosure will be further described with reference to figures and embodiments below to enable the implementation by those skilled in the art according to the text of the specification.

[0028] In this embodiment, the neural network adaptive tracking control method for joint robots includes the following steps:

[0029] 1) Establishing a joint robot system model:

$D_q(q)\ddot{q} + C_q(q,\dot{q})\dot{q} + G_q(q) + \tau(\dot{q},t) = u_a$

[0030] In the model mentioned above, $q$ represents a position vector of the joint robot, $\dot{q}$ represents a velocity vector of the joint robot, $\ddot{q}$ represents an acceleration vector of the joint robot, $u_a$ represents a control input of the joint robot system, the system parameter $D_q(q)$ represents an inertia matrix of the joint robot system, the system parameter $C_q(q,\dot{q})$ represents a Coriolis and centrifugal matrix of the joint robot system, the system parameter $G_q(q)$ represents a gravity vector of the joint robot system, and the system parameter $\tau(\dot{q},t)$ represents uncertainty and interference factors of the joint robot system;
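As an illustration of how the model in step 1) can be used numerically, the following minimal sketch solves the model equation for the joint acceleration. The function name `joint_acceleration` and all numerical values are hypothetical and are not taken from the disclosure.

```python
import numpy as np

def joint_acceleration(D, C, G, tau, q_dot, u_a):
    """Solve D*q_ddot + C*q_dot + G + tau = u_a for q_ddot."""
    rhs = u_a - C @ q_dot - G - tau
    # Solve the linear system instead of forming the inverse of D explicitly.
    return np.linalg.solve(D, rhs)

# Hypothetical values for a two-joint robot at one time instant.
D = np.array([[2.0, 0.3], [0.3, 1.0]])    # inertia matrix D_q(q)
C = np.array([[0.0, -0.1], [0.1, 0.0]])   # Coriolis/centrifugal matrix C_q(q, q_dot)
G = np.array([9.8, 4.9])                  # gravity vector G_q(q)
tau = np.array([0.05, -0.02])             # uncertainty and interference tau(q_dot, t)
q_dot = np.array([0.5, -0.2])             # joint velocity
u_a = np.array([12.0, 5.0])               # control input

q_ddot = joint_acceleration(D, C, G, tau, q_dot, u_a)
```

Substituting `q_ddot` back into the left-hand side of the model reproduces `u_a`, which is a quick consistency check for any implementation of the matrices.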

[0031] 2) Establishing a state space expression and an error definition when taking into consideration both the drive failure and actuator saturation of the joint robot system:

$u_a(t) = \rho(t)[\Gamma(0) + L(\xi)\nu + \varepsilon(\nu)] + \varepsilon(t) = \rho(t)L(\xi)\nu + [\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)]$

$e = x_1 - q^*$

$\ddot{e} = \ddot{x}_1 - \ddot{q}^* = D_q^{-1}(q)\rho(t)L(\xi)\nu + D_q^{-1}(q)[\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)] + F(\cdot) + Q(x_1,t) - \ddot{q}^*$

[0032] In the above formulas, $u_a(t)$ represents a system control input signal considering both drive failure and actuator saturation; $\Gamma(0) + L(\xi)\nu + \varepsilon(\nu)$ represents a control signal in the case of actuator saturation, wherein $\nu$ represents an actual controller design quantity of the system, $\Gamma(0) + L(\xi)\nu$ represents a smooth function obtained by applying the mean value theorem with respect to $\nu$, $\Gamma(0)$ is a bounded matrix, $L(\xi)$ is a positive definite matrix, and $\varepsilon(\nu)$ is a bounded approximation error and represents an uncertain factor of the controller; $\rho(t)$ represents a health coefficient of the driver, and $\varepsilon(t)$ represents an interference factor of the driver; $e$ (or $e(\cdot)$) represents a dynamic error of the system ($e(\cdot)$ is written as $e$ for simplification in subsequent derivation), and $\ddot{e}$ represents the second derivative of the dynamic error, wherein $x_1 = q$ represents a motion trajectory of the joint robot, $\ddot{x}_1$ represents an acceleration of the joint robot motion, $q^*$ represents a given joint tracking trajectory, and $\ddot{q}^*$ represents an acceleration of the given joint tracking trajectory; $F(\cdot) = D_q^{-1}(q)(C_q(q,\dot{q})\dot{q} + G_q(q))$, and $Q(x_1,t) = D_q^{-1}(q)\tau(\dot{q},t)$. The nonlinearity and uncertainty factors of the system may be bounded from above by the product of a constant and a scalar real-valued function, which yields the robust adaptive control scheme; alternatively, they may be reconstructed by a neural network based on radial basis functions, which yields the neural network adaptive control scheme.
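The actuator model of step 2) can be sketched as follows. The disclosure does not specify the saturation limits or the form of the smooth approximation, so the tanh-based function, the limits `u_min` and `u_max`, and the assumption that the health coefficient lies in (0, 1] are illustrative choices only.

```python
import numpy as np

def hard_saturation(v, u_min, u_max):
    """Asymmetric, non-smooth saturation: clip v to [u_min, u_max]."""
    return np.clip(v, u_min, u_max)

def smooth_saturation(v, u_min, u_max):
    """Smooth stand-in for the saturation (the tanh form is an assumption).

    Requires u_min < 0 < u_max; each side approaches its own limit.
    """
    v = np.asarray(v, dtype=float)
    limit = np.where(v >= 0.0, u_max, -u_min)
    return limit * np.tanh(v / limit)

def actuator_output(v, rho, eps, u_min, u_max):
    """u_a = rho * sat(v) + eps, with health coefficient rho (assumed in (0, 1])."""
    return rho * hard_saturation(v, u_min, u_max) + eps

# Hypothetical commanded values and limits.
v = np.array([-10.0, 0.5, 10.0])
u_a = actuator_output(v, rho=0.7, eps=0.05, u_min=-3.0, u_max=5.0)

# Difference between the true saturation and its smooth approximation;
# this plays the role of the bounded approximation error.
approx_error = hard_saturation(v, -3.0, 5.0) - smooth_saturation(v, -3.0, 5.0)
```

The difference between the two saturation functions stays bounded for all inputs, which is the property the decomposition into a smooth part plus a bounded error relies on.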

[0033] 3) Designing a PID controller and updating algorithms of the joint robot system:

[0034] The PID controller ν is expressed as

[00007] $\nu = -(k_{D0} + \Delta k_D(t))\left(2\gamma e(\cdot) + \gamma^2 \int_0^t e(\cdot)\,d\tau + \frac{de(\cdot)}{dt}\right)$

wherein $\gamma$ is a design parameter chosen freely by the designer, and $k_{D0}$ is a constant chosen by the designer;
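Expanding the product in the control law shows why it has a PID structure: with $k_D = k_{D0} + \Delta k_D(t)$, the proportional, integral, and derivative gains are $2\gamma k_D$, $\gamma^2 k_D$, and $k_D$. The sketch below illustrates this; the function names and numerical values are hypothetical.

```python
import numpy as np

def generalized_error(e, e_int, e_dot, gamma):
    """E = 2*gamma*e + gamma^2 * integral(e) + de/dt."""
    return 2.0 * gamma * e + gamma**2 * e_int + e_dot

def pid_control(e, e_int, e_dot, gamma, k_d0, delta_k_d):
    """nu = -(k_D0 + delta_k_D) * E."""
    return -(k_d0 + delta_k_d) * generalized_error(e, e_int, e_dot, gamma)

def equivalent_pid_gains(gamma, k_d0, delta_k_d):
    """Return (k_P, k_I, k_D) obtained by expanding the control law."""
    k_d = k_d0 + delta_k_d
    return 2.0 * gamma * k_d, gamma**2 * k_d, k_d

# Hypothetical error signals for a two-joint robot.
e = np.array([0.1, -0.2])        # tracking error
e_int = np.array([0.01, 0.02])   # time integral of the tracking error
e_dot = np.array([0.0, 0.5])     # time derivative of the tracking error

nu = pid_control(e, e_int, e_dot, gamma=2.0, k_d0=5.0, delta_k_d=1.0)
k_p, k_i, k_d = equivalent_pid_gains(gamma=2.0, k_d0=5.0, delta_k_d=1.0)
```

Because all three gains scale with the single quantity $k_{D0} + \Delta k_D(t)$, only one adaptive parameter needs to be updated online.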

[0035] Wherein the updating algorithms consist of two algorithms as follows:

[0036] (1) Algorithm based on the robust adaptive control:

[0037] The robust adaptive algorithm is designed for automatically updating the controller parameters at an updating rate of:

[00008] $\Delta k_D(t) = \hat{c}\,\varphi_0^2(\cdot), \qquad \dot{\hat{c}} = -\sigma_0 \hat{c} + \sigma_1 \varphi_0^2(\cdot)\|E\|^2, \quad \hat{c}(0) \geq 0$

wherein $\sigma_0$ and $\sigma_1$ are positive constants chosen by the designer;

[00009] $c = \max\{a_1, \tfrac{1}{2}\gamma_d\}, \qquad \varphi_0(\cdot) = \varphi_1(\cdot) + \|\dot{q}\|\,\|E\|,$

wherein $\hat{c}$ is an estimated value of $c$; $a_1 = \max\{\gamma_d a_f, \gamma_d\gamma^2, 2\gamma_d\gamma, \gamma_d\bar{x}_2\}$, $\varphi_1(\cdot) = \varphi_f(\cdot) + \|e\| + \|\dot{e}\| + 1$, wherein $a_f\varphi_f(\cdot)$ is a product of the constant $a_f$ and the scalar function $\varphi_f(\cdot)$, representing the upper bound of the system uncertainty factor $D_q^{-1}(q)[\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)] + F(\cdot) + Q(x_1,t) - \dot{q}^*$, $\bar{x}_2$ is the upper bound of the second derivative $\ddot{q}^*$ of a given joint motion trajectory, $\gamma_d$ is the upper bound of the system parameter $D_q(q)$, and it is set that

[00010] $E = 2\gamma e(\cdot) + \gamma^2 \int_0^t e(\cdot)\,d\tau + \frac{de(\cdot)}{dt};$
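In a digital implementation, the robust adaptive update law has to be discretized. The sketch below uses a forward-Euler step, which is an assumption of this example rather than part of the disclosure; the function name, the step size `dt`, and the way the scalar `phi_f` is supplied are likewise hypothetical.

```python
import numpy as np

def robust_adaptive_step(c_hat, E, e, e_dot, q_dot, phi_f, sigma0, sigma1, dt):
    """One forward-Euler step of the robust adaptive update law.

    Returns the updated estimate c_hat and the gain increment delta_k_D
    computed from the updated estimate.
    """
    E_norm = np.linalg.norm(E)
    # phi_1 = phi_f + ||e|| + ||e_dot|| + 1
    phi1 = phi_f + np.linalg.norm(e) + np.linalg.norm(e_dot) + 1.0
    # phi_0 = phi_1 + ||q_dot|| * ||E||
    phi0 = phi1 + np.linalg.norm(q_dot) * E_norm
    # c_hat_dot = -sigma0 * c_hat + sigma1 * phi_0^2 * ||E||^2
    c_hat_dot = -sigma0 * c_hat + sigma1 * phi0**2 * E_norm**2
    c_hat_next = c_hat + dt * c_hat_dot
    delta_k_d = c_hat_next * phi0**2
    return c_hat_next, delta_k_d
```

With this discretization, an estimate that starts non-negative stays non-negative as long as `sigma0 * dt <= 1`, since the forcing term is never negative. When the generalized error is zero, the leakage term `-sigma0 * c_hat` lets the estimate decay instead of growing without bound.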

[0038] (2) Algorithm based on the neural adaptive control:

[0039] The neural adaptive algorithm is designed for automatically updating the controller parameters at an updating rate of:

[00011] $\dot{\hat{b}} = -\theta_0 \hat{b} + \theta_1 \psi^2(\cdot)\|E\|^2, \quad \hat{b}(0) \geq 0, \qquad \Delta k_D(t) = \hat{b}\,\psi^2(\cdot)$

wherein $\theta_0$ and $\theta_1$ are positive constants chosen by the designer; $\psi(\cdot) = \|S(\cdot)\| + 1$, wherein $S(\cdot)$ is a basis function of a neural network, and $S(\cdot)$ and the number of neurons are chosen by the designer; $b = \max\{\|W^T\|, m\}$, wherein $\hat{b}$ is an estimated value of $b$, $W^T$ is an ideal unknown weight, and $m$ is the upper limit of a reconstruction error $\|\eta(\cdot)\|$ of the model;

[00012] $E = 2\gamma e(\cdot) + \gamma^2 \int_0^t e(\cdot)\,d\tau + \frac{de(\cdot)}{dt};$
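The neural adaptive update can be sketched in the same way. The disclosure leaves the basis function and the number of neurons to the designer, so the Gaussian radial basis functions, the network input `z`, the centers, the width, and the forward-Euler discretization used here are all assumptions of this example.

```python
import numpy as np

def rbf_basis(z, centers, width):
    """Gaussian radial basis vector S(z), one entry per neuron (assumed form)."""
    diff = centers - z  # shape: (number of neurons, input dimension)
    return np.exp(-np.sum(diff**2, axis=1) / (2.0 * width**2))

def neural_adaptive_step(b_hat, E, z, centers, width, theta0, theta1, dt):
    """One forward-Euler step of the neural adaptive update law.

    Returns the updated estimate b_hat and the gain increment delta_k_D
    computed from the updated estimate.
    """
    # psi = ||S|| + 1
    psi = np.linalg.norm(rbf_basis(z, centers, width)) + 1.0
    # b_hat_dot = -theta0 * b_hat + theta1 * psi^2 * ||E||^2
    b_hat_dot = -theta0 * b_hat + theta1 * psi**2 * np.linalg.norm(E)**2
    b_hat_next = b_hat + dt * b_hat_dot
    return b_hat_next, b_hat_next * psi**2

# Hypothetical network with two neurons and a two-dimensional input.
centers = np.array([[0.0, 0.0], [1.0, 1.0]])
z = np.zeros(2)
S = rbf_basis(z, centers, width=1.0)
```

Only the scalar estimate is adapted; the weight vector itself is never estimated, which keeps the update inexpensive regardless of the number of neurons.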

[0040] 4) Using the PID controller and the updating algorithms designed in step 3) for the joint robot system to control the trajectory motion of the joint robot.

[0041] A detailed description will be provided below for the derivation processes of the PID controller and the updating algorithms designed in this embodiment.

[0042] A generalized error $E$ is introduced to simplify the stability analysis of the PID controller, so we have

[00013] $E = 2\gamma e(\cdot) + \gamma^2 \int_0^t e(\cdot)\,d\tau + \frac{de(\cdot)}{dt}$

$D_q(q)\dot{E} = \rho(t)L(\xi)\nu + [\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)] + D_q(q)F(\cdot) + D_q(q)Q(x_1,t) + D_q(q)(\gamma^2 e + 2\gamma\dot{e} - \ddot{q}^*) = J(x_1,t)\nu + I(x_1,t)$

wherein $J(x_1,t) = \rho(t)L(\xi)$ and

[0043] $I(x_1,t) = [\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)] + D_q(q)F(\cdot) + D_q(q)Q(x_1,t) + D_q(q)(\gamma^2 e + 2\gamma\dot{e} - \ddot{q}^*)$
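The intermediate steps behind this expression can be spelled out as follows. This is a sketch that only differentiates the definition of $E$ and substitutes the expression for $\ddot{e}$ from step 2), with $\varepsilon(t)$ denoting the driver interference term defined there.

```latex
\begin{aligned}
\dot{E} &= 2\gamma\dot{e} + \gamma^{2} e + \ddot{e}, \\
D_q(q)\dot{E} &= D_q(q)\ddot{e} + D_q(q)\left(\gamma^{2} e + 2\gamma\dot{e}\right) \\
&= \rho(t)L(\xi)\nu + \left[\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)\right]
   + D_q(q)F(\cdot) + D_q(q)Q(x_1,t) \\
&\quad + D_q(q)\left(\gamma^{2} e + 2\gamma\dot{e} - \ddot{q}^{*}\right).
\end{aligned}
```

The only term that contains the design quantity $\nu$ is $\rho(t)L(\xi)\nu$, which is why it is separated out as $J(x_1,t)\nu$ while everything else is lumped into $I(x_1,t)$.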

[0044] To simplify the control design and stability analysis, the following function is defined, where $\dot{D}_q$ denotes the time derivative of the inertia matrix:

$\Psi(\cdot) = I(x_1,t) + \tfrac{1}{2}\dot{D}_q E$

[0045] (1) Algorithm based on the robust adaptive control:

[0046] The nonlinearity and uncertainty factors of the system may be bounded from above by the product of a constant and a scalar real-valued function:

$\|I(x_1,t)\| \leq \gamma_d a_f \varphi_f(\cdot) + \gamma_d\gamma^2\|e\| + 2\gamma_d\gamma\|\dot{e}\| + \gamma_d\|\ddot{q}^*\| \leq a_1\varphi_1(\cdot)$

wherein $\gamma_d$ is the upper bound of the system parameter $D_q$, $a_f\varphi_f(\cdot)$ is the upper bound of the system uncertainty factor $D_q^{-1}[\rho(t)\Gamma(0) + \rho(t)\varepsilon(\nu) + \varepsilon(t)] + F(\cdot) + Q(x_1,t) - \dot{q}^*$, and

[00014] $a_1 = \max\{\gamma_d a_f, \gamma_d\gamma^2, 2\gamma_d\gamma, \gamma_d\bar{x}_2\}, \qquad \varphi_1(\cdot) = \varphi_f(\cdot) + \|e\| + \|\dot{e}\| + 1$

[0047] so that $\|\Psi(\cdot)\| \leq a_1\varphi_1(\cdot) + \tfrac{1}{2}\gamma_d\|\dot{q}\|\,\|E\| \leq c\,\varphi_0(\cdot)$

wherein

[0048] [00015] $c = \max\{a_1, \tfrac{1}{2}\gamma_d\}, \qquad \varphi_0(\cdot) = \varphi_1(\cdot) + \|\dot{q}\|\,\|E\|$

[0049] Therefore, the robust adaptive algorithm is designed for automatically updating the controller parameters at an updating rate of:

[00016] $\Delta k_D(t) = \hat{c}\,\varphi_0^2(\cdot), \qquad \dot{\hat{c}} = -\sigma_0 \hat{c} + \sigma_1 \varphi_0^2(\cdot)\|E\|^2, \quad \hat{c}(0) \geq 0$

wherein $\sigma_0$ and $\sigma_1$ are positive constants chosen by the designer, and $\tilde{c} = c - \hat{c}$ is the estimation error of $c$.

[0050] Based on the above controller design and the selected update rate, the Lyapunov function

[00017] $V = \frac{1}{2} E^T D_q E + \frac{1}{2\sigma_1}\tilde{c}^2$

is used to verify and analyze the designed controller. It can be proved that, under the designed controller, all signals in the joint robot system eventually converge to a bounded region, thus ensuring that the tracking error of the system is globally uniformly bounded.
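When simulating the controller, it can be helpful to log this Lyapunov function as a diagnostic. The following minimal sketch evaluates it for given signals; since the true constant `c` is unknown in practice, passing a value for it is only possible in simulation, and the function name and numbers are hypothetical.

```python
import numpy as np

def lyapunov_value(E, D_q, c, c_hat, sigma1):
    """V = 0.5 * E^T D_q E + (c - c_hat)^2 / (2 * sigma1)."""
    c_tilde = c - c_hat
    return 0.5 * E @ D_q @ E + c_tilde**2 / (2.0 * sigma1)

# Hypothetical values: quadratic term 5.0 plus estimation-error term 1.0.
V = lyapunov_value(E=np.array([1.0, 2.0]), D_q=2.0 * np.eye(2),
                   c=3.0, c_hat=1.0, sigma1=2.0)
```

The first term measures the size of the generalized error weighted by the inertia matrix, and the second term measures how far the adaptive estimate is from the constant it approximates.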

[0051] (2) Algorithm based on the neural adaptive control:

[0052] The uncertainty function $\Psi(\cdot)$ defined above is reconstructed by means of neural network adaptive approximation, wherein it is set that

$\Psi(\cdot) = W^T S(\cdot) + \eta(\cdot)$

[0053] wherein the basis function $S(\cdot)$ and the number of neurons of the neural network are chosen by the designer, so that

[00018] $\|\Psi(\cdot)\| \leq \|W^T\|\,\|S(\cdot)\| + \|\eta(\cdot)\| \leq \|W^T\|\,\|S(\cdot)\| + m \leq b\,\psi(\cdot)$

wherein

[0054] $\psi(\cdot) = \|S(\cdot)\| + 1$

$b = \max\{\|W^T\|, m\}$

[0055] Here $\|\eta(\cdot)\| \leq m$ and $\|W^T\| \leq b$. Taking the time-varying nature of the system parameters and the unknown weight of the system into consideration, the estimate $\hat{b}$ of the parameter $b$ is used for design and system analysis, so that the update rate is designed as:

[00019] $\dot{\hat{b}} = -\theta_0 \hat{b} + \theta_1 \psi^2(\cdot)\|E\|^2, \quad \hat{b}(0) \geq 0, \qquad \Delta k_D(t) = \hat{b}\,\psi^2(\cdot)$

wherein $\theta_0$ and $\theta_1$ are positive constants chosen by the designer, and $\tilde{b} = b - \hat{b}$ is the estimation error of $b$.

[0056] Based on the above controller design and the selected update rate, the Lyapunov function

[00020] $V = \frac{1}{2} E^T D_q E + \frac{1}{2\theta_1}\tilde{b}^2$

is used to verify and analyze the designed controller. It can be proved that, under the designed controller, all signals in the system eventually converge to a bounded region, thus ensuring that the tracking error of the system is globally uniformly bounded.

[0057] The neural network adaptive tracking control method for joint robots provided in this embodiment may ensure that the system tracks the ideal trajectory with a bounded tracking error in the case of drive failure and drive saturation. Compared with traditional PID controllers, this controller is relatively simple in structure and may better handle the drive saturation and coupling effect in the joint system, the parametric and non-parametric uncertainty, and actuator failure during system operation. In addition, this controller may compensate for non-vanishing interference, thereby reducing the complexity of the control algorithm relative to the prior art.

[0058] Finally, it is noted that the above embodiments are only intended to illustrate the technical scheme of the present disclosure and not to limit it. Although the present disclosure has been described in detail with reference to preferred embodiments, those of ordinary skill in the art should understand that the technical schemes of the present disclosure can be modified or equivalently replaced without departing from the purpose and scope of the technical schemes thereof, and such modifications and replacements should be included in the scope of the claims of the present disclosure.

CPC classification

G05B2219/39271 (PHYSICS)
B25J9/161 (PERFORMING OPERATIONS; TRANSPORTING)
G05B6/02 (PHYSICS)
G05B2219/42042 (PHYSICS)
G05B13/027 (PHYSICS)
B25J9/163 (PERFORMING OPERATIONS; TRANSPORTING)

International classification

B25J9/16 (PERFORMING OPERATIONS; TRANSPORTING)
G05B13/02 (PHYSICS)
G05B6/02 (PHYSICS)