Train automatic stopping control with quantized throttle and braking
10093331 · 2018-10-09
Assignee
Inventors
Cpc classification
B61L27/04
PERFORMING OPERATIONS; TRANSPORTING
B61L25/025
PERFORMING OPERATIONS; TRANSPORTING
B61L27/16
PERFORMING OPERATIONS; TRANSPORTING
B61L25/021
PERFORMING OPERATIONS; TRANSPORTING
B61L15/0027
PERFORMING OPERATIONS; TRANSPORTING
B61L15/0058
PERFORMING OPERATIONS; TRANSPORTING
International classification
B60W50/06
PERFORMING OPERATIONS; TRANSPORTING
B61L3/00
PERFORMING OPERATIONS; TRANSPORTING
B61L27/00
PERFORMING OPERATIONS; TRANSPORTING
B61L15/00
PERFORMING OPERATIONS; TRANSPORTING
G05D1/00
PHYSICS
B61L25/02
PERFORMING OPERATIONS; TRANSPORTING
Abstract
Methods and systems for controlling a train movement to a stop at a stopping position between a first position and a second position. Determining constraints of a velocity of the train with respect to a train position forming a feasible region (FR) for a state of the train during the movement, such that a lower curve bounding the FR has a zero velocity only at the first position, and an upper curve bounding the FR has a zero velocity only at the second position. Determining a control invariant subset (CIS) of the FR, wherein for each state within the CIS there is at least one control action having a value selected from a finite set of values that maintains the state of the train within the CIS. Controlling train movement subject to constraints by selecting a control action maintaining the state of the train within the CIS of the FR.
Claims
1. A method for controlling a movement of a train to a stop at a stopping position between a first position and a second position, comprising: determining constraints of a velocity of the train with respect to a position of the train forming a feasible region for a state of the train during the movement, such that a lower curve bounding the feasible region has a zero velocity only at the first position, and an upper curve bounding the feasible region has a zero velocity only at the second position; determining a control invariant subset of the feasible region, wherein for each state within the control invariant subset there is at least one control action having a value selected from a finite set of values that maintains the state of the train within the control invariant subset; and controlling the movement of the train subject to the constraints by selecting a control action maintaining the state of the train within the control invariant subset of the feasible region, wherein the steps of the method are performed by a processor.
2. The method of claim 1, further comprising: determining iteratively the control invariant subset using a backward-reachable region computation starting from the feasible region, wherein each iteration comprises: shrinking the feasible region with a quantization error defined by the finite set of values and a quantization rule of the finite set of values to produce a shrunk feasible region; and determining the backward-reachable region, such that for each state within the backward-reachable region there is at least one control action moving the state of the train within the feasible region for all parameters from the set of possible parameters of the train; and replacing the feasible region with the backward-reachable region, wherein the iterations are performed until a termination condition is met.
3. The method of claim 2, wherein the constraints are linear inequalities and train dynamics are represented as a set of linear models subject to additive disturbances wherein, the backward-reachable region computation uses the train dynamics and includes: determining a worst case effect of an additive disturbance; and determining the backward-reachable region as an intersection for backward reachable regions of the linear models in the set.
4. The method of claim 3, wherein the additive disturbance includes the quantization error.
5. The method of claim 4, wherein the shrinking comprises: determining a set of planes approximating a boundary surface of the feasible region; determining a direction normal to each plane to produce a set of directions; determining a worst case quantization error for each direction; and moving a plane inward the feasible region into the direction normal to the plane by a distance equals the worst case quantization error determined for the direction.
6. The method of claim 5, wherein the worst case quantization error for each direction is determined using a linear program.
7. The method of claim 5, further comprising: determining the quantization rule reducing the worst case quantization error.
8. The method of claim 3, wherein the linear models and the additive disturbance are such that the state of the train, control inputs, and the train dynamics are within a convex combination of the linear models and values of additive disturbance for any values of parameters of the train.
9. The method of claim 1, wherein the constraints are linear inequalities, such that the train dynamics are represented as a set of linear models subject to additive disturbances, and wherein optimizing is obtained by a constrained quadratic programming.
10. A method for controlling a movement of a train to a stop, at a stopping position between a first position and a second position over a finite horizon of time, comprising: determining constraints of a velocity of the train with respect to a position of the train forming a feasible region for a state of the train during the movement, such that a lower curve bounding the feasible region has a zero velocity only at the first position, and an upper curve bounding the feasible region has a zero velocity only at the second position; determining a control invariant subset of the feasible region, wherein for each state within the control invariant subset there is at least one control action having a value selected from a finite set of values that maintains the state of the train within the control invariant subset; and controlling the movement of the train subject to the constraints by selecting a control action maintaining the state of the train within the control invariant subset of the feasible region over the finite horizon of time, wherein the steps of the method are performed by a processor.
11. The method of claim 10, wherein the selection of the control action is repeated for each time step within the finite horizon of time on the basis of an optimizing the state of the train within the feasible region, wherein the optimizing includes a cost function representing movement of the train subject to the constraints defined by that control invariant subset of the feasible region, as compared with the optimization within the feasible region itself, so the train stops at the second position.
12. The method of claim 11, wherein the cost function includes a combination of an energy consumption of the train during the finite horizon of time, the finite horizon of time, both the energy consumption and the finite horizon of time, the energy consumption for a predetermined time for the finite horizon of time, or a smoothness of a stopping trajectory stopping at the second position.
13. A system for controlling a movement of a train to a stop at a stopping position between a first position and a second position, comprising: a set of sensors that monitor and collect data relating to operation of the train; a memory having stored therein train data; at least one processor, coupled to the memory, and instructions stored therein, for execution by the at least one processor to: determine constraints of a velocity of the train with respect to a position of the train forming a feasible region for a state of the train during the movement, such that a lower curve bounding the feasible region has a zero velocity only at the first position, and an upper curve bounding the feasible region has a zero velocity only at the second position; determine a control invariant subset of the feasible region, wherein for each state within the control invariant subset there is at least one control action having a value selected from a finite set of values that maintains the state of the train within the control invariant subset; and control the movement of the train subject to the constraints by selecting a control action maintaining the state of the train within the control invariant subset of the feasible region.
14. The system of claim 13, wherein the at least one processor is further configured to: determine iteratively the control invariant subset using a backward-reachable region computation starting from the feasible region, wherein each iteration comprises: shrink the feasible region with a quantization error defined by the finite set of values and a quantization rule of the finite set of values to produce a shrunk feasible region; and determine the backward-reachable region, such that for each state within the backward-reachable region there is at least one control action moving the state of the train within the feasible region for all parameters from the set of possible parameters of the train; and replace the feasible region with the backward-reachable region, wherein the iterations are performed until a termination condition is met.
15. The system of claim 13, wherein the constraints are linear inequalities and train dynamics are represented as a set of linear models subject to additive disturbances, such that the at least one processor is configured to compute the backward-reachable region using the train dynamics that includes: determining a worst case effect of an additive disturbance; and determining the backward-reachable region as an intersection for backward reachable regions of the linear models in the set.
16. The system of claim 13, wherein the additive disturbance includes the quantization error.
17. The system of claim 14, wherein the at least one processor is configured to shrink the feasible region that includes shrinking by: determining a set of planes approximating a boundary surface of the feasible region; determining a direction normal to each plane to produce a set of directions; determining a worst case quantization error for each direction; and moving a plane inward the feasible region into the direction normal to the plane by a distance equals the worst case quantization error determined for the direction.
18. The system of claim 17, wherein the at least one processor is configured to determine the worst case quantization error for each direction by using a linear program.
19. The system of claim 17, wherein the at least one processor is configured to determine the quantization rule by reducing the worst case quantization error.
20. The system of claim 13, wherein the memory has stored therein train data that includes historical data including states of the train and current states of the train.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The presently disclosed embodiments can be further explained with reference to the attached drawings. The drawings shown are not necessarily to scale, with emphasis instead generally being placed upon illustrating the principles of the presently disclosed embodiments.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23) While the above-identified drawings set forth presently disclosed embodiments, other embodiments are also contemplated, as noted in the discussion. This disclosure presents illustrative embodiments by way of representation and not limitation. Numerous other modifications and embodiments can be devised by those skilled in the art which fall within the scope and spirit of the principles of the presently disclosed embodiments.
DETAILED DESCRIPTION
(24) The following description provides exemplary embodiments only, and is not intended to limit the scope, applicability, or configuration of the disclosure. Rather, the following description of the exemplary embodiments will provide those skilled in the art with an enabling description for implementing one or more exemplary embodiments. Contemplated are various changes that may be made in the function and arrangement of elements without departing from the spirit and scope of the subject matter disclosed as set forth in the appended claims.
(25) Specific details are given in the following description to provide a thorough understanding of the embodiments. However, one of ordinary skill in the art will understand that the embodiments may be practiced without these specific details. For example, systems, processes, and other elements in the subject matter disclosed may be shown as components in block diagram form in order not to obscure the embodiments in unnecessary detail. In other instances, well-known processes, structures, and techniques may be shown without unnecessary detail in order to avoid obscuring the embodiments. Further, like reference numbers and designations in the various drawings indicate like elements.
(26) Also, individual embodiments may be described as a process which is depicted as a flowchart, a flow diagram, a data flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process may be terminated when its operations are completed, but may have additional steps not discussed or included in a figure. Furthermore, not all operations in any particularly described process may occur in all embodiments. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. When a process corresponds to a function, the function's termination can correspond to a return of the function to the calling function or the main function.
(27) Furthermore, embodiments of the subject matter disclosed may be implemented, at least in part, either manually or automatically. Manual or automatic implementations may be executed, or at least assisted, through the use of machines, hardware, software, firmware, middleware, microcode, hardware description languages, or any combination thereof. When implemented in software, firmware, middleware or microcode, the program code or code segments to perform the necessary tasks may be stored in a machine readable medium. A processor(s) may perform the necessary tasks.
(28)
(29) A control system 150 controls the movement of the train 119 traveling towards a station 2 (
(30) The current position d 10 of the train can be determined as the distance of a specific point 8 of the train, such as the center of the first door 9, from the origin 5 of the reference system, where d is negative when the train is at a position before the origin with respect to the normal direction of movement of the train. The velocity 11 of the train 119 is v, where v is positive when the train is moving in its normal direction of the movement.
(31) A control system 150 of the train 119 can include one or a combination of a constraints generating unit 1, a control invariant subset generating unit 30, a train control device 52, and a control computer 17. In some embodiments, the constraints generating unit 1 determines stopping constraints 111 of a velocity of the train with respect to a position of the train forming a feasible area for a state of the train during the movement leading the train to the stop, and the control computer 17 controls the movement of the train subject to the constraints. The control can be achieved, e.g., by generating a control input 117 to the train control device 52 controlling 115 the brake system of the train 119. The control system 150 can be in communication with a control center 141, wherein input 142 from the control center is provided to the control system. The control center can provide additional station dependent information to the control system such as the width of the desired stopping range, the minimum and maximum approach velocity, and the local weather conditions that can affect rail friction.
(32) In various embodiments, the stopping constraints are determined without having a predetermined run-curve, or conventional velocity profile, leading the train from the current position to the stopping position. For example, if a distance along the route is denoted by z, then a desired velocity v(z) at position z describes the run curve, or conventional velocity profile. The conventional velocity profile has to obey legal and mechanical constraints of the route, e.g., speed limits and safety margins, and must be physically realizable by mechanisms of the train. In effect, the special stopping constraints of the present disclosure allow control of the movement of the train without generating conventional velocity profiles/patterns, which are prone to errors.
(33) Accordingly, some embodiments of the present disclosure transform the tracking problem into an optimization problem subject to these constraints. Such transformation is advantageous, because the constrained control can guarantee that the constraints are always satisfied, among other things.
(34) For example, some embodiments determine, for each time step of control, a control action moving the train from a current position to a next position within the feasible region. In those embodiments, the controlling includes determining a sequence of control inputs forming an ad-hoc run-curve leading the train from the current position to the stopping position. Such ad-hoc run-curve determination is advantageous because it eliminates efforts needed to generate and test predetermined run-curves. Also, reformulating the stopping into a constrained problem allows handling the stopping constraints with other constraints on the movement of the trains, such as constraints on traction and braking force range, actuator rate, and/or maximum and minimum speed of the train.
(35) However, due to the nature of optimization-based receding horizon control, the existence of a solution for a certain horizon does not by itself guarantee the existence of the solution for a subsequent horizon. This is exacerbated by quantization errors, which cause the implemented braking action to deviate from the requested braking action. Thus, some embodiments also include the control invariant subset generating unit 30 for selecting a control invariant subset 113 from the feasible region defined by the stopping constraints. These embodiments are based on yet another realization that it is possible to select a subset of the feasible region, such that for any state of the train within the subset, any possible variations in the parameters of the movement of the train, and any quantization error, there is a control from the finite set of values maintaining the state of the train within the subset, as noted above.
(36) For example, some embodiments design a controller that selects the braking system action to maintain the state of the train within the feasible region by repeatedly solving an optimization problem. Accordingly, if a cost function representing the movement of the train is optimized subject to constraints defined by that special control invariant subset of the feasible region, as contrasted with the optimization within the feasible region itself, there is a guarantee that the train stops within the predetermined stopping range. For example, in various embodiments, the cost function represents a combination of the energy consumption of the train during the trip, a time of the trip, both the energy consumption and the time of the trip, the energy consumption for a predetermined time of the trip, or the smoothness of the stopping trajectory. The optimization problem can directly select braking actions from the finite set of values, in which case the optimization problem is a mixed-integer problem. In other embodiments, the control is selected by solving a convex optimization problem and applying a quantization rule which chooses a braking action from the finite set of values.
(37) Soft Landing Constraints
(38) For example, to stop the train at the stopping position within the stopping range, it is sufficient for the train distance from target d, and velocity v, to satisfy at any time instant soft landing constraints
$v(t) \le \Gamma_{\max}(\epsilon_{\max} - d(t))$
$v(t) \ge \Gamma_{\min}(\epsilon_{\min} - d(t))$ (1)
wherein $\Gamma_{\max}(s)$, $\Gamma_{\min}(s)$ are the upper border function and the lower border function that are defined in the range $s \in (-\infty, c]$ where $c \ge \epsilon_{\max}$, are continuous, greater than 0 when their arguments are positive, smaller than 0 when their arguments are negative, and 0 when their arguments are 0. Furthermore, for any $s \in (-\infty, c]$, $\Gamma_{\max}(s) \ge \Gamma_{\min}(s)$ and $\Gamma_{\max}(c) = \Gamma_{\min}(c)$.
(39)
(40) Intuitively, if the feasible area 215 includes the current position of the train and the state of the train is controlled to be maintained within the feasible area 215, at some instant of time the state of the train is guaranteed to be on a segment 216 between the points 213 and 212, which corresponds to a zero velocity of the train at the predetermined stopping range.
(41) For example, when $d < \epsilon_{\min}$ the constraints (1) force the train velocity to be positive, so that the train moves towards the target. When the position is beyond the stopping range, $d > \epsilon_{\max}$, the constraints (1) force the train velocity to be negative, and hence the train backs up towards the target. Hence any trajectory of the train must include a point of zero velocity in the range of positions between $\epsilon_{\min}$ and $\epsilon_{\max}$, which means that the train stops in the desired stopping range.
(42)
(43) In such a manner, the embodiments provide for stopping a train at a position with an automatic control 240, but without the predetermined velocity profiles. This is because the constraints on the state of the movement of the train that guarantees the stopping of the train at the predetermined stopping range can be generated without the velocity profiles. For example, instead of generating multiple velocity profiles, only two constraints defining a lower and an upper curve of the feasible region can be determined. Also realized in this present disclosure is that the selection of the constraints affects the minimum and maximum arrival time of the train at the position, such that the time of arrival can be used as guidance for generating those constraints.
(44) For example, some embodiments determine a lower curve and an upper curve bounding a velocity of the train with respect to a position of the train, such that the upper curve has a zero velocity only at the farthest border of a stopping range, and the lower curve has a zero velocity only at the nearest border of the stopping range, and determine the feasible region for a state of the train using the lower and the upper curves and mechanical and/or legal constraints on the movement of the train. For example, in one embodiment the upper curve can be a first line with a first slope, and the lower curve can be a second line with a second slope. Usually, the first slope is greater than the second slope to enforce a sufficient size for the feasible region. This embodiment can reduce the selection of the constraints only to the values for the slopes of the first and the second lines.
(45) Also realized is that the selection of the constraints affects the minimum and maximum arrival time of the train at the stopping range, and the desired arrival time can be used in the selection of the two parameters. For example, one embodiment selects the value of the first slope based on a minimal stopping time, and selects the value of the second slope based on a maximal stopping time.
(46)
(47) Still referring to
(48) A human machine interface 209 within the computer system 200 can connect the system to a keyboard 210 and display device 211. The computer system 200 can be linked through the bus 206 to a display interface 217 adapted to connect the system 200 to a display device 218, wherein the display device 218 can include a computer monitor, camera, television, projector, or mobile device, among others.
(49) Still referring to
(50) Still referring to
(51) The computer system 200 may be connected to external sensors 231, the control center 241, other computers 242 and other controlling devices 244. For example, the train automatic stopping control can be connected to low level controllers such as traction controllers, train brake controllers, etc. For example, the train automatic stopping controller can connect to other computers such as the passenger information system to provide estimated arrival times, and the door controllers in order to ensure that the doors do not open until the train is fully stopped. The external sensors 231 may include sensors for speed, direction, air flow, distance to the station, weather conditions, track grade, etc. Contemplated is that the processor 251 of
(52)
(53) For example, the constraints can be written in a linear form according to
$v(t) \le \gamma_{\max}(\epsilon_{\max} - d(t))$
$v(t) \ge \gamma_{\min}(\epsilon_{\min} - d(t))$, (2)
wherein $\gamma_{\max}$, $\gamma_{\min}$ are two coefficients where $\gamma_{\min} > 0$, $\gamma_{\max} > \gamma_{\min}$. If the constraints in (2) are satisfied at all time instants, then the train stops between $\epsilon_{\min}$ and $\epsilon_{\max}$.
(54) A cone-shaped region 301 in the space of train positions 310 and train velocities 320 is referred to herein as a soft landing cone. The region 301 is delimited by two lines, each corresponding to one of the equations in (2), satisfied with equality. The upper border 302 of the soft landing cone is defined by $\gamma_{\max}$ and $\epsilon_{\max}$, where $\gamma_{\max}$ determines the slope 303 and $\epsilon_{\max}$ determines the intersect 304 of the upper border with the line of zero velocity. Similarly, the lower border 305 of the soft landing cone is defined by $\gamma_{\min}$ and $\epsilon_{\min}$, where $\gamma_{\min}$ determines the slope 306 and $\epsilon_{\min}$ determines the intersect 307 of the lower border with the line of zero velocity.
(55) If the train positions and velocities remain in the soft landing cone, the train stops in the stopping range. The parameters $\epsilon_{\max}$ and $\epsilon_{\min}$ define the desired stopping range, because the train stops in the area 308 between positions $\epsilon_{\min}$ and $\epsilon_{\max}$, including the stopping position 309 with d=0.
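The membership test implied by the linear constraints (2) can be illustrated with a short sketch. This is only an illustration under stated assumptions: the class name, field names, and numeric values are hypothetical and are not taken from the disclosure; the sign convention follows the text, with d negative before the stopping position and v positive in the normal direction of travel.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SoftLandingCone:
    """Linear soft landing cone; all names here are hypothetical."""
    gamma_min: float  # slope of the lower border
    gamma_max: float  # slope of the upper border
    eps_min: float    # nearest border of the stopping range
    eps_max: float    # farthest border of the stopping range

    def __post_init__(self):
        # The text requires gamma_min > 0 and gamma_max > gamma_min.
        if not (0.0 < self.gamma_min < self.gamma_max):
            raise ValueError("need 0 < gamma_min < gamma_max")
        if self.eps_min > self.eps_max:
            raise ValueError("need eps_min <= eps_max")

    def contains(self, d: float, v: float) -> bool:
        """Return True when (d, v) satisfies both inequalities of the cone."""
        upper = self.gamma_max * (self.eps_max - d)
        lower = self.gamma_min * (self.eps_min - d)
        return lower <= v <= upper


# Illustrative values only (assumed, not from the disclosure).
cone = SoftLandingCone(gamma_min=0.05, gamma_max=0.2, eps_min=-0.5, eps_max=0.5)
far_and_moving = cone.contains(-100.0, 10.0)   # well before the station
stopped_in_range = cone.contains(0.0, 0.0)     # stopped at the stopping position
stopped_too_early = cone.contains(-1.0, 0.0)   # zero velocity before the range
```

With these assumed values, a stopped train outside the stopping range violates one of the two borders, which is the property that forces the stop to occur inside the range.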
(56) In addition, some variations of this embodiment determine the parameters $\gamma_{\max}$ and $\gamma_{\min}$ using the desired timing to stop. For example, the embodiment can select the first slope 303 based on a minimal stopping time, and select the second slope 306 based on a maximal stopping time.
(57)
(58)
wherein the upper and lower bounds correspond to the time of the sequence of positions and velocities described by a line 402 for the upper bound and by a line 403 for the lower bound.
(59) Similarly, as shown in
(60)
(61)
(62) which corresponds to the sequence of positions and velocities described by a line 502 for the upper bound and a line 503 for the lower bound.
(63) Reducing the value of the parameter $\gamma_{\min}$ increases the maximum time to reach the stopping position. Increasing the value of the parameter $\gamma_{\max}$ decreases the minimum time to reach the stop. Also, taking $\gamma_{\max}$ and $\gamma_{\min}$ with closer values reduces the difference between the minimum and maximum time to stop, but it also reduces the area of the soft landing cone, which amounts to reducing the number of possible train trajectories in such a cone.
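The dependence of travel time on the slopes can be illustrated with a small calculation. The sketch below is not a reproduction of the time bounds of the disclosure; it assumes only that the train rides exactly on one border of the linear cone, v = gamma * (eps - d), and that the position evolves as the time derivative of d equal to v. Under those assumptions, eps - d(t) = (eps - d0) * exp(-gamma * t), so the time to travel from d0 to a position d1 with d0 <= d1 < eps is ln((eps - d0) / (eps - d1)) / gamma. The function name and the numeric values are hypothetical.

```python
import math


def time_along_border(d0: float, d1: float, gamma: float, eps: float) -> float:
    """Time to move from d0 to d1 while staying on the line v = gamma*(eps - d).

    Assumes d' = v, gamma > 0, and d0 <= d1 < eps. The border itself reaches
    d = eps only asymptotically, so d1 must be strictly below eps.
    """
    if gamma <= 0.0:
        raise ValueError("gamma must be positive")
    if not (d0 <= d1 < eps):
        raise ValueError("need d0 <= d1 < eps")
    return math.log((eps - d0) / (eps - d1)) / gamma


# Illustrative values only (assumed): start 100 m before the stopping position.
t_steep = time_along_border(-100.0, 0.0, gamma=0.2, eps=0.5)
t_shallow = time_along_border(-100.0, 0.0, gamma=0.1, eps=0.5)
# A steeper border is traversed faster than a shallower one.
steeper_is_faster = t_steep < t_shallow
```

For a fixed border intersect, halving the slope doubles the travel time along that border, which is consistent with the qualitative statement above that a larger upper slope shortens the minimum time and a smaller lower slope lengthens the maximum time.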
(64) Constrained Control
(65) Constrained control of the train that enforces the constraints in (1) guarantees that the train stops in the stopping range. However, the train position and velocity depend on the actual train dynamics generated by actuating the traction and braking system of the train. Thus, some embodiments of the present disclosure determine a control system to actuate the train traction and braking system so that the train dynamics satisfy the constraints in (1).
(66) The train dynamics can be described by
$\dot{x}(t) = f(x(t), q(t), p)$
$y(t) = h(x(t))$ (5)
where x is the train state, q is the train input, p are the train parameters, y=[d v] is the output vector, f describes the variation of the state as a function of the current state, current input and current parameters, and h describes the output as a function of the current state, only.
(67) The state and input variables in (5) are subject to the constraints
$x \in \mathcal{X}$ (6)
$q \in \mathcal{Q}$ (7)
where (6) defines a set of admissible values for the state variables, and (7) defines a finite set of admissible values for the input variables in (5).
(68) In one embodiment of the present disclosure, for a train provided with rolling stocks (wheels) the train dynamics (5) is described by an affine model obtained by considering a velocity-affine model for the resistance force to motion,
$F_{res}(t) = -c_0 \mu g - c_1 v(t)$ (8)
where $c_0$ is the coefficient of the constant term, which models rolling resistance, $c_1$ is the coefficient of the linear term, which models bearing friction and air resistance at low speeds, $\mu$ is the friction coefficient between the rails and the rolling stocks, and g is the gravity acceleration constant. In this embodiment the train dynamics is described by
(69)
where m is the train mass, r is the radius of the wheels, $k_a$ is the maximum force, and $\tau_a$ is the actuator time constant.
(70) The affine model of the train dynamics is
$\dot{x}(t) = A(p)x(t) + B(p)q(t) + B_w w(p)$ (10)
where the state is x=[d v x], the input q is the command to the force generating actuators from traction (when positive) and braking (when negative), w is the constant resistance term obtained from (9), and the matrices A(p), B(p) are also obtained from (9), where the vector of parameters p includes the train mass, the friction coefficient, the gravity acceleration constant, the maximum force, and the actuator time constant. In model (10)
(71)
(72) In other embodiments of the present disclosure, the disturbance w includes the quantization errors produced by replacing the control input q from the finite set of values $\mathcal{Q}$ with a continuous input u from the convex set $\mathrm{conv}(\mathcal{Q})$ and a quantization error $w = u - q$.
(73) The train control system selects the values for the train input function q that generate an admissible solution for
$\dot{x}(t) = f(x(t), q(t), p)$
$y(t) = h(x(t))$
$v(t) \le \Gamma_{\max}(\epsilon_{\max} - d(t))$
$v(t) \ge \Gamma_{\min}(\epsilon_{\min} - d(t))$
$x(t) \in \mathcal{X}$, $q(t) \in \mathcal{Q}$ (12)
where the set $\mathcal{X}$ describes admissible values for the state (e.g., maximum and minimum velocity, etc.), the set $\mathcal{Q}$ describes a finite set of admissible values for the input, and the solution is sought from current time T for all times in the future (i.e., $[T, t_f]$, where $t_f = \infty$).
(74) For instance, the constraint
$\dot{v} \le 0$
imposes that the train constantly decelerates, i.e., no increase in velocity is allowed, while its relaxed form
$\dot{v} \le \phi(d)$
where $\phi$ is a nonnegative, monotonically decreasing function while d<0, relaxes the previous constraint by allowing greater acceleration when the train is closer to the stopping position, to improve accuracy of the control.
(75) Some embodiments of the present disclosure optimize the movement of the train from the current state to subsequent states, and determine a solution to (12) by solving the constrained optimal control problem
$\min \; F(x(t_f)) + \int_{t_0}^{t_f} L(x(t), q(t))\,dt$
s.t. $\dot{x}(t) = f(x(t), q(t), p)$
$y(t) = h(x(t))$
$v(t) \le \Gamma_{\max}(\epsilon_{\max} - d(t))$
$v(t) \ge \Gamma_{\min}(\epsilon_{\min} - d(t))$
$x(t) \in \mathcal{X}$, $q(t) \in \mathcal{Q}$
$x(t_0) = x_0$ (13a)
(76) where $t_0$ is the initial time, $x_0$ is the state at the initial time, F is the terminal cost function and L is the stage cost function. If the problem in (13a) can be solved for final time $t_f = \infty$, then the stopping constraints are always satisfied and the train stops where required.
(77) However, the problem described in Equations (12) and (13a) requires the computation of an infinitely long sequence of control inputs q(t) for a system subject to an infinite number of constraints, which is difficult to solve in the train control system directly. Thus, some embodiments solve the problem described in Equations (12) and (13a) in a receding horizon fashion.
(78)
(79) The method selects and applies 604 a first control input from the sequence of control inputs specifying the control action for a next time step of control. For example, the finite horizon control input signal q is applied during the time interval [T, T+dh]. Then 605, at time t.sub.0+dh, where dh<h a new problem is solved with t.sub.0=T+dh, t.sub.f=T+dh+h and the newly computed input signal is applied, and the steps of the method are iteratively repeated.
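The receding horizon steps described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed controller: the double-integrator model, the input set `INPUTS`, the sampling period `DH`, the cost weights, and the helper names `step`, `plan`, and `receding_horizon` are all hypothetical. The finite-horizon problem is solved by brute-force enumeration over the finite input set, and only the first input of each plan is applied before re-planning.

```python
from itertools import product

DH = 1.0                      # sampling period dh in seconds (assumed value)
INPUTS = (-1.0, -0.5, 0.0)    # hypothetical finite set of accelerations, m/s^2

def step(state, q):
    """Toy double-integrator model; state = (d, v) with d < 0 before the target."""
    d, v = state
    return (d + v * DH + 0.5 * q * DH ** 2, v + q * DH)

def plan(state, horizon):
    """Enumerate every input sequence over the horizon; return the cheapest feasible one."""
    best_cost, best_seq = float("inf"), None
    for seq in product(INPUTS, repeat=horizon):
        x, cost, feasible = state, 0.0, True
        for q in seq:
            x = step(x, q)
            if x[1] < 0.0:            # constraint: the train never rolls backwards
                feasible = False
                break
            cost += 0.01 * q * q      # stage cost L (assumed)
        if feasible:
            cost += x[0] ** 2 + x[1] ** 2   # terminal cost F (assumed)
            if cost < best_cost:
                best_cost, best_seq = cost, seq
    return best_seq

def receding_horizon(state, horizon=4, steps=20):
    """Apply only the first input of each plan, then re-plan from the new state."""
    trajectory = [state]
    for _ in range(steps):
        seq = plan(state, horizon)
        if seq is None:               # no feasible plan from this state
            break
        state = step(state, seq[0])
        trajectory.append(state)
    return trajectory
```

Calling `receding_horizon((-8.0, 4.0))` returns the list of visited states. Enumeration is only practical for short horizons and small input sets, which is one reason the text goes on to consider a convex relaxation.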
(80) When the optimization problem (13a) directly chooses a control input $q \in \mathcal{Q}$ from the finite set of values, it is called a mixed-integer optimization problem. Mixed-integer optimization problems can be difficult to solve in the small time window $dh$ between updates to the control input $q$. Thus some embodiments of the present disclosure select an input $u \in \mathcal{U}$ from a bounded convex set $\mathcal{U} \supseteq \mathrm{conv}(\mathcal{Q})$ by solving a convex optimization problem
$\min_{u(\cdot)} \; F(x(t_f)) + \int_{t_0}^{t_f} L(x(t), u(t))\,dt$
s.t. $\dot{x}(t) = f(x(t), u(t) - w(t), p)$
$y(t) = h(x(t))$
$v(t) \le \gamma_{\max}(\epsilon_{\max} - d(t))$
$v(t) \ge \gamma_{\min}(\epsilon_{\min} - d(t))$
$x(t) \in \mathcal{X},\; u(t) \in \mathcal{U}$ for all $w(t) \in \mathcal{W}$
$x(t_0) = x_0$ (13b)
and then applying a quantization rule $q: \mathcal{U} \to \mathcal{Q}$ to obtain a feasible input $q(t) = q(u(t)) \in \mathcal{Q}$ in the finite set of values $\mathcal{Q}$. The difference $w(t) = u(t) - q(u(t))$ between the convex input $u(t)$ and the quantized input $q(t) = q(u(t)) \in \mathcal{Q}$ is called the quantization error. The set of possible quantization errors $w$ produced by the quantization rule $q$ can be bounded by a set $\mathcal{W}$ since $\mathcal{U} \supseteq \mathrm{conv}(\mathcal{Q})$ is bounded. The optimization problem (13b) is solved robustly, that is, in a manner such that any quantization error $w$ that satisfies the bounds $w \in \mathcal{W}$ will not cause a constraint violation. Thus quantizing the convex input, $q(t) = q(u(t)) \in \mathcal{Q}$, does not produce constraint violations even though the optimization problem (13b) does not know the actual value of the quantization error $w$.
(81) Control Invariant Subset
(82)
(83) Due to the nature of receding horizon control, the existence of a solution for a certain horizon does not by itself guarantee the existence of a solution for a subsequent horizon. Specifically, while the receding horizon solution makes the problems (13a) and (13b) computationally feasible, it is not possible to guarantee that such a problem always has a solution. In particular, it is possible that the problem (13a) or (13b) solved at time T has a solution, but the one to be solved at time T+dh does not. This is due to the fact that as the horizon is shifted, the constraints in (2), (6), (7) have to be enforced on a new piece of the trajectory, i.e., during the time interval [T+h, T+dh+h], that was not accounted for before.
(84) For example, the state of the machine and a state of the train 720 can be optimal and feasible for one iteration, but all control actions 721-723 that the controller is allowed to take during the next iteration can bring the state of the train outside of the feasible region 101.
(85) Some embodiments of the present disclosure are based on yet another realization that it is possible to select a subset 401 of the feasible region 101, such that from any state of the train within that subset, there is a control action in the finite set of values maintaining the state of the train within the subset. For example, for any state such as a state 730 within the subset 401 and within all possible control actions 731-734 that the controller can execute, there is at least one control action in the finite set of values, e.g., actions 731 and 732, that maintains the state of the train within the control invariant subset 401.
(86) Accordingly, if a control action for controlling the operation is selected such that the state of the train remains in that special subset 401 of the feasible region, and the feasible region is generated also according to Equation (1), then there is a guarantee that it is possible to determine the sequence of control actions forming an ad-hoc run-curve leading the train from the current position to the stopping position.
(87) For example, one embodiment determines a discretized version of the problem in (13) by considering a sampling period $dh$ and obtaining a discrete-time model for the dynamics in (5), which is
$x(t+dh) = f_d(x(t), q(t), p)$
$y(t) = h_d(x(t))$ (14)
wherein, given a state $x$ and a quantized control input $q$, $f_d(x, q, p)$ is the updated state. Based on the discrete-time model, the constrained control is
(88)
wherein x(k+i) is the predicted state value at time t+i dh, x(t+i dh). At any time t of the control, one embodiment solves the problem (15a) on the future interval [t, t+N dh]. A first control input q(0) from the sequence of control inputs, specifying the control action for a next time step of control, is applied during [t, t+dh]; then the new state x(t+dh) is read and a new problem is solved.
(89) Problem (15a) is a mixed-integer optimization problem. Some embodiments solve the convex optimization problem
(90)
and apply a quantization rule $q: \mathcal{U} \to \mathcal{Q}$ to obtain a feasible input $q(k) = q(u(k)) \in \mathcal{Q}$ in the finite set of values $\mathcal{Q}$. At any time t of the control, one embodiment solves the problem (15b) on the future interval [t, t+N dh]. A first control input q(0)=q(u(0)) from the sequence of control inputs, specifying the control action for a next time step of control, is applied during [t, t+dh]; then the new state x(t+dh) is read and a new problem is solved.
(91) Problems (15a) and (15b) are not guaranteed to be feasible. However, some embodiments modify the constraints to guarantee feasibility. The set of feasible states $\mathcal{X}_f$ is the set that includes all the values for the state $x$ satisfying Equations (2), (6), (7). The subset $\mathcal{C} \subseteq \mathcal{X}_f$ of the set of feasible states used by some embodiments is control invariant with respect to dynamics (14) and constraints (2), (6), (7), that is, for every $x \in \mathcal{C}$ there exists a value $q \in \mathcal{Q}$ such that $f_d(x, q, p) \in \mathcal{C}$.
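The control invariance condition can be checked directly when the candidate set is a finite collection of states. The sketch below assumes a hypothetical integer-valued model `toy_model` with state (d, v) and a hypothetical input set; neither comes from the disclosure, and the function name `is_control_invariant` is chosen here for illustration.

```python
def is_control_invariant(candidate, inputs, model):
    """True if every state in `candidate` has at least one input whose successor stays in it."""
    return all(any(model(x, q) in candidate for q in inputs) for x in candidate)

def toy_model(state, q):
    """Hypothetical discrete-time model: position advances by v, speed changes by q."""
    d, v = state
    return (d + v, v + q)

TOY_INPUTS = (-1, 0)   # assumed finite input set: brake by one unit, or coast
```

With these assumptions, the single state `{(-1, 1)}` is not control invariant because both inputs lead outside the set, while `{(-1, 1), (0, 0)}` is, since braking from (-1, 1) reaches (0, 0) and coasting at (0, 0) stays there.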
(92) Accordingly, some embodiments select a control action for the movement of the train by solving the mixed-integer optimization problem
(93)
(94) If $x(t) \in \mathcal{C}$, then the modified problem is feasible, and when the input $q$ is applied to the train, the problem generated at the next time step $t+dh$ is going to be feasible because $x(t+dh) = f_d(x(t), q(t), p) \in \mathcal{C}$. Thus, if the first problem generated when the controller is initialized is feasible, the generated trajectory always satisfies constraints (2) and hence the train stops where required.
(95) In other embodiments, the control input is obtained by solving the convex optimization problem
(96)
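and applying a quantization rule $q: \mathcal{U} \to \mathcal{Q}$ to obtain a feasible input $q(t) = q(u(t)) \in \mathcal{Q}$ in the finite set of values $\mathcal{Q}$. The set $\mathcal{C}$ is control invariant for the quantization rule $q: \mathcal{U} \to \mathcal{Q}$ if for every $x \in \mathcal{C}$ there exists a value $u \in \mathcal{U}$ such that $f_d(x, u - w, p) \in \mathcal{C}$ for every possible quantization error $w \in \mathcal{W}$.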
(97) Robust Control Invariant Set for Quantization Errors
(98) In some cases, the values of the variables in the parameter vector p in (5) are not exactly known. For instance, only an upper and lower bound may be known, or, more generally, it may only be known that the parameter vector p takes one of the values in a set P, and p may also change continually within this set.
(99) It is realized that the control strategy can be modified to guarantee precise stopping in the presence of constraints by ensuring that the constraints in (2), (6), (7) are satisfied at any time instant for all values of the parameter vector. For example, some embodiments determine the control invariant subset for a set of possible parameters of the train, such that for each state within the control invariant subset, there is at least one control action maintaining the state of the train within the control invariant subset for all parameters from the set of possible parameters of the train.
(100) To this end, in place of the set $\mathcal{C}$ in (16), some embodiments use the set $\mathcal{C}(P)$, which is a subset of $\mathcal{X}_f$ such that for all states $x$ that are in $\mathcal{C}(P)$, there exists an input $q \in \mathcal{Q}$ such that $f_d(x, q, p) \in \mathcal{C}(P)$ for all the values $p$ in $P$.
(101) Thus the problem for stopping the train with uncertain parameter values and quantization errors is
(102)
where the estimate $\hat{p} \in P$ of the unknown parameter may not be the actual value. However, the design of the control invariant set $\mathcal{C}(P)$, which incorporates the uncertainty in the parameters $P$, guarantees that the train state remains in the feasible region.
(103) It can be difficult to compute a control invariant set $\mathcal{C}(P)$ when the input set $\mathcal{Q}$ is a finite set of values. Thus some embodiments instead compute a control invariant set for a particular quantization rule. A set $\mathcal{C}(P, \mathcal{W})$ is control invariant for some quantization rule $q: \mathcal{U} \to \mathcal{Q}$ if for every $x \in \mathcal{C}(P, \mathcal{W})$ there exists $u \in \mathcal{U}$ such that $f_d(x, u - w, p) \in \mathcal{C}(P, \mathcal{W})$ for any quantization error $w \in \mathcal{W}$. Thus the problem for stopping the train with uncertain parameter values and quantization errors is
(104)
where the estimate $\hat{p} \in P$ of the unknown parameter may not be the actual value. The control invariant set $\mathcal{C}(P, \mathcal{W})$ incorporates uncertainty in the parameters $P$ and bounds $\mathcal{W}$ on the quantization error to guarantee that the train state remains in the feasible region.
(105) If $x(t) \in \mathcal{C}(P, \mathcal{W})$, then the modified problem is feasible, and when the input $q = u - w$ is applied to the train, the problem generated at the next time step $t+dh$ is also feasible because $x(t+dh) \in \mathcal{C}(P, \mathcal{W})$ for all real values of $p$ in $P$ and the quantization error $w$ in $\mathcal{W}$. Thus, if the first problem generated when the controller is initialized is feasible, the generated trajectory always satisfies constraints in (2), (6), (7) and hence the train stops where required.
(106)
(107) Control Invariant Set Computation
(108) (P), for uncertainty set P. The set
(P) can be generated by the same computation where the set P includes only a single value.
(109) The backward-reachable region computation initializes 801 a current set $\mathcal{X}_c$ to the feasible set $\mathcal{X}_f$ and determines 802 a predecessor set of states $\mathcal{X}_p$ as a subset of the current set $\mathcal{X}_c$ such that for all states $x$ in $\mathcal{X}_p$ there exists an input $q$ in $\mathcal{Q}$ such that for all the possible values of the parameters $p$ in $P$, the updated state lies in the current set $\mathcal{X}_c$.
(110) If 803 the predecessor set $\mathcal{X}_p$ is empty, it is not possible 804 to guarantee feasibility of problem (17a), which means that it is not possible to guarantee precise stopping with the amount of uncertainty $P$ of the train parameters. If the current set 805 and the predecessor set are equal 806, then the current set $\mathcal{X}_c$ is a control invariant set, $\mathcal{C}(P) = \mathcal{X}_c$. Otherwise the predecessor set $\mathcal{X}_p$ is assigned 807 to be the current set, $\mathcal{X}_c = \mathcal{X}_p$, and the computation iterates 808 again.
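The iteration in steps 801 to 808 can be illustrated on a small gridded state space. Everything concrete in this sketch is an assumption made for illustration: the integer grid, the upper and lower velocity curves in `feasible_region`, the model `f` with an uncertain extra-drag parameter, and the sets `INPUTS` and `PARAMS`. Only the fixed-point loop itself follows the description above.

```python
def feasible_region():
    """Hypothetical grid of states (d, v); d is position relative to the end of the stopping range."""
    region = set()
    for d in range(-6, 1):
        for v in range(0, 4):
            upper = -d                    # upper curve: zero speed only at d = 0
            lower = 0 if d >= -1 else 1   # lower curve: zero speed only inside [-1, 0]
            if lower <= v <= upper:
                region.add((d, v))
    return region

def f(x, q, p):
    """Assumed discrete-time model: p is an uncertain extra drag, speed is clamped at zero."""
    d, v = x
    return (d + v, max(0, v + q - p))

INPUTS = (-1, 0, 1)   # assumed finite set of speed changes per step
PARAMS = (0, 1)       # assumed set P of possible drag values

def control_invariant_set(feasible, inputs, params, model):
    """Backward-reachable fixed point; returns None when the predecessor set becomes empty."""
    current = set(feasible)                                   # step 801
    while True:
        predecessor = {                                       # step 802
            x for x in current
            if any(all(model(x, q, p) in current for p in params) for q in inputs)
        }
        if not predecessor:                                   # steps 803, 804
            return None
        if predecessor == current:                            # steps 805, 806
            return current
        current = predecessor                                 # steps 807, 808
```

Because the grid is finite and each iteration can only remove states, the loop terminates. Passing a single parameter value in `params` gives the nominal control invariant set by the same computation.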
(111)
(112) The algorithm in . Accordingly, some embodiments of the present disclosure replace the finite input set of values $\mathcal{Q}$ with a polyhedral set $\mathcal{U} \supseteq \mathrm{conv}(\mathcal{Q})$ and a quantization rule $q: \mathcal{U} \to \mathcal{Q}$ that maps the convex input $u \in \mathcal{U}$ to a quantized value $q(u) \in \mathcal{Q}$. The quantization error is the difference $w = u - q(u)$ between the convex input $u \in \mathcal{U}$ and the quantized input $q = q(u) \in \mathcal{Q}$. The algorithm shown in to construction a control invariant set that is robust to quantization errors. The set of possible quantization errors can be over-bounded by a set $\mathcal{W}$.
(113) .fwdarw.
and the set that bounds quantization errors
generated by q. The difference between the algorithm in
912 at the beginning of each iteration. This ensures that the states in the predecessor set $\mathcal{X}_p$ can be mapped into the current set $\mathcal{X}_c$ no matter what value the quantization error $w$ assumes in the set $\mathcal{W}$.
(114) The computation of the backward-reachable set can be simplified when the constraint sets $\mathcal{X}_f$, $\mathcal{U}$, and $\mathcal{W}$ are polyhedral and the parameter-dependent dynamics are described by a set of linear models
$f(x, q, p) \in \mathrm{conv}(\{A_i x(k) + B_i q(k)\}_{i=1}^{l})$ (18)
where the matrices $A_i$ and $B_i$ capture all possible behaviors of the system for different parameter values $p$ in $P$. The linear models in (18) can be computed, for instance, by taking the maximum and minimum of the parameters that form vector $p$ allowed by $P$, and/or of their combinations. Equation (18) also covers the case where all the parameters are perfectly known, since in that case only one model is used, $l = 1$.
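As an illustration of building the extreme models in (18) from parameter bounds, the sketch below enumerates combinations of the minimum and maximum of two parameters. The model structure (state [d, v], a mass and a linear drag coefficient, a forward-Euler step of length `dh`) is an assumption for illustration and is not the train dynamics (5) of the disclosure; the function name `vertex_models` is likewise hypothetical. Whether the convex hull of such vertex models really covers every behavior depends on how the parameters enter the dynamics, so this should be read as a starting point only.

```python
from itertools import product

def vertex_models(mass_bounds, drag_bounds, dh):
    """Return one (A, B) pair per combination of extreme parameter values."""
    models = []
    # sorted(set(...)) collapses identical bounds, so known parameters give a single model
    for m, c in product(sorted(set(mass_bounds)), sorted(set(drag_bounds))):
        A = [[1.0, dh],
             [0.0, 1.0 - dh * c / m]]   # assumed Euler-discretized linear drag
        B = [0.0, dh / m]               # assumed force-to-acceleration gain
        models.append((A, B))
    return models
```

For example, `vertex_models((1.0, 2.0), (0.1, 0.2), 0.5)` yields four models, while identical lower and upper bounds yield one model, matching the remark that l = 1 when the parameters are perfectly known.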
(115) and
are polyhedral and the dynamics are described by (18). The method considers the current set as
$\mathcal{X}_c = \{x : h_i^{(c)} x \le k_i^{(c)},\; i = 1, \dots, m\}.$ (19)
(116) The worst-case quantization error $w_i$ is determined for each constraint $h_i^{(c)} x \le k_i^{(c)}$ by solving the linear optimization problem
(117)
(118) The worst-case quantization error $w_i$ is used to shrink the current set according to
$\mathcal{X}_s = \{x : h_i^{(c)} x \le k_i^{(c)} - w_i,\; i = 1, \dots, m\}$
(119) Finally, the predecessor set is computed by finding the set of state and input pairs $(x, u)$ such that the successor state $A_i x + B_i u \in \mathcal{X}_s$ is inside the shrunken set $\mathcal{X}_s$ for every extreme model $i = 1, \dots, l$.
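The shrinking step can be sketched for one special case. The assumptions here are not taken from the disclosure: the error set is a symmetric box with half-widths `radii`, the error enters the successor state as $-Bw$, and the worst-case value for each constraint is treated as a scalar margin. Under those assumptions the linear program has a closed-form solution, the sum of $|(h_i B)_j|$ times the j-th half-width, so no solver is needed. The names `worst_case_margin` and `shrink` are hypothetical.

```python
def worst_case_margin(h, B, radii):
    """Largest value of h.(-B w) over the box |w_j| <= radii[j] (closed form for a box)."""
    n_inputs = len(radii)
    hB = [sum(h[r] * B[r][j] for r in range(len(h))) for j in range(n_inputs)]
    return sum(abs(g) * r for g, r in zip(hB, radii))

def shrink(H, k, B, radii):
    """Tighten each constraint h_i x <= k_i by its worst-case margin."""
    return [k_i - worst_case_margin(h_i, B, radii) for h_i, k_i in zip(H, k)]
```

For instance, with rows `[1, 0]`, `[0, 1]`, `[0, -1]`, right-hand sides `[0.0, 3.0, 0.0]`, `B = [[0.0], [0.5]]` and `radii = [0.5]`, the position constraint is unchanged because the input does not act on position directly, while both velocity constraints tighten by 0.25. For a general polyhedral error set, a linear programming solver would replace the closed form.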
(120)
(121) Quantization Rule
(122) If the train-stopping problem has too much parameter uncertainty $P$ or the quantization errors $\mathcal{W}$ are too large, then the control invariant set $\mathcal{C}(P, \mathcal{W})$ will be empty. This means that it is not possible to guarantee precision stopping of the train for all possible values of the train parameters and all possible quantization errors. The uncertainty set $P$ for the train parameters cannot be changed. However, the set $\mathcal{W}$ that bounds the quantization errors can be changed by choosing a different quantization rule $q: \mathcal{U} \to \mathcal{Q}$. Thus, one embodiment of the present disclosure discloses a system and a method for designing a quantization rule $q: \mathcal{U} \to \mathcal{Q}$ that ensures that the control invariant set $\mathcal{C}(P, \mathcal{W})$ is not empty and therefore that it is possible to stop the train in the desired location.
(123) When the control inputs are vectors, the quantization errors $w = u - q(u)$ have both a magnitude and a direction. It was further realized that quantization errors in different directions have different effects on the ability of the train to precisely stop. FIG. 10A shows an example of how the direction of the quantization error can affect stopping precision of the train. In this example the finite input set of values is two dimensional and therefore the quantization error is also two dimensional. 1001A shows the position 110 versus velocity 120 trajectories of the train under constant braking. The automatic train-stopping controller places the train on one of the trajectories 1001A that terminate within the desired stopping range 108 between 107 and 104. A quantization error in the direction 1003 can push the train onto an undesirable trajectory. On the other hand, a quantization error in the direction 1002A will advance the train along the safe trajectory and thus has less effect on stopping precision.
(124)
(125) Accordingly, some embodiments of the present disclosure disclose a method and a system for selecting a quantization rule that produces small quantization errors in directions that can reduce stopping precision. The quantization rule is designed using an optimization problem that incorporates information about the dynamics and constraints of the train to minimize the effects of quantization error on stopping precision.
(126) The quantization rule $q: \mathcal{U} \to \mathcal{Q}$, used by some embodiments of the present disclosure, maps the convex input $u \in \mathcal{U}$ to the closest element $q \in \mathcal{Q}$ in the finite set of values $\mathcal{Q}$ under some weighted distance function
(127) $q(u) = \arg\min_{q \in \mathcal{Q}} \|u - q\|_W^2$ (22)
where $\|u - q\|_W^2 = (u - q)^T W (u - q)$ is the weighted distance and $W = W^T \succ 0$ is a positive definite matrix. This quantization rule is non-obvious for two reasons. First, since the weighting matrix $W$ is not necessarily diagonal, it can be used to parameterize very non-intuitive distance functions. Thus the quantization rule may round the convex input $u \in \mathcal{U}$ to a finite value $q(u)$ that is far away in terms of the intuitive Euclidean distance function. Second, it is not obvious how the dynamics and constraints of the train are to be used to design the weighting matrix $W$ in order to minimize the effects of quantization error on stopping precision.
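The effect of a non-diagonal weighting matrix can be seen in a small numerical sketch of the weighted nearest-element rule. The input set `FINITE_SET`, the matrix `W_SKEWED`, and the test point are made-up values chosen only to show that the weighted rule and the Euclidean rule can disagree; the function names are hypothetical.

```python
def weighted_sq_dist(u, q, W):
    """Squared weighted distance (u - q)^T W (u - q)."""
    e = [ui - qi for ui, qi in zip(u, q)]
    n = len(e)
    return sum(e[i] * W[i][j] * e[j] for i in range(n) for j in range(n))

def quantize(u, finite_set, W):
    """Map the convex input u to the element of the finite set closest under W."""
    return min(finite_set, key=lambda q: weighted_sq_dist(u, q, W))

FINITE_SET = ((0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0))  # assumed 2-D input values
IDENTITY = [[1.0, 0.0], [0.0, 1.0]]     # Euclidean distance
W_SKEWED = [[5.0, 4.0], [4.0, 5.0]]     # positive definite, penalizes errors along (1, 1)
```

For the input (0.7, 0.6), the Euclidean rule selects (1.0, 1.0), whereas the rule weighted by `W_SKEWED` selects (1.0, 0.0): the error toward (1.0, 1.0) lies mostly along the heavily penalized direction (1, 1), so a value that is farther away in the Euclidean sense is preferred.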
(128) The quantization rule (22) minimizes the size $\|w\|_W^2 = \|u - q(u)\|_W^2$ of quantization errors $w = u - q(u)$. We call a quantization error $w$ small if it satisfies $\|w\|_W^2 = \|u - q(u)\|_W^2 \le 1$. The weighting matrix $W$ is chosen such that if the quantization errors $w = u - q(u)$ are small, $\|w\|_W^2 \le 1$, then the effects on the train are small. In other words, the quantization rule is to be designed to maximize the volume of small quantization errors that have small effects on the train. Thus the weighting matrix $W$ is chosen to maximize the volume of the set of small quantization errors
$\mathcal{E}(W) = \{w : \|w\|_W^2 \le 1\}$ (23)
for which it is possible to stop the train within the desired stopping range. The set (23) is an ellipsoid parameterized by the weighting matrix $W$.
(129) The constraints on the train state can be satisfied for quantization errors in the set $\mathcal{E}(W)$ if there exists a linear controller $u = Fx$ that satisfies input constraints $u \in \mathcal{U}$ and keeps the state inside a subset $\mathcal{O} \subseteq \mathcal{X}_f$ of the feasible region $\mathcal{X}_f$. In the present disclosure, the subset $\mathcal{O} = \mathcal{E}(P)$ is an ellipsoid parameterized by a positive definite matrix $P$. The state of the train never leaves the set $\mathcal{O} = \mathcal{E}(P)$ for any quantization error $w \in \mathcal{E}(W)$ if the following matrix inequalities hold
(130)
for $i = 1, \dots, l$ and some $\lambda \in (0, 1)$. The set $\mathcal{O} = \mathcal{E}(P)$ is a subset of the feasible region $\mathcal{X}_f$ if the following matrix inequalities hold
(131)
(132) for each inequality $h_{i,x} x \le k_{i,x}$ that defines the feasible region $\mathcal{X}_f = \{x : h_{i,x} x \le k_{i,x},\; i = 1, \dots, m_x\}$. The linear controller $u = Fx$ satisfies input constraints $u \in \mathcal{U}$ for every state $x \in \mathcal{O}$ if the following matrix inequalities hold
(133)
for each inequality $h_{i,u} u \le k_{i,u}$ that defines the input set $\mathcal{U} = \{u : h_{i,u} u \le k_{i,u},\; i = 1, \dots, m_u\}$. Thus if there exist matrices $P$, $F$, and $W$ that satisfy the matrix inequalities, then it is possible to guarantee that the train state is feasible for any quantization error $w = u - q(u)$ in the set (23) of small quantization errors.
(134) The larger the set of errors (23) that do not cause constraint violations, the smaller the effect of the quantization errors on the system. The volume of the set (23) can be maximized by solving the optimization problem
(135)
where the volume of the set (23) is proportional to the square root of the determinant $\det W^{-1}$ of the inverse of the distance-weighting matrix $W$. The optimization problem (25) uses information about the dynamics $A_i$ and $B_i$, and the constraints $\mathcal{X}_f$ and $\mathcal{U} \supseteq \mathrm{conv}(\mathcal{Q})$ on the train to find a quantization rule of the form (22) that produces quantization errors that have the minimal effect on the stopping precision of the train.
(136) If the quantization rule (22) chooses the finite input value $q$, then the quantization error is bounded by the Voronoi cell
$\mathcal{W}_q = \{u - q : \|u - q\|_W^2 \le \|u - p\|_W^2 \;\; \forall p \in \mathcal{Q}\}.$
(137) The worst-case quantization error for the quantization rule (22) is bounded by the convex hull of the quantization error sets for each finite input value $q$:
$\mathcal{W} = \mathrm{conv}\{\mathcal{W}_q : q \in \mathcal{Q}\}$ (26)
(138) The control invariant set $\mathcal{C}(P, \mathcal{W})$ will be non-empty if the quantization errors (26) are small, $\|w\|_W^2 = \|u - q(u)\|_W^2 \le 1$, i.e., $\mathcal{W} \subseteq \mathcal{E}(W)$. This means that it is possible to precisely stop the train within the desired range while satisfying constraints despite parameter uncertainty and quantization errors. The optimal choice (25) for the weighting matrix $W$ depends on the choice of the input set $\mathcal{U} \supseteq \mathrm{conv}(\mathcal{Q})$.
(139) $\mathrm{conv}(\mathcal{Q})$. The convex input set is initialized 1101 as the convex hull $\mathcal{U} = \mathrm{conv}(\mathcal{Q})$ of the finite input set of values $\mathcal{Q}$. The optimization problem (25) is solved 1102 to find the optimal weighting matrix $W$ for this input set $\mathcal{U}$. Next the quantization error set (26) for the quantization rule (22) is computed 1103. If the quantization errors are not small 1105, $\mathcal{W} \not\subseteq \mathcal{E}(W)$, then the previously computed weighting matrix $W$ 1106 is to be used in the quantization rule (22). Otherwise the convex input set can be expanded 1107 and the design process is repeated 1108.
(140) to the closest finite input value $q \in \mathcal{Q}$ in terms of the intuitive Euclidean distance. This naive approach is the current state of the art. The convex input set 1201A is a box around the set of finite input values 1202A. For any convex input in the region 1203A the quantization rule selects the quantized input 1202A. The quantization rule shown in
(141) The quantization rule shown in (P,
) is not empty and therefore the train is guaranteed to stop in the desired stopping range for any value of the parameters p in P. The quantization rule shown in
(142) Train Stopping Control Systems based on Control Invariant Sets and Soft Landing Constraints
(143)
(144) Based on such information the controller 1301 selects commands for the propulsion force needed to influence the train motion, which are sent to the train 1319 and used in the propulsion system, where a positive force is actuated by the traction motors and a negative force is actuated by the braking system. The controller 1301 may solve the problems (12) or (13) from current time $T$ to $t_f = \infty$, thus obtaining the full trajectory for the input that is sent to the train propulsion system. More commonly, the controller 1301 operates in a receding horizon strategy as described in
(145) If the constrained control of Equations (12) or (13) or (15) or (16) or (17) is always solved with a feasible solution, then the train stops in the desired range of locations. Furthermore, the control described in Equations (16) and (17) guarantees that if the first problem solved when the control system is first activated is feasible, all the subsequent problems are feasible, and hence the train stops in the desired range of locations. It is also realized that in order for the first problem to be feasible, it is enough to initialize the controller when the current state $x(t)$ of the train system is in the control invariant set: $x(t) \in \mathcal{C}$ for (16), $x(t) \in \mathcal{C}(P)$ for (17a), and $x(t) \in \mathcal{C}(P, \mathcal{W})$ for (17b).
(146) Furthermore, it is realized that by using the control invariant subset determined using the backward-reachable region computation starting from the feasible region, the train control system does not require a calibration to achieve the primary target, because the control invariant subset is determined independently of all the controller calibration parameters, such as the length of the horizon, h, and the cost function components L, F.
(147) These parameters can be selected to obtain secondary objectives of the controller such as minimum time stopping, for which $L$ is selected as
$L = d^2$, (26)
minimum braking effort
$L = F^2$, (27)
which also provides smooth deceleration, minimum velocity stopping
$L = v^2$, (28)
minimum energy
$L = u_v^2$, (29)
which penalizes only the use of traction motors by defining $u_v \ge F$, or a combination of the above functions. For (27), (28), F=L, for (27), (29) F=0. The horizon length $h$ can be selected based on timing requirements, since a longer horizon provides better performance with respect to the selected secondary objective but requires longer computations for the controller to generate the commands.
(148) In the embodiment in which the dynamics on the right-hand side of (18) are used and the stopping constraints include linear inequalities, the problems (15), (16), (17) can be converted into quadratic programming problems that can be solved more effectively.
(149) for a braking command $q \in \mathcal{Q}$ that keeps the train state inside the control invariant set.
(150) For example, the embodiment acquires 1401 the train state from sensors 1405, 1306. Then it selects 1402 one value $q \in \mathcal{Q}$ from the finite set of control input values and uses the train model to test 1404 whether applying this control input $q \in \mathcal{Q}$ will keep the resulting train state inside the control invariant set. If so, then the control input $q \in \mathcal{Q}$ is applied 1405. Otherwise 1406 the controller checks and/or tests another control input. By the definition of the control invariant set, at least one control input $q \in \mathcal{Q}$ in the finite set of values $\mathcal{Q}$ will ensure that the future state of the train lies in the control invariant set.
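The test-and-apply procedure in steps 1401 to 1406 amounts to a search over the finite input set. A minimal sketch follows, assuming a hypothetical integer model `train_model` and an already computed invariant set stored as a Python set of states; the order in which inputs are tried (coasting before braking) is also an assumption, as is the function name `select_input`.

```python
def select_input(state, inputs, invariant_set, model):
    """Return the first input whose predicted successor stays in the invariant set."""
    for q in inputs:                       # step 1402: pick a candidate input
        if model(state, q) in invariant_set:   # step 1404: test it with the train model
            return q                       # step 1405: this input would be applied
    return None                            # only reachable if state is outside the set

def train_model(state, q):
    """Hypothetical model: position advances by v, speed changes by q."""
    d, v = state
    return (d + v, v + q)

CANDIDATES = (0, -1)                # assumed preference order: coast first, then brake
INVARIANT = {(-1, 1), (0, 0)}       # assumed control invariant set for this toy model
```

With these values, `select_input((-1, 1), CANDIDATES, INVARIANT, train_model)` rejects coasting, which would leave the set, and returns the braking input -1. From (0, 0) it returns 0. A `None` result signals that the controller was started from a state outside the control invariant set.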
(151) Closed-Loop Train Behavior
(152)
(153)
(154) In
(155) In
(156) Another advantage of the train automatic stopping control disclosed in the present disclosure, among many possible advantages, is that the ad-hoc run-curve is re-computed online at each sample time based on the most recent measurement of the train state. The cost function of the optimization problem ensures that the ad-hoc run-curve is the optimal run-curve for the train given its current position and velocity. The constraints of the optimization problem ensure that the ad-hoc run-curve is always physically realizable by the train dynamics. Run-curves that are pre-computed offline are not necessarily optimal, nor are they necessarily physically realizable.
(157) The above-described embodiments of the present disclosure can be implemented in any of numerous ways. For example, the embodiments may be implemented using hardware, software or a combination thereof. When implemented in software, the software code can be executed on any suitable processor or collection of processors, whether provided in a single computer or distributed among multiple computers. Such processors may be implemented as integrated circuits, with one or more processors in an integrated circuit component. Though, a processor may be implemented using circuitry in any suitable format.
(158) Also, the various methods or processes outlined herein may be coded as software that is executable on one or more processors that employ any one of a variety of operating systems or platforms. Additionally, such software may be written using any of a number of suitable programming languages and/or programming or scripting tools, and also may be compiled as executable machine language code or intermediate code that is executed on a framework or virtual machine. Typically, the functionality of the program modules may be combined or distributed as desired in various embodiments.
(159) Also, the embodiments of the present disclosure may be embodied as a method, of which an example has been provided. The acts performed as part of the method may be ordered in any suitable way. Accordingly, embodiments may be constructed in which acts are performed in an order different than illustrated, which may include performing some acts concurrently, even though shown as sequential acts in illustrative embodiments. Further, use of ordinal terms such as first, second, in the claims to modify a claim element does not by itself connote any priority, precedence, or order of one claim element over another or the temporal order in which acts of a method are performed, but are used merely as labels to distinguish one claim element having a certain name from another element having a same name (but for use of the ordinal term) to distinguish the claim elements.
(160) Although the present disclosure has been described with reference to certain preferred embodiments, it is to be understood that various other adaptations and modifications can be made within the spirit and scope of the present disclosure. Therefore, it is the aspect of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the present disclosure.