Optimal Control and Reachability with Competing Inputs

About This Presentation

Title:

Description:

Number of Views:32

Avg rating:3.0/5.0

Slides: 15

Provided by: IanM51

Transcript and Presenter's Notes

Title: Optimal Control and Reachability with Competing Inputs

1
Optimal Control and Reachabilitywith Competing
Inputs

2
Competing Inputs

What do we do when there are multiple parameters,
some of which we can choose but some of which
have unknown and uncontrolled value
Control input denoted by u (or a)
Disturbance input denoted by d (or b)
Choose control input as before to optimize
trajectory or achieve safety
Due to disturbance input, system remains
nondeterministic even if control signal is fixed

3
Two Treaments of Disturbance

4
Markov Decision Process

5
Stochastic Differential Equations (SDEs)

6
Continuous Backward Reachable Tubes

Continuous System Dynamics
Target Set G(0)
Backward Reachable Set G(t)
7
Reachable Tubes (controlled input)

Continuous System Dynamics
8
Reachable Tubes (uncontrolled input)

Continuous System Dynamics
9
Two Competing Inputs

Continuous System Dynamics
10
Objective Function

11
Who Goes First?

One input is chosen to maximize and the other to
minimize the objective
But what knowledge is available when choosing an
input?
Current state? Other input?
Non-anticipative strategies
One player gets to know the other players input
value (as well as current state)
However, that player must declare their strategy
(reaction to every input) in advance

12
Zero Sum Game Value Function

Value function is then defined as optimization
over appropriate strategy and input signal pair
Lower value function, since disturbance
(minimizer) has the advantage
Parallel upper value function can be defined
If inputs are independent, optimal strategy will
ignore additional information about the other
input
Upper and lower value functions will be equal

13
Competing Inputs Final Comments

Feedback control is more realistic implementation
If order of input decision is irrelevant (upper
and lower value functions are equal), then
nonanticipative strategy results will be
equivalent to feedback results
For robustness, give advantage (eg strategy) to
the disturbance input if it matters (potentially
pessimistic)
Input signals still drawn from set of measureable
functions
Two player concepts have been extended to
viability theory and set-valued analysis

14
Optimal Controland Reachability with Competing
Inputs