Modelbased Monitoring, Diagnosis and Recovery - PowerPoint PPT Presentation

1 / 46
About This Presentation
Title:

Modelbased Monitoring, Diagnosis and Recovery

Description:

Basic principles are often 50 or 100 years old ... anti-lock brakes, traction control, air bags. NASA Research Challenges. Every machine is unique ... – PowerPoint PPT presentation

Number of Views:38
Avg rating:3.0/5.0
Slides: 47
Provided by: codeickeys
Category:

less

Transcript and Presenter's Notes

Title: Modelbased Monitoring, Diagnosis and Recovery


1
Model-based Monitoring, Diagnosis and Recovery
  • James Kurien
  • Autonomy and Robotics Area
  • NASA Ames Research Center
  • Thesis Committee

Pandu Nayak PurpleYogi USRA/RIACS
Tom Dean Brown University
Leslie Kaelbling MIT
Also includes joint work with Dave Smith, NASA ARC
2
An Observation on Machine Evolution
1965
1995
1954
1995
  • Basic principles are often 50 or 100 years old
  • The basic mechanical design often persists for
    decades
  • Yet efficiency, robustness, reliability radically
    improve
  • One difference on-board sensing, computation
    actuation
  • computer controlled, closed loop electronic fuel
    injection
  • anti-lock brakes, traction control, air bags

3
NASA Research Challenges
  • Every machine is unique
  • Some machines must survive years without repair
  • Failure and redundancy complicate control
  • Relatively short down time can destroy a mission
  • Development operation costs must be contained
  • Challenge Easily developed, highly capable
    control systems

4
Problem Statement
  • Given
  • A model of a physical system such as a spacecraft
  • The internal actions taken and observations
    received thus far
  • Determine
  • The most likely internal states of the system
  • The commands needed to move to a desirable state

Configuration Goals
Model
Action Selection
State Estimate
Command
Observations
5
Typical Domain
  • Engineers model the local, qualitative behavior
    of system components
  • Components are things like valves, switches,
    tanks, engines
  • Properties of interest are transmission of flow,
    voltage, etc
  • Goals are produce acceleration, maintain
    pointing ability, etc

Spacecraft Propulsion System Model
6
Assumptions
  • The outcome of actions is non-deterministic
  • The world is only partially observable
  • Time is discrete
  • Finite number of states and actions
  • Partially Observable Markov Decision Processes

7
Belief State
  • A belief state is a probability distribution over
    the possible states of the system
  • B(s) is the probability of being in state s given
    a stream of commands and observations
  • For a POMDP, there is a simple procedure for
    updating B(s) given a command, an observation,
    and the previous B(s) for all s

8
The Problem with Belief States
Spacecraft Propulsion System Model
Robot Navigation Model
1
  • Few variables
  • Typically 104 states
  • Smooth
  • Hundreds of variables
  • Typically 10150 states
  • Not smooth

Can we compute B(s) for a subset of the state
space?
9
Valve Driver Example
Valve Driver
command
Flowv1
Valve1
Pump
Flowv2
Valve2
  • Valve Driver (VDU) sends command to valves
  • Pump pressurizes the valves
  • Flow measured at each valve
  • VDU may hang, valves may stick shut

10
Track the Most Likely State?
Probability
Time
  • Start with a known state
  • Track most likely outcomes given current
    observations
  • Was very fast when used on Deep Space 1
  • MLS may become arbitrarily unlikely or
    inconsistent

11
The Whole Idea
Incremental Belief State Generation
Probability
Observe no flow.
time
  • Avoid committing to a small number of
    trajectories
  • Build a structure that compactly represents all
    evolutions
  • Generate additional trajectories in likelihood
    order as needed or as time allows

12
Basic Approach
13
Compositional, Consistency-Based POMDP
  • Transitions specified by non-deterministic
    automata
  • Observations specified by propositional logic
    model
  • Computational Leverage
  • Assume transition probabilities of automata are
    independent
  • States that are logically inconsistent with
    observations have zero probability

14
Encoding the Valve Automaton
Cmdinopen
open
closed
Flowzero
Flow?Pressure
Cmdinclose
stuck
Flowzero
  • State constraints are quite easy to represent
    propositionally
  • Valveclosed ? Flowzero
  • How can we represent stochastic transitions?

15
Building a One Step Model
Propositional Constraint System
Time t
Time t1
off
vdu
tvdu
???
cmdin
open
none
cmdout
closed
v1
tv1
???
Flowv1
zero
v2
closed
tv2
???
Flowv2
zero
16
Trajectory Tracking
t1
t0
t2
t0
t1
t2
off
on
on
off
vdu
vdu
tvdu
tvdu
nom
nom
cmdin
open
on
none
cmdin
on
open
none
open
none
cmdout
none
none
cmdout
closed
open
closed
v1
closed
v1
tv1
tv1
nom
nom
Flowv1
zero
zero
high
Flowv1
zero
closed
v2
closed
open
closed
v2
tv2
tv2
nom
nom
Flowv2
zero
zero
high
Flowv2
zero
zero
zero
  • The assignments to t capture every possible
    trajectory
  • Trajectories can be enumerated in prior
    probability order
  • P(trajectory) S P(t assignment)
  • Each trajectory can be checked for agreement with
    observations

17
Naive Tracking Solution
  • To track the n most likely trajectories
  • for t0 t ! ? t
  • Create full copy of the model for the new time
    step t
  • Assign command and observation variables
  • Loop over assignments to t in order of prior
    probability
  • Install t assignment into model and predict
    observations
  • Check for consistency with actual observations
  • Throw assignment out if inconsistent
  • If n consistent trajectories found, exit loop
  • Report the t-length assignments

18
An algorithm with only 2 problems!
  • Problem 1 Space
  • Representation grows linearly with t
  • Source of the problem is the algorithm step
  • Create full copy of the model for the new time
    step t
  • Approach Develop an approximate representation
  • Problem 2 Time
  • Search space grows exponentially with t
  • Checks an exponential number of obviously wrong
    candidates
  • Source of the problem is the algorithm steps
  • Loop over assignments to t in order of prior
    probability
  • Check for consistency with actual observations
  • Approach Conflict-based algorithm rather than
    generate test

19
Approximate Representations
20
Minimizing Each Time Step
  • Need we distinguish a failure at time t from one
    at t-1?
  • State at t2 is independent of whether V1 stuck
    at t1 or t0
  • State of V1 at t3 depends upon if VDU hung at
    t2 or t3
  • Intuition command transmission allows future
    observations to witness the exact time of a past
    failure

21
Minimizing Each Time Step
  • Static analysis tells us when to avoid
    introducing variables

zero
zero
  • This is proven to be a conservative approximation
  • We are dropping observations
  • Does not discard any consistent trajectories but
    allows some imposters
  • Imposter trajectories should be knocked out by
    future observation stream
  • Future work Add back in observations at critical
    time points
  • Future work Discard observations for search, use
    them in a fast soundness check

22
History Truncation
t7
t3
  • Many assignments to old t remain untried after
    many observations
  • They represent an exponential of increasing
    unlikely trajectories
  • We commit to the n initial trajectories that look
    likely thus far

23
Complete Representation
Complete Model
Conservative Approximation
Gross Approximation
Time
Present
hung
off
on
on
off
vdu
off
tvdu
nom
nom
nom
nom
nom
reset
on
cmdin
open
close
off
cmdout
open
off
close
closed
v1
open
closed
closed
open
closed
tv1
nom
nom
nom
nom
nom
high
zero
Flowv1
zero
open
v2
closed
closed
closed
open
closed
tv2
nom
nom
nom
nom
nom
high
zero
Flowv2
zero
Variables
24
In-Situ Propellant Production
25
ISPP Model Growth
26
Circuit Breaker (CB) Cascade
  • 15 Circuit Breakers, 8 Lights, 1 Power Source
  • Familiar domain, but with many ambiguous
    diagnoses

27
Circuit Breaker Model Growth
28
Search Algorithms
29
Desired Search Properties
  • Assume all failures are equally likely. The
    algorithm uses infinitesimal probabilities. This
    is easier to explain.
  • Let n be the minimal number of failures in a
    consistent trajectory
  • Track all consistent trajectories with n failures
  • Little computation as long as an n-failure
    trajectory is consistent
  • If no trajectories with n failures remain, track
    all with n1 failures
  • Focused search that avoids obviously wrong
    trajectories

30
Focusing on Consistent Trajectories
  • Suppose we consider the all nominal trajectory
    when Flowv1,2 zero
  • We discover the following is inconsistent with
    the model
  • Flowv1 zero ? tvdu,0nom ? tv1,1nom ? tv1,0nom
  • No consistent trajectory contains tvdu,0nom,
    tv1,1nom, tv1,0nom
  • tvdu,0nom, tv1,1nom, tv1,0nom is a nogood
    or conflict

31
Conflict Coverage
  • Suppose we start with the following nogoods
  • tvdu,0nom, tv1,1nom, tv1,0nom,
    tvdu,0nom, tv2,1nom, tv2,0nom
  • Assign n of the t to failure values, covering all
    nogoods
  • NP-hard hitting set problem
  • Equivalent to finding all n-failure trajectories
  • At each time t, extend trajectories without
    adding failures
  • Check trajectories for consistency (generating
    more nogoods)
  • If any n-failure trajectories remain consistent
  • If all n-failure trajectories become
    inconsistent, we know we need at least an
    (n1)-failure trajectory

32
Initial Experiments
  • Conflict coverage run on ISPP and circuit breaker
    models
  • ISPP runs were 30 steps
  • Circuit breaker runs were 620 steps
  • Multiple runs with multiple failure injections
    per run
  • Representation parameters
  • Full model for 1 time step
  • No history truncation
  • Conservative approximation portion allowed to
    grow unboundedly

33
Single Diagnosis Results
  • History has little performance impact for a
    single diagnosis
  • Rank is low, conflict set is small. Hitting set
    problem is easy.
  • nogood length does not seem to have a big impact
    in this case

CPU time used per time step, for 620 step CB
simulation with failure on step 519 Apologies for
this graph
34
Failure Independence
  • Time to diagnose a failure at time t often
    independent of any failures diagnosed at previous
    time steps
  • Horrible ISPP valve failure followed by a simple
    heater failure

35
Failure Interaction
  • Time to diagnose a failure at time t can be
    increased by failures diagnosed at previous time
    steps
  • Determining factor seems fair
  • May be mitigated by history cutoff, simple
    algorithm changes
  • A simple, repairable circuit breaker failure,
    occurring 40 times.

36
Failure Independence
  • Imagine diagnosing horrible valve failure (V),
    then encountering the heater failure (H)

nogood set
0. Rule out nominal trajectory, introducing
nogoods 1. Generate hitting set, yielding V
candidates 2. Check consistency of V candidates,
introducing more nogoods 3. Occurrence of H
rules out V trajectories, generating more
nogoods 4. Generate hitting set, yielding (VH)
candidates
  • Checking consistency of V hitting set introduces
    more nogoods
  • These nogoods memoize the previous hitting set
    consistency computation
  • Recomputing hitting set for V takes near-zero
    time, returns no inconsistent candidates
  • Performance suggests hitting set for H is solved
    independently

37
Action Selection
38
The Safe Planning Problem
  • Desire Respect uncertainty when acting

System Evolution
time
  • A plan is conformant if all states reach the goal
  • Safe planning generalizes conformant planning
  • A plan is safe if some states reach the goal, and
    no safety constraints are violated regardless
  • If we dont reach the goal, we safely get
    information

39
Conformant Techniques
Start state 1
Start state 2
Start state 3
Start state 4
  • Start states represented as explicit disjunction
    or BDD
  • Conformant graphplan (Smith Weld, AAAI98)
  • Conformant planning via symbolic model checking
    (Cimatti Roveri, ECP99)
  • Planner considers the result of an action in all
    states
  • Can we use our bag of tricks to do better?

40
Conflict-based Repair
BlackBox (Kautz Selman, IJCAI 99)
BlackBox is a fast planner for generating
plans from a known initial state
Graphplan/SAT translator
Domain Action Model
SAT Solver
Plan
41
Summary
42
Progress in State Identification
  • Formulation of the trajectory tracking problem
  • Conditions that equate trajectory tracking
    state tracking
  • Identification as specialization of POMDP
  • Identification as a generalization of model-based
    diagnosis
  • Formulation of the transition system
    representation
  • Development of scalable approximations
  • Conservative (completeness) proof
  • Development of conflict coverage search
  • Soundness check approximations (in progress)
  • Implementation and experimental results

43
Progress in Action Selection
  • Formulation of the safe planning problem
  • Conflict-based approach to safe or conformant
    planning
  • Safe planning algorithms (in progress)

44
Proposed Future Work
  • State Identification
  • Soundness check on diagnoses generated from
    approximation
  • Re-introduction of most relevant observation
    variables
  • Finite horizon experiments
  • Interleave coverage generation and consistency
    checking
  • Action Selection
  • Algorithms for safe planning
  • First approach will be mapping to SATPLAN
  • Algorithm implementations
  • Experimental results in safe conformant
    planning

45
Related Work
  • State identification
  • Livingstone (Williams and Nayak, AAAI96)
  • Belief state approximation (Boyen Koller,
    UAI98)
  • Conflict-directed search (deKleer Williams AIJ
    Vol 32)
  • Planning
  • Conformant graphplan (Smith Weld, AAAI98)
  • Conformant planning via symbolic model checking
    (Cimatti Roveri, ECP99)
  • Blackbox (Kautz Selman, IJCAI 99)
  • General
  • Infinitesimals (Goldszmidt Pearl, KR92)
  • POMDP (Sondik 1971, PhD, Cheng 1988, PhD,
    Littman, Cassandra, Kaelbling, ML95)

46
Acknowledgements
  • U.S. taxpayers fund this work
  • NASA supports this line of research and provides
    flight opportunities
  • Brian Williams Pandu Nayak developed
    Livingstone (AAAI 96)
  • My committee, Dave Smith, Daniel J. Clancy,
    Shirley Pepke and anonymous reviewers provided
    valuable input
Write a Comment
User Comments (0)
About PowerShow.com