PRISM - PowerPoint PPT Presentation

About This Presentation
Title:

PRISM

Description:

the system is operational at a given point of time // (100 hours) is at least 0.99. ... The probability that system will go down in 1000 // hours is less than 0.01. ... – PowerPoint PPT presentation

Number of Views:92
Avg rating:3.0/5.0
Slides: 12
Provided by: sci109
Category:
Tags: prism | down | of | system

less

Transcript and Presenter's Notes

Title: PRISM


1
PRISM
  • A Probabilistic Model Checker, Birmingham
  • Supports 3 models
  • 1.Discrete-time Markov chain(DTMC)
  • 2.Markov decision processes(MDP)
  • 3.Continuous-time Markov
    chain(CTMC)

2
  • Supports 2 languages
  • 1.PCTL
  • 2. CSL

3
DTMC
PCLT Model Checker
System description
MDP
Results
CTMC
CSL Model Checker
PCTL formulas
Properties file
CSL formulas
4
Our Model Web server Model
application
application
hardware
hardware
Detect_and_restart
5
System Parameters
Time Unit hour
  • Application failure rate 0.001
  • Hardware failure rate0.0001
  • Time to detect app. failure 1/12
  • Time to detect hardware failure1/30
  • Time to restart app.1/20
  • Time to restart hardware 1/12
  • Probability of unsuccessful restart2

6
Some modeling codes
  • Software and hardware crashes with some rate
  • (s1 1) - gamma_p (s1' 0)
  • (h1 1) - gamma_m (h1' 0)(s1' 0)

7
Continue...
  • Primary Web server module
  • module primary
  • inspect_p_soft (sr1 0)(s1 0)(h1 1)
  • !((s2 1 )(h2 1
    ))(sr2 0)
  • - 1 (sr1' 1)
  • restart_p_soft (sr1 1)(s1 0)
  • - 0.98(s1' 1)(sr1'
    0)
  • 0.02(sr1' 0)
  • inspect_p_hard (hr1 0)(h1 0)
  • - 1(hr1' 1)
  • restart_p_hard (hr1 1)(h1 0)
  • - 1(h1' 1)(hr1'
    0)
  • endmodule

8
Continue...
  • Detect and restart module
  • f0..1 init 0
  • inspect_p_soft (f0) - 12.0(f'1)
  • inspect_p_hard (f0) - 30.0(f'1)
  • inspect_s_soft (f0) - 12.0(f'1)
  • inspect_s_hard (f0) - 30.0(f'1)
  • restart_p_soft (f1) - 20.0(f'0)
  • restart_p_hard (f1) - 12.0(f'0)
  • restart_s_soft (f1) - 20.0(f'0)
  • restart_s_hard (f1) - 12.0(f'0)

9
Properties verified to be valid
  • // (1). Availability for at least 99 of the
    time, the
  • // system is operational on the long run .
  • S0.99(s1 1 h1 1)(s2 1 h2 1)
  • // (2). Instantaneous availability the
    probability that
  • // the system is operational at a given point of
    time
  • // (100 hours) is at least 0.99.
  • P0.99true U100,100 (s1 1 h1 1)(s2 1
    h2 1)
  • // (3). The system will eventually be
    operational.
  • P1true U (s1 1 h1 1)(s2 1 h2 1)
  • // (4). The probability that system will go down
    in 1000
  • // hours is less than 0.01.
  • P
  • // (5). If primary software is down, it will
    eventually
  • // recover
  • s1 0 - P1true U s1 1 h1 1

10
Continue...
  • // (6). The probability that the whole system is
    not
  • // working in the 10,100, 1000, 10000 time unit
    is less
  • // than 0.001
  • P
  • P
  • P
  • P
  • P
  • P
  • // (7). There's at least a 99 chance that the
    system
  • // will stabilize such that it is up to more than
    90 in
  • // the long run.
  • P0.99true U S0.90(s1 1 h1 1)(s2 1
    h2 1)
  • // (8). When the software of one machine is down,
    the
  • // probability that the system will be available
    within 0.6
  • // time units is at least 0.99. This is an
    extension of
  • // property (5).

11
Ongoing research
From Dave Parker_at_cs.bham.ac.uk
  • 1. Problems
  • scalability and state-space explosion
  • 2. Solutions
  • (1).parallel/distributed implementation
  • of the model checking algorithm
  • (2).Exploit high-level properties of system
    such as symmetry and compositionality
Write a Comment
User Comments (0)
About PowerShow.com