Title: Quantitative Risk Assessment Monte Carlo Simulation
1Quantitative Risk Assessment Monte Carlo
Simulation
2What is Monte Carlo Analysis?
- It uses random number generation, rather than
analytic calculations, to combine distributions - It is increasingly popular due to high speed
personal computers
3Risk of infection from drinking water containing
protozoan micro parasites. (e.g. cryptosporidium)
4The model
Viability I
Water source concentration
Treatment Process DR - effectiveness
Volume consumed V
5Variables Conc - Concentration of pathogenic
micro-organisms in untreated (surface) water I -
Fraction of the detected pathogens that is
capable of infection (viability) DR - Removal or
inactivation efficiency of the treatment process,
expressed as its Decimal Reduction factor V
Daily individual consumption of unboiled drinking
water
6Best estimate values
7Variability Uncertainty
Key Qusetion of QRA How do we model the
variability of a given parameter?
8Min, Best Guess, Max
How likely is this value?
3 X 3 X 3 X 3 243 combinations
9How likely is each value?
Probability
What about values in between?
10
4
19
10Use information from sampled values
Is there a mathematical function which matches
the distribution?
Cryptomontecarlowhatif.xls
11Probability Density Function (pdf)
0.16
Probability Density Function (PDF) describes the
relative likelihood that a random variable will
assume a particular value
0.12
0.08
Frequency / Probability
Area under a PDF is always unity
The PDF is alternatively referred to in the
literature as the probability function or the
frequency function
0.04
0.00
0.0
20.0
40.0
60.0
80.0
100.0
Parameter Value
12Cumulative Distribution Function
1.00
0.80
Cumulative Distribution Function (CDF) gives the
probability that the variable will have a value
less than or equal to the selected value
0.60
Probability of Value lt X Axis Value
0.40
CDF is the integral of the corresponding PDF
The CDF is alternatively referred to in the
literature as the distribution function, cumulativ
e frequency function, or the cumulative
probability function.
0.20
0.00
0.0
20.0
40.0
60.0
80.0
100.0
Parameter Value
13Representing Uncertainty
Risk analysis relies on the appropriate use of
probability distributions to accurately
represent the uncertainties of the problem
14Probability Distributions
- Risk analysis relies on the appropriate use of
probability distributions to accurately represent
the uncertainties of the problem - Distributions can be
- Discrete or Continuous
- Bounded or Unbounded
- Parametric or Non-parametric
- Inappropriate use of probability distributions
is a very common failing of risk analysis models
_at_Risk Help file Define Distribution
15Discrete Distributions
May take one of a set of identifiable values
Each of which has a calculable probability of
occurrence Probability of occurrence sometime
called probability mass
Probability Mass Function
Sum of all these values must add up to 1
- Examples
- Binomial Negative Binomial Distributions
- Poisson Distribution
- Geometric Hypergeometric Distributions
- Generalised Discrete Distribution
161.00
e
u
CDF
l
a
0.80
V
s
i
x
A
X
0.60
lt
e
u
l
a
V
0.40
f
o
y
t
i
l
i
b
a
b
0.20
o
r
P
0.00
2
3
4
5
6
7
8
Parameter Value
17Continuous Distributions
Used to represent a variable that can take any
value within a defined range
Probability Density Function
- Examples
- Normal, Lognormal
- Beta, Triangular
- Weibull
18CDF
19Continuous Distributions
- Measurement could be infinitely divisible
- e.g. Time, mass distance
- Also for 'discrete' variables where the gap or
increment between them is insignificant e.g.
project costs - Descriptors
- Mean, variance, standard deviation, coefficient
of variation, confidence interval
20Bounded Unbounded Distributions
- A bounded distribution is confined to lie
between two determined values - Uniform - bounded between a minimum maximum
- Triangular - bounded between a minimum maximum
- Beta - bounded between 0 1
- Binomial - bounded between 0 n
- An unbounded distribution extends from minus
infinity to plus infinity - Truncated Normal Distribution
- Partially bounded
- Removes nonsensical tails
21_at_Risk probability Distributions
22How do the variables vary?
How can we combine the variations? What is the
value that is not exceeded 95 of the time?
23Using Excel to model variation
Cryptosporid with excel probs
24Monte Carlo Simulation
- Each uncertain variable modelled by a probability
distribution rather than a single value - Structure of a quantitative risk assessment model
similar to a deterministic model i.e. with all
the multiplication's, additions etc. - Exception is that probability distributions are
used to describe the variables rather than single
values - Objective is to calculate the combined impact of
the model's various uncertainties in order to
determine a probability distribution of the
possible outcomes
25Monte Carlo Simulation
- Random sampling of each probability distribution
within the model - Produces 1000's or even 10,000's of scenarios
- i.e. Iterations or trials
- Each probability distribution is sampled in a
manner that reproduces the distribution's shape - Distribution of the values calculated for the
model outcome therefore reflects the PROBABILITY
of the values that could occur - Developed in World War II from Atomic Bomb
research - Monte Carlo or Latin Hypercube Sampling
26Methodology
- Use data to inform the choice of input
distributions for model parameters - The choice of input distribution should always be
based on all information (both qualitative and
quantitative) available for a parameter
Data
Expert knowledge
27Methodology
- In selecting a distributional form, the risk
assessor should consider the quality of the
information in the database ask a series of
questions including (but not limited to) - Is there any mechanistic basis for choosing a
distributional family? - Is the shape of the distribution likely to be
dictated by physical or biological properties or
other mechanisms? - Is the variable discrete or continuous?
- What are the bounds of the variable?
- Is the distribution skewed or symmetric?
- If the distribution is thought to be skewed, in
which direction? - What other aspects of the shape of the
distribution are known?
28Methodology
Random Number Generated
Process Repeated Many Times
Distributions Sampled
Output Value Calculated
29Monte Carlo Sampling
1.00
0.80
Random Number Generated
Curve Sampled
0.60
0.40
0.20
Parameter Value
0.00
0.0
20.0
40.0
60.0
80.0
100.0
30Monte Carlo Sampling
1.00
0.80
0.60
Clustering - unless a large number of samples
taken
0.40
0.20
Parameter Value
0.00
0.0
20.0
40.0
60.0
80.0
100.0
31Monte Carlo Sampling
- Random sampling from input distribution
- Random number generated between zero one
- Used to sample the probability distribution
- Entirely random sampling technique
- Sample may fall anywhere within the range of the
input distribution - Randomness of sampling may mean that it will over
under sample from various parts of the
distribution - Cannot be relied upon to replicate the input
distribution's shape unless a very large number
of iterations are performed - Pure randomness of Monte Carlo sampling not
really relevant
32Latin Hypercube Sampling
1.00
Curve stratified into layers
0.80
Each layer only sampled once
0.60
0.40
0.20
Parameter Value
0.00
0.0
20.0
40.0
60.0
80.0
100.0
33Latin Hypercube Sampling
- Sampling method that appears random
- BUT guarantees to reproduce the input
distribution with much greater efficiency than
Monte Carlo sampling - Uses 'stratified sampling without replacement'
- Sampling is forced to recreate the original input
probability distribution - Overcomes clustering problems associated with
Monte Carlo sampling
34Random Number Generator Seeds
- Many algorithms developed to generate a random
number between 0 1 with equal probability
density for all possible values - Start or initial seed value
- Possible to select this seed value
- If model is not changed the same simulation
results can be repeated exactly if same seed
value set - Possible to consider the effects of changing a
distribution on the effects on the model's
output - Hence results due to changes in the model not
as a result of the randomness of the sampling
35Advantages of Monte Carlo Method
- Complex mathematics can be included with no
extra difficulty - Widely accepted technique
- Model behaviour can be investigated with ease
- Changes can be made quickly compared with
previous models
36Advantages of Monte Carlo Method
- Distributions of the model's variables can be
precisely defined - Sensitivities, Correlations other inter -
dependencies can be modelled - Computer does all of the work required
- Commercially available software
- _at_Risk, CrystalBall
- Level of mathematics required to perform a Monte
Carlo Simulation is quite basic - Greater levels of precision achieved by
increasing number of iterations
37_at_RISK Example
38References
- Guiding Principles for Monte Carlo Analysis
(EPA/630/R-97/001) - http//www.epa.gov/ncea/monteabs.htm
- Guidance on Assigning Values to Uncertain
Parameters in Subsurface Contaminant Fate and
Transport ModellingA McMahon, J Heathcote, M
Carey, A Erskine J Barker June 2001 - www.environment-agency.gov.uk/commondata/105385/nc
_99_38_3.pdf - Vose, D. (2000) Risk analysis A Quantitative
Guide. John Wiley Sons.