Title: Process Equivalence: Comparing Two Process Models Based on Observed Behavior
1Process EquivalenceComparing Two Process Models
Based on Observed Behavior
- Wil van de Aalst Ana Karla Medeiros Ton Weijters
- Eindhoven University of Technology
- Department of Information Systems
- a.k.medeiros_at_tm.tue.nl
2Outline
- Process Equivalence
- Process Mining
- Metrics for Equivalence Quantification
- Applications
- Conclusions and Future Work
3Process Equivalence Objective
How similar are the behaviors of two process
models?
100? 80? 20? 0?
Our aim Quantify equivalence!
4Process Equivalence Current Approaches
1) Structural comparison
Do the processes have the same structure?
Different structures do not necessarily imply
different behavior!
5Process Equivalence Current Approaches
2) Trace Equivalence
Do the processes generate the same set of traces?
Set of traces must be finite! Does not consider
frequency of traces!
6Process Equivalence Current Approaches
3) Other approaches bisimilarity, branching
bisimilarity, etc.
Yes/no answer!
7Process Equivalence Our Approach
Compare two process models based on example
behavior!
Log
8Process Mining Objective
Log
Mining Techniques
9Process Mining Motivation
- Before deployment
- Objective picture of how the process has been
executed - After deployment
- Feedback mechanism
10Process Mining Evaluation
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
Original Process
Logs
Genetic Miner
Mined Process
How much of the behavior in the log can be
generated by the mined process? How close is the
behavior of the mined process to the original one?
11Process Mining Evaluation
aaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaa
- How much of the behavior in the log can be
generated by the mined process? - Genetic Miner How fit is the mined model?
- Fitness percentage of the log that can be
correctly parsed by the mined model - Continuous semantics parsing
- Problems are registered, but tasks are replayed
even when disabled
12Process Mining Fitness
13Process Mining Fitness
14Process Mining Fitness
15Process Mining Fitness
16Process Mining Fitness
17Process Mining Fitness
?
18Process Mining Fitness (F)
?
?
?
?
F 1.0
19Process Mining Fitness
?
F (40 ) / 160
20Process Mining Fitness
?
?
F (40 85 ) / 160
21Process Mining Fitness
?
?
?
F (40 85 ) / 160
22Process Mining Fitness
?
?
F (40 85 ) / 160
23Process Mining Fitness
?
?
F (40 85 ) / 160
24Process Mining Fitness
?
?
3/4
F (40 85 (153/4) ) / 160
25Process Mining Fitness
?
?
3/4
3/4
F (40 85 (153/4) (203/4)) / 160 0.945
26Process Mining Fitness (F)
F
1.0
0.945
0.828
1.0
27Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
- How much behavior two models (M1 and M2) have in
common with respect to a log (L)? - Main idea
- Compare the set of enabled tasks in both models
while replaying the traces in the log - Consider the frequency of log traces
28Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
M1
M2
29Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
M1
M2
A
A
Intersection Enabled(M1,M2,pos)
A
30Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
M1
M2
B, C, D
B, C, D
Intersection Enabled(M1,M2,pos)
B,C,D
31Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
M1
M2
D
D
Intersection Enabled(M1,M2,pos)
D
32Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
M1
M2
E
E, F
Intersection Enabled(M1,M2,pos)
E
33Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
M1
M2
Intersection Enabled(M1,M2,pos)
34Metrics for Equivalence Quantification
M1
- Behavioral Precision
- M2 with respective to M1, based on the log L
- Behavioral Recall
- M1 with respective to M2, based on the log L
- Frequency of the traces are taken into account!
- Example
- Bp(M1,M2,L) 0.75 and Br(M1,M2,L)1.0
L
M2
35Metrics for Equivalence Quantification
- Behavioral precision (Bp) and recall (Br)
36Applications
- Process Mining
- Assess the quality of the mined models
- Do they really portrait the most frequent
behavior in the log? - Log may be incomplete or contain noise
- Detect over-general models
- Models that can parse
- any log with the tasks
- in the input log
37Applications
- Metrics are implemented in the ProM framework,
available at processmining.org
38Conclusions
- Behavioral precision and recall are useful
metrics to quantify how similar two process
models are with respect to a log - Main advantages
- Frequencies of the log traces are considered
- Log is not required to be exhaustive
- Models do not need to be able to parse all the
behavior in the log - Metrics are implemented in the open source tool
ProM
39Future Work
- Apply the metrics to other scenarios beyond the
process mining scope - If available, extend the metrics to consider
semantic information
www.ip-super.org
40http//www.processmining.org
Thank you! Questions? Comments?