Action Rules Discovery /Lecture I/ - PowerPoint PPT Presentation

1 / 22
About This Presentation
Title:

Action Rules Discovery /Lecture I/

Description:

Action Rules Discovery /Lecture I/ by Zbigniew W. Ras UNC-Charlotte, USA – PowerPoint PPT presentation

Number of Views:151
Avg rating:3.0/5.0
Slides: 23
Provided by: Zbigni
Category:

less

Transcript and Presenter's Notes

Title: Action Rules Discovery /Lecture I/


1
Action Rules Discovery/Lecture I/
  • by
  • Zbigniew W. Ras
  • UNC-Charlotte, USA

2
Interestingness measure
  • Rule two conditions occur together, with
    some confidence

Data Mining Task For a given dataset D,
interestingness measure ID and threshold c,
find association E such that ID(E) gt c.

Knowledge Engineer defines c
3
Interestingness Function
Two types of Interestingness Measure
Silberschatz and Tuzhilin, 1995 subjective
and objective. Subjective measure user-driven,
domain-dependent. Include unexpectedness
Silberschatz and Tuzhilin, 1995, novelty,
actionability Piatesky-Shapiro Matheus,
1994. Objective measure data-driven and
domain-independent. They evaluate rules based on
statistics and structures of patterns, e.g.,
support, confidence, etc.
4
Objective Interestingness
Basic Measures for ? ? ? Domain
card? Support or Strength card ???
Confidence or Certainty Factor
card???/card? Coverage Factor
card???/card? Leverage card???/n
card?/ncard?/n Lift n ?
card???/card?card?
5
Subjective Interestingness
  • Rule is interesting if it is
  • unexpected, if it contradicts the user belief
    about the domain and therefore surprises the user
  • novel, if to some extent contributes to new
    knowledge
  • actionable, if the user can take an action to
    his/her advantage based on this rule

Unexpectedness Suzuki, 1997 /does not depend
on domain knowledge/ If r A?B1 has a high
confidence and r1 AC?B2 has a high
confidence, then r1 is unexpected.
Padmanabhan Tuzhilin A ? B is unexpected with
respect to the belief ? ? ? on the dataset D if
the following conditions hold B ? ? ? False
B and ? logically contradict each other A ?
? holds on a large subset of D ? A? ? B
holds which means A? ? ??
6
Actionable rules
  • Action rules suggest a way to re-classify
    objects (for instance customers) to a desired
    state.
  • Action rules can be constructed from
    classification rules.
  • To discover action rules it is required that the
    set of conditions (attributes) is partitioned
    into stable and flexible.
  • For example, date of birth is a stable attribute,
    and interest rate on any customer account is a
    flexible attribute (dependable on bank).

The notion of action rules was proposed by Ras
Wieczorkowska, PKDD00. Slowinski at al JETAI,
2004 introduced similar notion called
intervention.
7
Decision table
Action Rules
  • Any information system of the form
  • S (U, AFl ? ASt ? d), where
  • d ? AFl ? ASt is a distinguished attribute
    called decision.
  • ASt - stable attributes, AFl ? d - flexible
  • Action rule Ras Wieczorkowska
  • t(ASt) ? (b1, v1? w1) ? (b2, v2 ? w2) ? ?
    (bp, vp ? wp)(x)
  • ? (d, k1 ? k2)(x), where (?i)(1? i ? p) ?
    (bi ?AFl)

E-Action rule Ras Tsay t(ASt) ? (b1, ? w1)
? (b2, v2 ? w2) ? ? (bp, ? wp)(x) ?
(d, k1 ? k2)(x), where (?i)(1? i ? p) ? (bi
?AFl)
8
Action Rules Discovery (Tsay Ras)
Objects a b c d
x1, x2, x3, x4 0 L
x1, x3 0 L
x2, x4 2 L
x2, x4 1 L
x5, x6 3 L
x7, x8 2 1 H
x7, x8 1 2 H
Objects a b c
x1, x2, x3, x4 0
x1, x3 0
x2, x4 2
x2, x4 1
x5, x6 3
Stable Attribute a, c Flexible Attribute
b Decision Attribute d
a ?
a 0
Table Set of rules R with supporting objects
Objects b c
x1, x3 0
x2, x4 2
x2, x4 1
x5, x6 3
Objects b c
x1, x2, x3, x4
Objects a b c
x7, x8 2 1
x7, x8 1 2
c ?
Objects b
x1, x2, x3, x4
c 1
c ?
c 0
a 2
a ?
T6
Objects b c
x7, x8 1
Objects b
x2, x4 2
x5, x6 3
Objects b
x1, x3
Objects b
x2, x4
Objects b c
x7, x8 1 2
T4
T5
c ?
c 2
Figure of (d, L)-tree T2
T3
Objects b
x7, x8 1
Objects b
x7, x8 1
(T3, T1) (a 2) ? (b, 2?1) ? ( d, L ? H)
(a 2) ? (b, 3?1) ? ( d, L ? H)
T1
T2
Figure of (d, H)-tree T1
9
Application domain Customer Attrition
Facts
  • On average, most US corporations lose half of
    their customers
  • every five years (Rombel, 2001).
  • Longer a customer stays with the organization,
    the more
  • profitable he or she becomes (Pauline, 2000
    Hanseman, 2004).
  • The cost of attracting new customers is five to
    ten times
  • more than retaining existing ones.
  • About 14 to 17 of the accounts are closed for
    reasons
  • that can be controlled like price or service
    (Lunt, 1993).
  • Action
  • Reducing the outflow of the customers by 5
    can double
  • a typical companys profit (Rombel, 2001).

10
Action Rules Discovery
Decision table S (U, AFl ? ASt ?
d). Assumption a1,a2,...,ap ? ASt,
b1,b2,...,bq ? AFl, ai,1?
Dom(ai), bi,1? Dom(bi). Rule r a1,1 ?
a2,1 ? ... ? ap,1 ? b1,1 ? b2,1 ? ... ? bq,1
? d1 stable part
flexible part Question Do we have to consider
pairs of classification rules in order to
construct action rules?
11
Action Rules Discovery
Decision table S (U, AFl ? ASt ?
d). Assumption a1,a2,...,ap ? ASt,
b1,b2,...,bq ? AFl, ai,1?
Dom(ai), bi,1? Dom(bi). Rule r a1,1 ?
a2,1 ? ... ? ap,1 ? b1,1 ? b2,1 ? ... ? bq,1
? d1 stable part
flexible part Action rule rd2 ? d1
associated with r and re-classification task
(d, d2 ? d1) a1,1 ? a2,1 ? ... ? ap,1 ?
(b1, ? b1,1 )? (b2, ? b2,1)? ... ? (bq, ? bq,1)
? (d, d2 ? d1)
12
Action Rules Discovery
Action rule rd2 ? d1 a1,1 ? a2,1 ? ... ?
ap,1 ? (b1, ? b1,1 )? (b2, ? b2,1)? ... ?
(bq, ? bq,1) ? (d, d2 ? d1) Support Sup(rd2 ?
d1) x ? U (a1(x)a1,1) ? (a2(x)a2,1)?...?(a
p(x)ap,1) ? (d(x)d2). /d2-objects which
potentially can be reclassified by rd2 ? d1 to
d1/ Sup(Rd2 ? d1) ?Sup(rd2 ? d1) r ?
R, where R- classification rules extracted
from S. /d2-objects which potentially can be
reclassified by rd2 ? d1 to d1/
13
Action Rules Discovery
Action rule rd2 ? d1 a1,1 ? a2,1 ? ... ?
ap,1 ? (b1, b1,1? b1,1 )? (b2, b2,1? b2,1)?
... ? (bq, ? bq,1) ? (d, d2 ?
d1) Support Sup(rd2 ? d1) x ? U
(b1(x)b1,1) ? (b2(x)b2,1) ? (a1(x)a1,1) ?
(a2(x)a2,1) ?...? (ap(x)ap,1) ? (d(x)d2).
/d2-objects which potentially can be reclassified
by
rd2 ? d1 to d1/
14
Action Rules Discovery
Let Ud2 x ? U d(x)d2. Then Bd2 ? d1
Ud2 - Sup(Rd2 ? d1) is a set of d2-objects
in S which are d1-resistant.
Let Sup(R ? d1) ?Sup(Rd2 ? d1) d2 ?
d1. Then B? d1 U - Sup(R ? d1)
is a set of objects in S which are
d1-resistant (can not be
re-classified to class d1).
15
Action Rules Discovery
Action rules rd2 ? d1, rd2 ? d3 are
p-equivalent (?), if r/bi r'/bi always
holds when r/bi, r'/bi are both defined, for
every bi ? ASt ? AFl. Let x ? Sup(rd2 ? d1).
We say that x positively supports rd2 ? d1
if there is no action rule rd2 ? d3
extracted from S, d3 ? d1, which is
p-equivalent to rd2 ? d1 and x ? Sup( rd2 ?
d3).
16
Action Rules Discovery
Let Sup(Rd2 ? d1) x ? Sup(rd2 ? d1) x
positively supports rd2 ? d1. Confidence Conf
(rd2 ? d1) cardSup(rd2 ?
d1)/cardSup(rd2 ? d1) ? Conf(r). Conf(r
? d1) cardSup(r ? d1)/cardSup(r ?
d1) ? Conf(r).
17
Cost of Action Rule Tzacheva Ras
  • Assumption S (X, A, V) is information
    system, Y ? X.
  • Attribute b ? A is flexible in S and b1,
    b2 ? Vb.
  • By ?S(Y, b1, b2) we mean a number from (0, ?
    which describes the average predicted cost of
    approved action associated with a possible
    re-classification of qualifying objects in Y from
    class b1 to b2. Object
  • x ? Y qualifies for re-classification from b1 to
    b2, if b(x) b1.
  • ?S(Y, b1, b2) ?, if there is no action
    approved which is
  • required for a possible re-classification of
    qualifying objects
  • in Y from class b1 to b2

If Y is uniquely defined, we often write ?S(b1,
b2) instead of ?S(Y, b1, b2).
18
Cost of Action Rule
  • Action rule r
  • (b1, v1? w1) ? (b2, v2? w2) ? ?( bp, vp?
    wp)(x) ?
  • (d, k1? k2)(x)
  • The cost of r in S
  • costS(r) ??S(vi , wi) 1 ? i ? p
  • Action rule r is feasible in S, if costS(r)
    lt?S(k1, k2).
  • For any feasible action rule r, the cost of the
    conditional
  • part of r is lower than the cost of its
    decision part.

19
Cost of Action Rule
  • Assumption Cost of r is too high!
  • r (b1, v1 ? w1) ? ? (bj, vj ? wj) ? ? ( bp,
    vp ? wp)(x) ?
  • (d, k1 ? k2)(x)
  • r1 (bj1, vj1 ? wj1) ? (bj2, vj2 ? wj2) ? ?(
    bjq, vjq ? wjq)(x)
  • ? (bj, vj ? wj)(x)
  • Then, we can compose r with r1 and the same
    replace
  • term (bj, vj ? wj) by term from the left hand
    side of r1
  • (b1, v1 ? w1) ? ? (bj1, vj1 ? wj1) ? (bj2,
    vj2 ? wj2) ? ?
  • ( bjq, vjq ? wjq) ? ?( bp, vp ? wp)(x) ? (d,
    k1 ? k2)(x)

20
Class movability-index
FS - decision attribute ranking positive
integer associated with a decision value
/objects of higher decision attribute ranking
are seen as objects more preferably movable
between decision classes than objects of lower
rank/. Nj i ? N FS(dj) FS(di) ?
0. Class movability-index assigned to Nj,
ind(Nj) ?FS(dj) FS(di) i?Nj
X a b d FS
x1 a1 b1 d3 1
x2 a2 b1 d2 2
x3 a1 b2 d2 2
x4 a3 b2 d1 3
21
Class movability-index
Let Pj(i) Sup(rdj ?di) /Pj(i) all
objects in U which can be reclassified from the
decision class dj to the decision class
di Pj(N) ?Pj(i) i ? N, i?j, for any N
?1,2,,k where d1,d2,,dk are all decision
classes. Class movability-index (m-index)
assigned to dj-object x indS(x)
maxind(Nj) Nj ?1,2,,k ? x ?Pj(N)
22
Questions?
Thank You
Write a Comment
User Comments (0)
About PowerShow.com