Recent Advanced in Causal Modelling Using Directed Graphs - PowerPoint PPT Presentation

1 / 43
About This Presentation
Title:

Recent Advanced in Causal Modelling Using Directed Graphs

Description:

Can we predict crime rates from abortion rates 20 years ago. Causal Questions: ... Overview of Search Methods. Constraint Based Searches. TETRAD. Scoring Searches ... – PowerPoint PPT presentation

Number of Views:69
Avg rating:3.0/5.0
Slides: 44
Provided by: christop241
Category:

less

Transcript and Presenter's Notes

Title: Recent Advanced in Causal Modelling Using Directed Graphs


1
Automatic Causal Discovery
Richard Scheines Peter Spirtes, Clark Glymour,
and many others Dept. of Philosophy CALD
Carnegie Mellon
2
Outline
  • Motivation
  • Representation
  • Discovery
  • TETRAD vs. Regression

3
Tetrad 4 Demo
  • Login
  • philoso
  • guest
  • www.phil.cmu.edu/projects/tetrad_download/
  • Launch Tetrad

4
1. Motivation
  • Non-experimental Evidence
  • Typical Predictive Questions
  • Can we predict aggressiveness from the amount of
    violent TV watched
  • Can we predict crime rates from abortion rates
    20 years ago
  • Causal Questions
  • Does watching violent TV cause Aggression?
  • Does abortion reduce crime?

5
Causal Estimation
When and how can we use non-experimental data to
tell us about the effect of an intervention?
  • Manipulated Probability P(Y X set x, Zz)
  • from
  • Unmanipulated Probability P(Y X x, Zz)

6
Conditioning vs. Intervening
P(Y X x1) vs. P(Y X set x1)Teeth Slides
7
2. Representation
  • Representing causal structure, and modeling
    interventions
  • Statistical Causal Models
  • Bayes Networks
  • Structural Equation Models

8
Causation Association
X and Y are associated iff ?x1 ? x2 P(Y X
x1) ? P(Y X x2)
  • X is a cause of Y iff
  • ?x1 ? x2 P(Y X set x1) ? P(Y X set x2)

9
Direct Causation
  • X is a direct cause of Y relative to S, iff
  • ?z,x1 ? x2 P(Y X set x1 , Z set z)
  • ? P(Y X set x2 , Z set z)
  • where Z S - X,Y

10
Causal Graphs
  • Causal Graph G V,E
  • Each edge X ? Y represents a direct causal
    claim
  • X is a direct cause of Y relative to V

11
Modeling Ideal Interventions
  • Ideal Interventions (on a variable X)
  • Completely determine the value or distribution of
    a variable X
  • Directly Target only X
  • (no fat hand)
  • E.g., Variables Confidence, Athletic Performance
  • Intervention 1 hypnosis for confidence
  • Intervention 2 anti-anxiety drug (also muscle
    relaxer)

12
Modeling Ideal Interventions
Interventions on the Effect
Pre-experimental System
Post
13
Modeling Ideal Interventions
Interventions on the Cause
Pre-experimental System
Post
14
Interventions Causal Graphs
  • Model an ideal intervention by adding an
    intervention variable outside the original
    system
  • Erase all arrows pointing into the variable
    intervened upon

Intervene to change Inf Post-intervention graph?
Pre-intervention graph
15
Bayes Networks
The Joint Distribution Factors According to the
Graph, i.e., for all X in V P(V)
?P(XParents(X))
P(S,YF,LC) P(S) P(YF S) P(LC S)
16
Bayes Networks
P(S,YF,LC) P(S) P(YF S) P(LC S)
P(S 0) .7 P(S 1) .3 P(YF 0
S 0) .99 P(LC 0 S 0) .95 P(YF 1
S 0) .01 P(LC 1 S 0) .05 P(YF 0
S 1) .20 P(LC 0 S 1) .80 P(YF 1
S 1) .80 P(LC 1 S 1)
.20 P(S1,YF1,LC1) P(S1)P(YF1S1)P(LC1S
1) .3
.80 .20 .048
17
Causal Bayes Networks
The Joint Distribution Factors According to the
Causal Graph, i.e., for all X in V P(V)
?P(XImmediate Causes of(X))
  • P(S 0) .7
  • P(S 1) .3
  • P(YF 0 S 0) .99 P(LC 0 S 0) .95
  • P(YF 1 S 0) .01 P(LC 1 S 0) .05
  • P(YF 0 S 1) .20 P(LC 0 S 1) .80
  • P(YF 1 S 1) .80 P(LC 1 S 1) .20

P(S,Y,F) P(S) P(YF S) P(LC S)
18
Structural Equation Models
Causal Graph
Statistical Model
  • 1. Structural Equations
  • 2. Statistical Constraints

19
Structural Equation Models
Causal Graph
  • Structural Equations
  • One Equation for each variable V in the
    graph
  • V f(parents(V), errorV)
  • for SEM (linear regression) f is a linear
    function
  • Statistical Constraints
  • Joint Distribution over the Error terms

20
Structural Equation Models
Causal Graph
  • Equations
  • Education ?ed
  • Income ????Education????income
  • Longevity ????Education????Longevity
  • Statistical Constraints
  • (?ed, ?Income,?Income ) N(0,?2)
  • ?????????2?diagonal
  • - no variance is zero

SEM Graph (path diagram)
Path Diagram
21
Tetrad 4 Demo
  • www.phil.cmu.edu/projects/tetrad_download/
  • Launch Tetrad
  • 1. Build a Causal Graph
  • 2. Parameterize it as Bayes net
  • 3. Parameterize it as a SEM
  • 4. Generate Pseudo-random data from each

22
The Markov Condition
  • Causal
  • Structure

Statistical Predictions
Markov Condition
Independence X __ Z Y i.e., P(X Y) P(X
Y, Z)
Causal Graphs
23
Causal Structure ? Statistical Data
24
Causal DiscoveryStatistical Data ? Causal
Structure
25
D-separation Equivalence
  • D-separation Equivalence Theorem (Verma and
    Pearl, 1988)
  • Two acyclic graphs over the same set of variables
    are d-separation equivalent iff they have
  • the same adjacencies
  • the same unshielded colliders

26
Representations ofD-separation Equivalence
Classes
  • We want the representations to
  • Characterize the Independence Relations Entailed
    by the Equivalence Class
  • Represent causal features that are shared by
    every member of the equivalence class

27
Patterns PAGs
  • Patterns (Verma and Pearl, 1990) graphical
    representation of an acyclic d-separation
    equivalence - no latent variables.
  • PAGs (Richardson 1994) graphical representation
    of an equivalence class including latent variable
    models and sample selection bias that are
    d-separation equivalent over a set of measured
    variables X

28
Patterns
29
Patterns What the Edges Mean
30
Patterns
31
Patterns
32
Patterns
Not all boolean combinations of orientations of
unoriented pattern adjacencies occur in the
equivalence class.
33
PAGs Partial Ancestral Graphs
What PAG edges mean.
34
PAGs Partial Ancestral Graph
35
Overview of Search Methods
  • Constraint Based Searches
  • TETRAD
  • Scoring Searches
  • Scores BIC, AIC, etc.
  • Search Hill Climb, Genetic Alg., Simulated
    Annealing
  • Very difficult to extend to latent variable
    models
  • Heckerman, Meek and Cooper (1999). A Bayesian
    Approach to Causal Discovery chp. 4 in
    Computation, Causation, and Discovery, ed. by
    Glymour and Cooper, MIT Press, pp. 141-166

36
Tetrad 4 Demo
  • www.phil.cmu.edu/projects/tetrad_download/
  • 1. Apply search to Causal Graph
  • 2. Include latent variables
  • 3. Apply search to your own data
  • 4. Generate data - send to partner

37
4. Problems with Using Regession for Causal
Inference
38
Regression to estimate Causal Influence
  • Let V X,Y,T, where
  • - Y measured outcome
  • - measured regressors X X1, X2, , Xn-
    latent common causes of pairs in X U Y T T1,
    , Tk
  • Let the true causal model over V be a Structural
    Equation Model in which each V ? V is a linear
    combination of its direct causes and independent,
    Gaussian noise.

39
Regression to estimate Causal Influence
  • Consider the regression equation
  • Y b0 b1X1 b2X2 ..bnXn
  • Let the OLS regression estimate bi be the
    estimated causal influence of Xi on Y.
  • That is, holding X/Xi experimentally constant, bi
    is an estimate of the change in E(Y) that
    results from an intervention that changes Xi by 1
    unit.
  • Let the real Causal Influence Xi ? Y bi
  • When is the OLS estimate bi an unbiased estimate
    of bi ?

40
Regression Example
? 0 ?
b1
b2
? 0 X
b3
? 0 X
X2
X1
X3
PAG
Y
41
Regression Bias
  • If
  • Xi is d-separated from Y conditional on X/Xi in
    the true graph after removing Xi ? Y, and
  • X contains no descendant of Y, then
  • bi is an unbiased estimate of bi
  • See Using Path Diagrams .

42
Tetrad 4 Demo
  • www.phil.cmu.edu/projects/tetrad_download/
  • 1. Build Causal Graph among X1,X2,X3,Y (with
    latent variables)
  • 2. Build SEM - generate psuedo-random data
    (N10,000), save to desktop.
  • 3. Apply FCI
  • 4. Use Minitab to do a regression

43
References
  • Causation, Prediction, and Search, 2nd Edition,
    (2000), by P. Spirtes, C. Glymour, and R.
    Scheines ( MIT Press)
  • Causality Models, Reasoning, and Inference,
    (2000), Judea Pearl, Cambridge Univ. Press
  • Computation, Causation, Discovery (1999),
    edited by C. Glymour and G. Cooper, MIT Press
  • Causality in Crisis?, (1997) V. McKim and S.
    Turner (eds.), Univ. of Notre Dame Press.
  • TETRAD IV www.phil.cmu.edu/projects/tetrad
  • Web Course on Causal and Statistical Reasoning
    www.phil.cmu.edu/projects/csr/
Write a Comment
User Comments (0)
About PowerShow.com