Addressing Confounding Errors When Using NonExperimental, Observational Data to Make Causal Inferenc - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

Addressing Confounding Errors When Using NonExperimental, Observational Data to Make Causal Inferenc

Description:

Addressing Confounding Errors When Using Non-Experimental, Observational Data to ... of the cases in which social epidemiologists are interested, it is not clear how ... – PowerPoint PPT presentation

Number of Views:81
Avg rating:3.0/5.0
Slides: 30
Provided by: pamelajojo
Category:

less

Transcript and Presenter's Notes

Title: Addressing Confounding Errors When Using NonExperimental, Observational Data to Make Causal Inferenc


1
Addressing Confounding Errors When Using
Non-Experimental, Observational Data to Make
Causal InferencesERROR06 Conference
  • Pamela Jo Johnson Andrew Ward
  • University of Minnesota

2
Introduction
  • The focus of the presentation is on the problem
    of confounding bias in non-experimental,
    observational studies in social epidemiology
  • J. Michael Oakes and Jay S. Kaufman define Social
    Epidemiology as the study of how a societys
    innumerable social arrangements, past and
    present, yield differential exposures and thus
    differences in health outcomes among the persons
    who comprise the population.
  • We believe that warranted causal inferences using
    non-experimental, observational data requires
    careful and complete conceptual analysis for
    this reason, there is much that philosophical
    approaches can contribute.

3
Non-Experimental Studies
  • Many social phenomena are not amenable to
    experimental investigation ethical and
    complexity issues
  • Non-experimental studies may have random
    selection/sampling from a target population
    e.g., NHIS which is a four-panel, stratified,
    multistage, cross-sectional household interview
    survey in which sampling and surveys are
    continuous throughout the year.
  • Non-experimental studies lack random assignment
    the units being investigated (e.g., people) do
    not each have a known probability of being
    assigned to a (manipulable) treatment/exposure
    whose effect we want to investigate
  • In contrast, random assignment (assuming ideal
    experimental conditions following the random
    assignment) can yield unbiased estimates of the
    average treatment effect

4
A Central Problem for Non-Experimental,
Observational Studies
  • One of the central problems for non-
    experimental, observational studies is to
    control, or somehow take account of the possible
    bias that occurs because there is no random
    assignment of units (e.g., people) in a target
    population to treatments/exposures.
  • What can we do to come closer to the goal of all
    epidemiological studies an accurate estimation
    of the true effect of a treatment/ exposure in
    a target population?

5
An Approach to the Problem
  • Counterfactual Framework
  • Causal Contrasts
  • Propensity Score Matching
  • Instrumental Variables

6
Counterfactual Framework
  • Counterfactual framework
  • Framework for thinking about cause and effect
  • Potential outcomes model (Potential outcomes of
    differential exposures)
  • Compare the potential outcomes that would occur
    under different levels of exposure for the same
    unit (e.g. person)
  • Neyman-Rubin (Fisher, Holland, etc.) Model
  • Consider two variables Yt(u) and Yc(u)
  • Yt(u) the value of the response (Y) if the unit
    (u) were exposed to t Yc(u) the value of the
    response if the same unit were exposed to c
  • If we could simultaneously observe Yt(u) and
    Yc(u), then the Causal Contrast, Yt(u) - Yc(u),
    would tell us how much Y changed for unit u if
    treatment/exposure t was used instead of
    treatment/exposure c

7
Fundamental Problem of Causal Inference - 1
  • Given the Causal Contrast Yt(u) - Yc(u), Paul
  • Holland (1986) writes
  • Fundamental Problem of Causal Inference. It is
  • impossible to observe the value of Yt(u) and
    Yc(u)
  • on the same unit and, therefore, it is impossible
    to
  • observe the effect of t on u.
  • Put a bit differently (following Pearl, 2003),
    whereas association has to do with static
    relationships (joint distribution of observed
    variable values) causation has to do with the
    effect of changing variable values more
    specifically, changing the variable values for
    the same units during the same time period
    (hence, the counterfactual character of causal
    inference)

8
Fundamental Problem of Causal Inference - 2
  • When data are non-experimental, they are not the
    result of random assignment, and so the causal
    inference depends on a theory of the way that the
    data were generated, which goes beyond the data
    themselves. (Pearl, 2003)
  • It follows from this that causal inferences using
    only non-experimental data are not directly
    testable without additional assumptions, the
    most that we can directly test for are
    correlations/associations, and not causal
    relationships.
  • Thus, within a counterfactual framework, we need
    to find an observable substitute for the
    counterfactual which is identifiable with, and
    exchangeable for the counterfactual.

9
Causal Contrasts
Figure adapted from Maldonado Greenland.
(2002). Estimating causal effects. Int J
Epidemiol, 31(2), 422-429.
10
Confounding and Confounders
  • In the causal contrast scenario from the previous
    slide, we say that the TARGET population
    experiences exposure distribution 1 (i.e.,
    exposure to poverty) and that the SUBSTITUTE for
    the counterfactual target population experiences
    exposure distribution 0 (i.e., no exposure to
    poverty).
  • CONFOUNDING occurs just in case ELow Poverty/Flow
    Poverty is not identical to (?) ALow Poverty/BLow
    Poverty. When this happens, the substitute is an
    imperfect substitute for the counterfactual
    target population the exchangeability of the
    substitute for the counterfactual target
    population is imperfect because they are not
    identical. Thus, confounding is a property of
    the assignment mechanism. (Greenland, 1990)
  • COUNFOUNDER while confounding occurs because of
    imperfect substitution, a confounder is a
    variable that explains, partly or completely, why
    confounding occurs.

11
The Problem of Confounding
  • We cannot eliminate confounding in
    non-experimental studies by random assignment
    (caveat of natural experiments) In contrast,
    in experimental studies one can make the
    probability of severe confounding as small as
    preferred by increasing the sample size
    (Greenland, 1990)
  • At the same time, within the counterfactual
    framework we need an appropriate observable
    substitute for the counterfactual target
    population that cannot be directly observed
  • Thus, we need a different way to provide some
    assurance that the observable substitute we
    select for the counterfactual target population
    is as closely exchangeable with the
    counterfactual as possible
  • In other words, we need something that permits us
    to mimic random assignment in experimental
    studies
  • This is what leads to a consideration of
    propensity scores

12
What are Propensity Scores?
  • Paul Rosenbaum and Donald Rubin (1983) define a
    propensity score as the conditional probability
    of assignment to a particular treatment given a
    vector of observed covariates.
  • Propensity scores range from 0 to 1 in a
    randomized experiment, an equal probability
    assignment mechanism assigns people to one of two
    (or more) distributions of treatment, so each
    person will have a true propensity score of .5
    In a non-experimental, observational study,
    propensity scores must be estimated.
  • Two assumptions that are made are (1) Stable
    Unit-Treatment Value Assumption There is a
    unique value rti corresponding to unit i and
    treatment t. (2) Strongly Ignorable Treatment
    Assignment the responses, rti, are conditionally
    independent of the treatment assignment, t, given
    the observed covariates, and for each covariate
    the subjects have a positive probability of
    receiving the treatment.

13
Generating Propensity Scores
  • Propensity scores can be estimated using several
    methods, but the most commonly used method is
    logistic regression.
  • The regression uses observable covariates, and
    following Rubin and Thomas (1996) unless a
    variable can be excluded because there is a
    consensus that it is related to outcome or is not
    a proper covariate, it is advisable to include it
    in the propensity score model even if it is not
    statistically significant. Thus, Propensity
    Scores are Covariate -Promiscuous.
  • When logistic regression is used, the observed
    covariates are the predictors and the treatment
    assignment (dummy coded 0No Treatment/exposure,
    1treatment/exposure) is used as the dependent
    variable.
  • The predicted value (probability) is the
    propensity score and each person in the target
    population will end up with a propensity score,
    unless they have missing values on covariates.

14
Propensity Score Overlap - 1
15
Propensity Score Overlap - 2
16
Propensity Scores An Example of NO Overlap
17
General Steps of the Propensity Score Methodology
  • Estimate propensity scores for each causal
    contrast using logistic regression
  • Assess overlap in propensity scores across
    exposure groups
  • Match exposed to unexposed (counterfactual)
    subjects on propensity scores within calipers
    (i.e., a predetermined range of the exposed
    subjects estimated propensity score)
  • Assess covariate balance across exposure groups
    (e.g., using standardized differences in the
    distribution of covariates across the exposure
    groups, where what is wanted is standardized
    differences lt10)
  • Estimate average causal effects from matched
    samples (Average Effect of the Treatment/Exposure
    on the Treated/Exposed)
  • Bootstrap standard errors and confidence intervals

18
Propensity Score Matching
  • Propensity score estimation
  • Use of propensity scores reduces a collection of
    covariates to a single summary measure, which is
    conducive to matching
  • ? Propensity score overlap (if there is no
    overlap the groups are not comparable, and so the
    subjects not exchangeable)
  • If covariates are missing (missing data on the
    propensity score predictors), a propensity score
    cannot be calculated.
  • Matching on propensity scores
  • For example, match two infants with the same
    probability of exposure when in fact one was
    exposed and the other was not (we can match with
    replacement to handle cases of exposed/treated ?
    non-exposed/non-treated)
  • Matching on propensity scores will, in
    expectation, create balance on all covariates
    used to estimate it
  • ? The result is covariate balance across groups
    after matching (the observed concomitants that
    might affect the response are as similar as
    possible in the two groups)

19
Limitations of Propensity Scores - 1
  • Data Source Limitations (limitations of the
    example)
  • Linked Birth/Infant Death data
  • Not collected for research purposes
  • No data on individual/family poverty status
  • No data on whether infant ever left the hospital
  • Thus, good data collection techniques are
    needed
  • Propensity Score Methodology Limitations
  • Common support may induce selection bias (i.e.,
    excluding those with propensity scores on the
    tails for whom there is no overlap)
  • Excluding subjects not having all the observed
    covariates used in forming the propensity scores
    can induce selection bias (Case-wise deletion due
    to missing covariates is NOT unique to Propensity
    Score Estimation)
  • Matching with replacement can result in a single
    unexposed subject matched multiple times

20
Limitations of Propensity Scores - 2
  • In addition to the Methodological Limitations
    already noted, recall two important assumptions
    of propensity score analyses
  • (1) Stable Unit-Treatment Value Assumption
    (SUTVA)
  • (2) Strong Strongly Ignorable Treatment
    Assignment
  • The first assumption amounts to ignoring cases in
    which there are dynamic interaction effects
    between the covariates. Although SUTVA is
    implausible for most of the cases in which social
    epidemiologists are interested, it is not clear
    how to address violations of SUTVA (which seems
    to lead to computational intractability (Little
    and Rubin, 2000)) and little research about this
    has been done (though see (Blume and Durlauf,
    2006) for suggestive ideas)
  • The second assumption means that using propensity
    score matching does not account for bias due to
    UNOBSERVED covariates. However, structural
    equation modeling can be used to supplement
    propensity score matching.

21
Structural Equation Models - 1
  • Once the propensity score matching is done, a
    logistic model is created in which the dependent
    variable, Y, is the outcome of interest (e.g.,
    mortality good health)
  • Thus, what you really have (simplifying for the
    purposes of exposition), post-propensity score
    matching, is the equation
  • In this equation, stands for the vector of
    covariates used in determining the propensity
    scores (including the dummy treatment variable),
    Y is the outcome variable of interest (e.g.,
    mortality), and ? is the random error term.
  • The problem is that using propensity scores
    assumes that the observed covariates used to
    calculate propensity scores are sufficient to
    isolate (identify) the dependent variable in
    the equation. However, there may be unobserved
    confounding variables.

22
Structural Equation Models - 2
  • Although there are several forms the influence of
    unobserved variables could take, suppose we focus
    on the case where there is a single omitted
    common cause of X and Y in the previous
    equation, where the estimate of the effect, Y,
    captures both the effect of X and the effect of
    the unobserved variable Graphically this can be
    represented as

23
Structural Equation Models - 3
  • Failure to take account of the unobserved
    variable will result in a confounding bias (the
    unobserved variable is a confounder).
    Statisticians refer to this type of bias as
    spurious correlation.
  • More specifically, the bias occurs because there
    is an unobserved variable that affects
    treatment/exposure assignment. This will in turn
    affect the creation of the treatment/exposure
    group and the substitute of the counterfactual
    group. This is precisely the kind of bias that
    the use of matched propensity scores was intended
    to eliminate amongst the observed covariates.
    Now, however, it recurs because of the (possible)
    presence of an unobserved covariate.
  • Structural equations (specifically, Instrumental
    Variables Estimation) provides a way of dealing
    with this.

24
Structural Equation Models - 4
  • What we want is an instrument (an instrumental
    variable an IV) that satisfies the following
    requirements
  • (a) The instrumental variable is independent of
    the confounding variable (the unobserved
    variable)
  • (b) The instrumental variable affects the
    assignment into the treatment/exposure group
    versus the non-treatment/non-exposure group in
    other words, the instrumental variable is
    associated with the assignment variable, (we will
    refer to that variable as V).
  • (c) The instrumental variable satisfies (a) and
    (b) without affecting either Yt(u) or Yc(u) that
    is to say, the instrumental variable is
    independent of Y given V and the confounding
    variable.

25
Structural Equation Models - 5
  • We can represent, graphically, the idea of an
    instrumental variable (where V stands for the
    treatment variable, X the set of covariates used
    in the propensity score determination, Y stands
    for the outcome variable, and Z stands for the
    instrumental variable) as

26
Structural Equation Models - 6
  • Suppose (for the sake of simplicity) that there
    is a single instrumental variable, Z, a single
    treatment variable, V, that SUTVA and
    Monotonicity (same direction of effect) hold, and
    that there is a nonzero causal effect of Z on V.
  • Typically, Instrumental Variable Estimation
    (IVE) is used in the case of linear models.
    However, because we used logistic regression to
    regress X (containing V) on Y, and because linear
    IVE produces consistent estimates only if the
    endogenous regression is linear, we need to use
    Non-linear IVE.
  • This requires that the endogenous treatment
    regressor (by assumption, there is only one) is
    replaced in the estimator-defining equation by
    the appropriate non-linear instrument (that is,
    the instrument is the dependent variable).

27
Structural Equation Models - 7
  • Because, by assumption, there is only one
    instrument per treatment variable, V, then the
    model is just (as opposed to over-) identified,
    and the value of (the non-linear, Generalized
    Method of Moments estimate of ) calculated.
    Finally, this permits us to use the estimated
    value to calculate the estimated average effect
    of the treatment/exposure on the treated/exposed.
    (In effect, a non-linear extension of 2SLS
    though matters are more complicated when there
    are multiple instruments)
  • What do instrumental variables show?
  • If well chosen, the instrumental variable
    provides a way of accounting for confounding due
    to unobserved variables. Moreover, using GMM
    estimates in association with IVE permits their
    use with Propensity Score methods that also use
    logistic regression.

28
Structural Equation Models - 8
  • Problems with instrumental variable approach
  • One of the most difficult issues for using
    instrumental variables is the selection of the
    instrument(s) . Following (Moffitt, 2003), we can
    identify four types that have been used in a
    number of different applications
    environmental/ecological, demographic, twin and
    sibling, natural experiments
  • In connection with the first point, if the
    instrumental variable is associated with other
    error terms or with other unmeasured confounders,
    use of the selected instrumental variable could
    increase confounding.
  • IV corrections are large-sample corrections, and
    so standard errors can be large without large
    samples (standard errors can also be large if the
    instrument is weak).
  • The IV approach is more common when the model of
    interest is linear. When the model is non-linear
    (e.g., logistic), estimation can be more
    difficult.

29
Final Conclusions and Remarks
  • Propensity score matching only balances observed
    covariates. It does not directly address the
    case of non-observed covariates that are causally
    relevant. It is for this reason that propensity
    score balancing is combined with instrumental
    variables estimation.
  • Causal inferences using non-experimental,
    observational data depends on a number of
    assumptions this goes back to Pearls
    observation that causal inference depends on a
    theory of the way that the data were generated,
    which goes beyond the data themselves. (Pearl,
    2003)
  • To reiterate a our starting claim, warranted
    causal inferences using non-experimental,
    observational data requires careful and complete
    conceptual analysis for this reason, there is
    much that philosophical approaches can contribute.
Write a Comment
User Comments (0)
About PowerShow.com