Quantifying Statistical Control: the Threshold of Theoretical Randomization - PowerPoint PPT Presentation

1 / 35
About This Presentation
Title:

Quantifying Statistical Control: the Threshold of Theoretical Randomization

Description:

Can predict an effective n given an impact in a quasi-experiment ... Quasi-experiment could be more precise, but possibly biased ... – PowerPoint PPT presentation

Number of Views:125
Avg rating:3.0/5.0
Slides: 36
Provided by: CSG59
Category:

less

Transcript and Presenter's Notes

Title: Quantifying Statistical Control: the Threshold of Theoretical Randomization


1
Quantifying Statistical Control the Threshold of
Theoretical Randomization Kenneth A. Frank Minh
Duong Spiro Maroulis Michigan State
University Ben Kelcey University of
Michigan Presented at Groningen May 21 2008
2
Focal Example The Effect of Kindergarten
Retention on Reading and Math Achievement(Hong
and Raudenbush 2005)
  • 1. What is the average effect of kindergarten
    retention policy? (Example used here)
  • Should we expect to see a change in childrens
    average learning outcomes if a school changes its
    retention policy?
  • Propensity based questions (not explored here)
  • 2. What is the average impact of a schools
    retention policy on children who would be
    promoted if the policy were adopted?
  • Use principal stratification (Frangakis and
    Rubin 2002).
  • 3. What is the effect of kindergarten retention
    on those who are retained?
  • How much more or less kindergarten retainees
    would have learned, on average, had they been
    promoted to the first grade rather than retained.

3
Data
  • Early Childhood Longitudinal Study Kindergarten
    cohort (ECLSK)
  • US National Center for Education Statistics
    (NCES).
  • Nationally representative
  • Kindergarten and 1st grade
  • observed Fall 1998, Spring 1998, Spring 1999
  • Student
  • background and educational experiences
  • Math and reading achievement (dependent variable)
  • experience in class
  • Parenting information and style
  • Teacher assessment of student
  • School conditions
  • Analytic sample (1,080 schools that do retain
    some children)
  • 471 kindergarten retainees
  • 10,255 promoted students

4
Effect of Retention on Reading Scores(Hong and
Raudenbush)
5
Possible Confounding Variables
  • Gender
  • Two Parent Household
  • Poverty
  • Mothers level of Education (especially relevant
    for reading achievement)

6
What is the Impact of a Confounding Variable on
an Inference for a Regression Coefficient?(Frank,
K. 2000. Impact of a Confounding Variable on
the Inference of a RegressionCoefficient.
Sociological Methods and Research, 29(2),
147-194.)
7
Impact appears in Partial Correlation
r ty is the sample correlation between the
treatment and the outcome r yv is the sample
correlation between a confound and the outcome r
tv is the sample correlation between a confound
and the treatment Correlation is reduced by the
product of two relevant correlations (values in
denominator can only increase the
partial) Inference for regression coefficient is
same as that for partial correlation
8
Impacts of Covariates on Correlation between
Retention and Reading Achievement
Component Correlations
covariate
impact with with
achievement retention Mothers Education
-0.0122 0.189 -0.064 Female
-0.0054 0.102 -0.053 Two parent
-0.0025 0.086 -0.025 poverty
-0.0080 0.135 -0.059
Negative impact would reduce the magnitude of the
coefficient for retention
9
Covariates and Absorbers (dependent variable
Reading in Spring 1999)
  • Covariates
  • Mothers education
  • Poverty
  • Gender
  • Two parent home
  • References
  • Hong and Raudenbush Shepard Coleman
  • Absorbers
  • Schools as fixed effects
  • Pre-test Spring 1998
  • Growth trajectory Fall 1998-Spring1998
  • References Shadish et al Heckman and Hotz
    (1988 JASA)

10
Extent to which Pre-test Absorbs the Impacts of
Covariates on Inference Regarding Effect of
Retention on Reading Achievement
Controlling for pre-test absorbs 87 of the
impact of Mothers Education once controlling
for pre-test there is less of a need to control
for mothers education
11
Capacity of Controls to Absorb the Impacts of
Covariates
12
Effect of Retention on Achievement After Adding
each Covariate
n10,065, R2 .40 Note 1 years growth is about
10 points, so retention effect gt 1 year growth
13
Randomization as the Gold Standard
  • Randomization preferred
  • Works in long run What is long run?
  • Relationship between n and impact in theoretical
    randomized experiment
  • Alternative Silver Standard
  • Quantify statistical control in a quasi-experiment

14
Need for Simulation?Predicting Mean Impact Using
Wei Pans Approximation(UGLY!)
?tv correlation between treatment and
confound ? yv correlation between outcome and
confound s, a, b coefficients to obtain
approximation
Pan, W., and Frank, K.A., 2004. A probability
index of the robustness of a causal inference,
Journal of Educational and Behavioral Statistics,
28, 315-337.
15
Pans Approximation(UGLY!) But Works
Simulate mean impact n (20,100,1000) ?tv,
?ty, ?vy (.1, .3, .5, .7) Bias of predicted
mean impact (Pan 2003) across simulations is
.00094 with standard deviation of .00071 We have
a function for the impacts across a range of
conditions
16
What is the Impact of a Confounding Variable in
an Randomized Experiment?
?0 in RCT
17
Predicting Mean Impact Using Wei Pans
Approximation Assuming ?tv0 (No Correlation
between treatment and confound, as in randomized
experiment). Elegant!
Where ?ty correlation between treatment and
outcome ?yv correlation between outcome and
unobserved confound
18
Solving Pans Approximation for n (assuming
randomized experiment)
Allows us to predict effective n of a theoretical
randomized experiment given a mean impact and
hypothetical correlation between outcome and
confound Can predict an effective n given an
impact in a quasi-experiment
19
Predicted Sample Size as a Function of Impact
Of mothers education
20
Quantitative Crosswalk between RCT and
Quasi-experiment
  • Quasi-experiment can achieve same or better level
    of control as randomized experiment
  • Red line Hong and Raudenbush achieve control
    equivalent to randomized experiment of size 200 ?
    better than a small RCT
  • But, with a randomized experiment
  • Guaranteed no bias in long run
  • Confidence interval captures uncertainty
  • Trade off between precision versus bias
  • Quasi-experiment could be more precise, but
    possibly biased
  • Key assumption impacts of measured covariates
    represent impacts of unmeasured covariates.

21
Asymptotics of Randomization
  • Elbow in relationship between n and impact.
  • Imprecise prediction for small impact (where we
    care the most)
  • Leverage the shape by defining a single threshold
    (first derivative-25/.001-25000). 25 change in
    n for .001 change in impact

22
Aymptotics of Precision for Randomization Across
Levels of Correlation between Outcome and the
Treatment (?yt) and Outcome and a Confound (?yv)
23
Interpretations
  • Cut offs appear reasonable on the way to
    asymptotic land
  • More affected by treatment effect (can be
    estimated) than by relationship between outcome
    and unobserved confound (unknown). Good.

24
Discussion
  • Characterize control in terms of impact
  • Theoretical randomized experiment as gold
    standard
  • Departure from Cook, who used actual experiments
  • Quasi-experiments (legitimacy)
  • Can equate to theoretical experiment
  • Obtain effective n
  • Use effective n as weight in meta-analysis
  • Cross threshold?
  • Procedure
  • Establish impact of good covariates
  • Establish absorption due to pre-test, etc
  • Equate to randomized experiment

25
What must be the Impact of an Unmeasured
Confounding Variable Invalidate the Inference?
  • Step 1 Establish Correlation Between Retention
    and Score
  • Step 2 Define a Threshold for Inference
  • Step 3 Calculate the Threshold for the Impact
    Necessary to Invalidate the Inference
  • Step 4 Multivariate Extension, with measured
    Covariates

26
Step 1 Establish Correlation Between Retention
and Score
t taken from regression, -26.00 n is the sample
size q is the number of parameters
estimated N-q-19012
27
Step 2 Define a Threshold for Inference
  • Define r as the value of r that is just
    statistically significant

n is the sample size q is the number of
parameters estimated tcritical is the critical
value of the t-distribution for making an
inference
r can also be defined in terms of effect sizes
28
Step 3 Calculate the Threshold for the Impact
Necessary to Invalidate the Inference
Define the impact k rxcv x rycv and assume
rxcv rycv (which maximizes the impact of the
confounding variable Frank 2000).
Set rxycv r and solve for k to find the
threshold for the impact of a confounding
variable (TICV).
impact of an unmeasured confound gt .25 ?
inference invalid
29
Calculations made easy!
  • http//www.msu.edu/kenfrank/papers/calculating20
    indices203.xls

30
Step 4 Multivariate Extension, with Covariates
krx cvz ry cvz Maximizing the impact
with covariates z in the model implies
.21
And
31
Multivariate Calculations
  • http//www.msu.edu/kenfrank/papers/calculating20
    indices203.xls

32
What must be the Impact of an Unmeasured Confound
to Invalidate the Inference?
  • If k gt .25 (or .21 without covariates) then the
    inference is invalid.
  • Maximum for multivariate model occurs when
  • r x cv .46 and ry cv, .45.
  • Furthermore, correlations of unobserved confound
    must be partialled for covariates z.
  • The magnitude of the impact of mothers education
    (strongest measured covariate) .0015
  • ?Impact of unmeasured confound would have to be
    more than 100 times greater than the impact of
    mothers education to invalidate the inference.
    Hmmm.

33
Extensions
  • Logistic Regression
  • See Imbens, Guido Sensitivity to Exogeneity
    Assumptions in Program Evaluation Recent
    Advances in Econometric Methdology (126-132,
    especially 128)
  • David J. Harding. 2003. "Counterfactual Models of
    Neighborhood Effects The Effect of Neighborhood
    Poverty on Dropping Out and Teenage Pregnancy."
    American Journal of Sociology 109(3) 676-719.
  • Logistic regression (Ben Kelcey at U of M)
  • Use weighted least squares
  • Use odds ratios
  • Multilevels
  • Seltzer and Frank (AERA 2007)
  • Multiple thresholds
  • Statistical significance simply redefine H0 ?0.
  • Point estimates define impact necessary to
    reduce coefficient below a series of thresholds,
    each one representing a separate decision.
    Half-way between Bayesian and Frequentist

34
Actual Randomized Experiment
  • Effect of Technology on Teaching
  • Strong Methods
  • Randomization
  • Still some Confounding

35
Relationship Between background Characteristics
and Treatment Assignment in a Randomized Study of
the Effect of Technology on Achievement
Write a Comment
User Comments (0)
About PowerShow.com