Alternatives to Randomized Trials for Estimating Treatment Efficacy (or Harm)

About This Presentation

Title:

Alternatives to Randomized Trials for Estimating Treatment Efficacy (or Harm)

Description:

Title: Urine Testing and Urinary Tract Infections in Febrile Infants: The PROS Febrile Infant Study Author: Thomas B. Newman Last modified by: Tom Newman – PowerPoint PPT presentation

Number of Views:94

Avg rating:3.0/5.0

Slides: 57

Provided by: ThomasB104

Category:

more less

Transcript and Presenter's Notes

Title: Alternatives to Randomized Trials for Estimating Treatment Efficacy (or Harm)

1
Alternatives to Randomized Trials for Estimating
Treatment Efficacy (or Harm)

Thomas B. Newman, MD, MPH
Professor of Epidemiology and Biostatistics and
Pediatrics, UCSF

AltToRcts31Oct06
2
Lecture Outline

Prelude
Announcements
Double gold standard bias
Background
Instrumental variables and natural experiments
Measuring additional unrelated variables to
estimate bias
Propensity scores
Illustration using phototherapy for jaundice

3
Announcements

Handouts Chapter 11, problems, this presentation
Exam question contest questions due Thurs,
11/9/06 by e-mail to TN
Real examples only, recent articles strongly
preferred
Include answer
We may change it
Take-home final will be handed out 11/16,
discussed in class 11/30

4
Double Gold Standard Bias Revisited

Test ultrasound
Disease Intussusception
2 different gold standards.
Air contrast enema
Observation/follow-up
As long is there is no spontaneous resolution the
two gold standards give the same answer
Spontaneous resolution gives BE and F/U

5
Suppose spontaneous resolution occurs

In q cases with U/S
In r cases with U/S
BE will be for q and r
F/U will be for q and r

6
Effect of double gold-standard
7
Alternatives to Randomized Trials for Estimating
Treatment Efficacy (or Harm)

Thomas B. Newman, MD, MPH
Professor of Epidemiology and Biostatistics and
Pediatrics, UCSF

AltToRcts31Oct06
8
Background

Why do RCTs?
Assemble comparable groups (avoid confounding)
Allow blinding (to avoid placebo effect,
cointerventions, and bias in measuring outcome
variable)
Observational studies
May be able to assemble comparable groups or use
statistical adjustment
Wont be blinded

9
Why is it hard to assemble comparable groups
without randomizing?

People who get treated differ from those who
dont
Important differences are with respect risk of
the outcome
Treated people often at higher risk (confounding
by indication for treatment).
Treated people may be at lower risk (selection
bias)

10
Pre-test

Observational studies can never establish
causation. Proof of causation requires
randomized trials.
How many have heard this?
How many agree?

11
Do you believe there is a causal relationship
between

Acetaminophen overdose or mushroom poisoning and
liver failure?
Wearing glasses for refractive errors and
improved vision?
Infiltrate of IV calcium infusion and skin
sloughing?
Receipt of fluids and recovery from dehydration?
Land mine explosions and limb injuries?

12
Post-test

Observational studies can never establish
causation. Proof of causality requires
randomized trials.
How many agree?

13
When is causal inference from observational
studies easy?

Outcomes
not related to indications for treatment
rarely if ever occurs spontaneously
highly localized in time or space
Treatment
well-understood biologically
very rapidly acting

14
When its hard

Outcomes are related to indications or selection
for treatment, are delayed, non specific, or not
well understood
Learning disabilities in children treated with
anticonvulsants
Suicide in users of antidepressants
Mortality after surgery for gastroesophageal
reflux in children

15
Natural Experiments and Instrumental Variables

Find a time or place where receipt of treatment
was unlikely to be related to prognosis
E.g., time-series analyses where something
changed (e.g. new intervention became available)
Instrumental variables (IV) measurable factors
that influence probability of treatment that are
not otherwise associated with outcome

16
Use of large databases

Allows use of (weak) surrogate measures for
actual predictor
Biased towards null
Achieve statistical significance with large
sample size
Algebraically reverse bias towards null (with
various assumptions)

17
Delayed Effects of the Military Draft on
Mortality

Origin of study Agent Orange concern
Design Randomized natural experiment using the
draft lottery
Data source computerized death certificate
registries, CA and PA
Predictor variable of interest military service

Hearst N, Newman TB, Hulley SB. NEJM 1986
314620-24
18
Why not compare outcomes according to the
predictor variable of interest?

Biased comparison those who serve in the
military start out healthier
Healthy warrior effect

19
Delayed Effects of the Military Draft on Mortality

The instrumental variable measured draft lottery
number below cutoff (based on date of birth)
IV associated with predictor variable of
interest, not independently associated with
outcome

20
BUT Having an eligible number was a poor measure
of military service
21
Results
22
RCT as an Instrumental Variable Health effects
of exclusive breast feeding

Cant do RCT of exclusive breast-feeding
Can do RCT of breast-feeding PROMOTION
Assignment to BF promotion group should be
associated with exclusive breast feeding, but not
independently associated with outcome
Need very large sample size
Algebraic correction

23
Promotion of Breastfeeding Intervention Trial
(PROBIT)

Cluster-randomized trial at 31 sites in Belarus
Subjects 17,046 term singleton infants gt2500g
initially breastfed
Intervention WHO/UNICEF Baby Friendly Hospital
Initiative
Outcomes BF _at_ 3,6,9,12 months and allergic,
gastrointestinal and respiratory disease
F/U to 12 months on 16,491 (96.7)

Kramer MS, et al. JAMA 2001285413-20.
24
PROBIT, RQ 1

Does a Baby Friendly Hospital increase
exclusive breastfeeding?
Predictor Group assignment
Outcome Exclusive breast feeding
Intention-to-treat (ITT) analysis is fine
Exclusive BF at 3 months (rounded) 40 vs 5 P lt
0.001

25
Probit RQ2

Does exclusive breastfeeding reduce the risk of
eczema in the infant?
If the only effect of intervention related to
eczema is increasing exclusive BF, then
Predictor Group assignment
Outcome Eczema
ITT analysis biased towards null informative if
study positive
Eczema 3.3 vs 6.3 adjusted OR 0.54 (95 CI
0.31-.95 based on GLIMMIX P 0.03)

26
PROBIT, RQ3

How much does exclusive breastfeeding reduce the
risk of eczema in the infant? (What is the
NNEBF? )
Predictor Group assignment
Outcome Eczema
ITT wont work -- too much misclassification.
(Gives the number needed to be exposed to the
intervention, not the NNEBF.)

Number Needed Exclusively to Breast Feed
27
Algebraic correction

If all of the difference in eczema is due to the
difference in exclusive breast feeding, it can be
shown that the ARR is

28
NNEBF and caveat

Since ARR 8.6, NNEBF to prevent 1 case of
eczema is about 1/.086 12
Caveats
Results are for the effect of breastfeeding in
response to the intervention
Assumes the only effect of the Baby Friendly
Hospital is via difference in exclusive
breastfeeding
Similarly, effects of draft lottery only apply
to those who served as a result of the lottery.

29
Summary/other examples

If variables known NOT to be associated with
outcome are associated with treatment of
interest, consider this approach.
Generalizes to manynatural experiments.
E.g., an intervention is intermittently
available, or only available to certain groups.
-- different outcome by day of the week, etc.

30
More natural experiments

Costs of discontinuity of care increased
laboratory test ordering in patients transferred
to a different team the next morning
Effect of ER Copay rate of appendicitis
perforation unchanged after increase in co-pay.
Aircraft cabin air recirculation and symptoms of
the common cold no difference by type of air
recirculation in aircraft

Lofgren, RO. J Gen Intern Med. 19905501-5
Hsu J, et al. Presented at Bay Area Clinical
Research Symposium 10/17/03 Zitter JN et al.
JAMA 2002288483-6
31
Unrelated variables to estimate bias or
confounding

Measure an outcome that WOULD be affected by
bias, but not by intervention (and see if it is)
Measure a predictor that WOULD cause the same
bias as the predictor of interest (and see if it
does)

32
Observational study of screening sigmoidoscopy

Possible bias patients who agree to
sigmoidoscopy are likely to be different
Solution measure an outcome that would be
similarly affected by bias
Results
Decreased deaths from cancers within the reach of
the sigmoidoscope (OR 0.41)
No effect on deaths from more proximal cancers
(OR 0.96).

Selby et al, NEJM 1992326653-7
33
Effect of British breathalyser crackdown

Abrupt drop in accidents occurring during weekend
nights (when pubs are open)
Measure an outcome that would be affected by
bias accidents during other times
Result No change in accidents occurring during
other hours

See Cook and Campbell Quasi-Experimentation.Bosto
nHoughton Mifflin, p. 219
34
Calcium Channel Blockers (CCB) and AMI

Population based case-control study at Group
Health
Progressive increase in risk of AMI with higher
doses of CCB (P lt0.01)
Concern confounding by indication
Measure a predictor that would cause same bias
beta-blockers
Result progressive decrease in risk associated
with higher doses of beta-blockers (P 0.04)

Psaty et al., JAMA 1995274620-25
35
Suicide Risk in Bipolar Disorder During Treatment
With Lithium and Divalproex

Retrospective cohort study of Kaiser Permanente
and Group Health patients with bipolar disorder
Compared with no treatment, patients treated with
Valproex at 2.1 times suicide risk
Concern confounding by indication
Results Suicides per 1000 person/years
31.3 for treatment with divalproex
15 for no treatment (Plt0.001)
10.8 for Lithium (Plt0.001)
If confounding by indication, expect same bias
for Lithium

Goodwin et al. JAMA. 20032901467-1473
36
Initial Mood Stabilizer Prescription by Year of
Initial Diagnosis
Goodwin et al. JAMA. 20032901467-1473
37
Estimating biases Cautionary Tale

Nurses Health Study
Vitamin E assoc. with decreased risk of CHD (RR
.6)
No significant effect of multiple vitamins
Health Professionals Study
Vitamin E assoc. with decreased risk of CHD (RR
.6)
No significant effect of Vitamin C
TN began taking Vitamin E

N Engl J Med. 19933281444-9 and 1450-6
38
Meta-analysis high-dosage vitamin E
supplementation may increase all-cause mortality
Miller ER et al. Ann Intern Med. 2005 Jan
4142(1)37-46
39
Propensity Scores -1

Big picture want to know if association between
treatment and outcome is CAUSAL
Recall competing explanation confounding by
indication for treatment
Factor must be associated with outcome
Factor must be associated with treatment
Traditional approach adjust for factors
associated with outcome

40
Propensity Scores -2

Alternative approach Create a new variable,
propensity to be treated with the intervention
Then match, stratify, or include it in
multivariable analyses
Advantages
Better power to control for covariables (because
receipt of the intervention may be much more
common than occurrence of the outcome)
You can more easily tell when treated and
untreated groups are not comparable

41
How Much Overlap In The Propensity Scores Do We
Want?
A
B
42
Example Aspirin use and all-cause mortality
among patients being evaluated for known or
suspected Coronary Artery Disease

RQ Does aspirin reduce all-cause mortality in
patients with coronary disease
Design Cohort study
Subjects 6174 consecutive patients getting
stress echocardiograms
Predictor Aspirin use
Outcome All-cause mortality
Crude result 4.5 mortality in each group

Gum PA et al. JAMA 2001 286 1187-94
43
Analysis using Propensity Scores

Two multivariable analyses
Predictors of aspirin use
Predictors of death
Predictors of ASA use turned into a propensity
score
Users and non-users of ASA matched on ASA
propensity score
Compare mortality in matched groups
(Unmatched patients cannot be analyzed)

44
Survival in Propensity-Matched Patients
Recall total N6174
45
Limitations

Can only compare subjects whose propensity scores
overlap
Can only generalize to subjects who could have
received either treatment
Limitations similar to exclusions from clinical
trials
Important variables may be missing from your model

46
Illustration Phototherapy for Neonatal Jaundice

RQ How effective is phototherapy in babies with
Total Serum Bilirubin levels of 20-22.9 mg/dL?
Subjects Newborns at NC-KPMCP 2000 g, 34
wks with TSB 20-22.9 mg/dL at 48 hr (N1777)
Intervention Phototherapy within 8 hours of TSB
20-22.9 mg/dL (N635, 36)
Outcome TSB 25 mg/dL (N21, 1.2)

47
Logistic regression

Phototherapy only OR0.30 (P .05)
Phototherapy gest age OR 0.28 (P0.04)
Phototherapy gest age rate of riseOR 0.12
(P.002)

48
Propensity analysis

Step 1 predictors of PT within 8 hr oif TSB
20-22.9
Rate of rise of TSB, gestational age, race, sex,
maternal age, hospital of birth, etc.
Generate new variable, propensityPT predicted
probability of PT

49
Propensity by whether PT received
50
Logistic Regression With Propensity Score

Phototherapy only OR0.30 (P .05)
Phototherapy gest age OR 0.28 (P0.04)
Phototherapy gest age rate of riseOR 0.12
(P.002)
Logistic Phototherapy propensity scoreOR
0.13 (P.002)
Mantel-Haenszel stratified analysis PT and
propensity score in 4 strata OR 0.14, P0002

51
Efficacy of Phototherapy (PT) for Neonatal
Jaundice

Large interfacility practice variation in use of
phototherapy in the NC KPMCP
Hospital of birth thus an IV for phototherapy use
We can use individual-level data to adjust for
other risk factors for TSB 20 mg/dL

52
Instrumental variable

N too small for IV for TSB 25 mg/dL, so predict
TSB 20 mg/dL
For each hospital, calculate the proportion of
newborns in group C (AAP consider PT group) who
received phototherapy
Use this proportion as a predictor of TSB 20
mg/dL in individual level analyses

53
Group R AAP RECOMMENDS phototherapy Group C
AAP says CONSIDER phototherapy
Atkinson L, Escobar G, Takayama J, Newman TB.
Pediatrics 2003111e555-61
54
Rate of hyperbilirubinemia by PT use in 11
hospitals, 1995-6
55
IV analysis to Predict TSB 20 mg/dL
56
Summary

RCTs are most definitive, but
Not always feasible or necessary
Look for opportunities to answer questions with
observational studies
Instrumental variables
Natural experiments
Propensity scores
Estimating biases

Write a Comment

User Comments (0)

About PowerShow.com

Alternatives to Randomized Trials for Estimating Treatment Efficacy (or Harm) - PowerPoint PPT Presentation

Alternatives to Randomized Trials for Estimating Treatment Efficacy (or Harm)

Title: Urine Testing and Urinary Tract Infections in Febrile Infants: The PROS Febrile Infant Study Author: Thomas B. Newman Last modified by: Tom Newman – PowerPoint PPT presentation