Title: Limitations of Matching
1Limitations of Matching
2Why we do matching?
3Confounding
- A distortion of an association between and
exposure and disease brought about by an
extraneous factor or factors. - Occurs when the exposure is associated with the
confounding factor and with the disease.
4Strategies for control of confounding
1. Random allocation of exposure 2.
Restriction 3. Matching - Partial restriction
4. Stratification 5. Modeling
5Strategies for control of confounding
- 1. Random allocation of exposure
- Ideal (adjusts for known as well as unknown
confounders - Rarely possible in observational studies of risk
factors ethical constraints, time, and cost
6Strategies for control of confounding
- 2. Restriction
- The effects of know or potential confounders can
be eliminated by restriction of study subjects - Should be considered for strong by uncommon risk
factors - Can simplify the analysis (fewer variables)
- Reduces the number of potential subject but
increases precision of estimates number of
subjects - Loss of generalization of restricted factors
7Strategies for control of confounding
- 3. Matching - Partial restriction
- Match on disease status
- Matching gains efficiency in case control studies
- Matching in design must be retained in the
analysis - Compare matched vs. unmatched estimates
8Strategies for control of confounding
- 4. Stratification
- Use stratified analysis to evaluate interaction
and control for confounding - No assumptions and straightforward computational
procedures - Stratum specific estimates become imprecise with
numbers in cells becomes small - ALWAYS look at stratification before jumping into
models (logistic, etc)
9Strategies for control of confounding
- 5. Modeling
- After stratification - modeling may be employed
to control confounding and test for interaction - Modeling is attractive when number of variables
is large or when continuous variables can not be
/ should not be categorized - Disadvantage assumptions
10Blood Pressure
Age
11YS a S bXS
Blood Pressure
YNS a NS bXNS
Age
12(No Transcript)
13YS a S bXS
Blood Pressure
YNS a NS bXNS
Age
14Blood Pressure
Age
15 aS - aNS
Blood Pressure
Age
16b (XS - XNS )
Blood Pressure
Age
17How Matching Can Reduce Confounding Bias?
18Blood Pressure
Age
19Blood Pressure
Age
20Factors influencing efficiency of matching in
Bias Reduction
1. The difference of mean of matching factor
between groups.
21Blood Pressure
Age
22Blood Pressure
Age
23Factors influencing efficiency of matching in
Bias Reduction
2. The ratio of the population variances.
24Blood Pressure
Age
25Blood Pressure
Age
26Factors influencing efficiency of matching in
Bias Reduction
3. The size of the control sample from which the
investigator forms a comparison group.
27Blood Pressure
Age
28Blood Pressure
Age
29Blood Pressure
Age
30Blood Pressure
Age
31YS a S b X2S
YNS a NS b X2NS
YS - YNS a S - a NS b (X2S - X2NS)
If XS XNS then b (X2S - X2NS ) may be
unequal to zero.
32Hypotethical target population of 2 million people
Females
Males
Exposed
Exposed
Unexposed
Unexposed
No. cases in 1 year
4,500
50
100
90
Total
900,000
100,000
100,000
900,000
1-year risk
0.005
0.0005
0.001
0.0001
(4500100)/1,000,000
33
Crude risk ratio
(5090)/1,000,000
33Expected results of a matched 1-year cohort study
of 200,000 subjects drawn from the target
population
Females
Males
Exposed
Exposed
Unexposed
Unexposed
No. cases in 1 year
450
45
10
1
Total
90,000
90,000
10,000
10,000
Relative Risk
10
10
(45010)/100,000
10
Crude risk ratio
(451)/100,000
34Expected results of a case-control study matched
on sex when the source of subjects is the same
target population
Females
Males
Exposed
Exposed
Unexposed
Unexposed
4500
50
100
90
Cases (4740)
Controls (4740)
4095
455
19
171
Approximate expected OR
10
10
(4500100)(455171)
Approximate expected Crude OR
5
(409519)(5090)
35Matching may induce selection bias in
case-control studies
36Overmatching
Overmatching includes three forms
- Matching that harms statistical efficiency.
- - e.g case-control matching on a variable
associated with exposure but not disease (not
confounder). - Matching that harms validity.
- - e.g matching on an intermediate variable
between exposure and disease (causal pathway). - Matching that harms cost efficiency.
37Conclusion
Conventional matching is rarely the optimal
stratified design.