Analysis of matched data; plus, diagnostic testing - PowerPoint PPT Presentation

About This Presentation
Title:

Analysis of matched data; plus, diagnostic testing

Description:

Title: Disordered Eating, Menstrual Irregularity, and Bone Mineral Density in Young Female Runners Author: John Last modified by: Kristin Created Date – PowerPoint PPT presentation

Number of Views:25
Avg rating:3.0/5.0
Slides: 59
Provided by: John61
Learn more at: http://web.stanford.edu
Category:

less

Transcript and Presenter's Notes

Title: Analysis of matched data; plus, diagnostic testing


1
Analysis of matched data plus, diagnostic testing
2
Correlated Observations
  • Correlated data arise when pairs or clusters of
    observations are related and thus are more
    similar to each other than to other observations
    in the dataset.
  • Ignoring correlations will
  • overestimate p-values for within-person or
    within-cluster comparisons
  • underestimate p-values for between-person or
    between-cluster comparisons

3
Pair Matching Why match?
  • Pairing can control for extraneous sources of
    variability and increase the power of a
    statistical test.
  • Match 1 control to 1 case based on potential
    confounders, such as age, gender, and smoking.

4
Example
  • Johnson and Johnson (NEJM 287 1122-1125, 1972)
    selected 85 Hodgkins patients who had a sibling
    of the same sex who was free of the disease and
    whose age was within 5 years of the
    patientsthey presented the data as.

OR1.47 chi-square1.53 (NS)
From John A. Rice, Mathematical Statistics and
Data Analysis.
5
Example
  • But several letters to the editor pointed out
    that those investigators had made an error by
    ignoring the pairings. These are not independent
    samples because the sibs are pairedbetter to
    analyze data like this

OR2.14 chi-square2.91 (p.09)
From John A. Rice, Mathematical Statistics and
Data Analysis.
6
Pair Matching example
  • Match each MI case to an MI control based on age
    and gender.
  • Ask about history of diabetes to find out if
    diabetes increases your risk for MI.

7
Pair Matching example
Which cells are informative?
8
Pair Matching
OR estimate comes only from discordant pairs! The
question is among the discordant pairs, what
proportion are discordant in the direction of the
case vs. the direction of the control. If more
discordant pairs favor the case, this indicates
ORgt1.
9
P(favors case/discordant pair)
10
odds(favors case/discordant pair)
11
OR estimate comes only from discordant
pairs!! OR 37/16 2.31 Makes Sense!
12
McNemars Test
Null hypothesis P(favors case / discordant
pair) .5 (note equivalent to OR1.0 or cell
bcell c)
13
McNemars Test
Null hypothesis P(favors case / discordant
pair) .5 (note equivalent to OR1.0 or cell
bcell c)
By normal approximation to binomial
14
McNemars Test generally
By normal approximation to binomial
Equivalently
15
McNemars Test
McNemars Test
16
Example McNemars EXACT test
  • Split-face trial
  • Researchers assigned 56 subjects to apply SPF 85
    sunscreen to one side of their faces and SPF 50
    to the other prior to engaging in 5 hours of
    outdoor sports during mid-day. The outcome is
    sunburn (yes/no).
  • Unit of observation side of a face
  • Are the observations correlated? Yes.


Russak JE et al. JAAD 2010 62 348-349.
17
Results ignoring correlation
Table I   --  Dermatologist grading of sunburn
after an average of 5 hours of skiing/snowboarding
(P .03 Fishers exact test)
Sun protection factor Sunburned Not sunburned
85 1 55
50 8 48

Fishers exact test compares the following
proportions 1/56 versus 8/56. Note that
individuals are being counted twice!
18
Correct analysis of data
Table 1. Correct presentation of the data (P
.016 McNemars exact test).
SPF-50 side SPF-50 side
SPF-85 side Sunburned Not sunburned
Sunburned 1 0
Not sunburned 7 48
McNemars exact test Null hypothesis
Xbinomial (n7, p.5)
19
RECALL 95 confidence interval for a difference
in INDEPENDENT proportions
20
95 CI for difference in dependent proportions
21
95 CI for difference in dependent proportions
22
The connection between McNemar and
Cochran-Mantel-Haenszel Tests
23
View each pair is its own age-gender stratum
Example Concordant for exposure (cell a from
before)
24
x 9
x 37
x 16
x 82
25
Mantel-Haenszel for pair-matched data
We want to know the relationship between diabetes
and MI controlling for age and gender (the
matching variables). Mantel-Haenszel methods
apply.
26
RECALL The Mantel-Haenszel Summary Odds Ratio
27
ad/T 0 bc/T0
ad/T1/2 bc/T0
ad/T0 bc/T1/2
ad/T0 bc/T0
28
Mantel-Haenszel Summary OR
29
Mantel-Haenszel Test Statistic(same as McNemars)
30
Concordant cells contribute nothing to
Mantel-Haenszel statistic (observedexpected)
31
Discordant cells
32
(No Transcript)
33
(No Transcript)
34
Example Salmonella Outbreak in France, 1996
From Large outbreak of Salmonella enterica
serotype paratyphi B infection caused by a goats'
milk cheese, France, 1993 a case finding and
epidemiological study BMJ 312 91-94 Jan 1996.
35
(No Transcript)
36
Epidemic Curve
37
Matched Case Control Study
  • Case Salmonella gastroenteritis.
  • Community controls (11) matched for
  • age group (lt 1, 1-4, 5-14, 15-34, 35-44, 45-54,
    55-64, or gt 65 years)
  • gender
  • city of residence

38
Results
39
In 2x2 table form any goats cheese
40
In 2x2 table form Brand A Goats cheese
41
x8
x24
x2
x25
42
Summary 8 concordant-exposed pairs (strata)
contribute nothing to the numerator
(observed-expected0) and nothing to the
denominator (variance0).
Summary 25 concordant-unexposed pairs contribute
nothing to the numerator (observed-expected0)
and nothing to the denominator (variance0).
43
Summary 2 discordant control-exposed pairs
contribute -.5 each to the numerator
(observed-expected -.5) and .25 each to the
denominator (variance .25).
Summary 24 discordant case-exposed pairs
contribute .5 each to the numerator
(observed-expected .5) and .25 each to the
denominator (variance .25).
44
(No Transcript)
45
Diagnostic Testing and Screening Tests
46
Characteristics of a diagnostic test
  • Sensitivity Probability that, if you truly have
    the disease, the diagnostic test will catch it.
  • SpecificityProbability that, if you truly do not
    have the disease, the test will register
    negative.

47
Calculating sensitivity and specificity from a
2x2 table
ab
cd
Among those with true disease, how many test
positive?
Among those without the disease, how many test
negative?
48
Hypothetical Example
10
990
Sensitivity9/10.90
1 false negatives out of 10 cases
Specificity 881/990 .89
109 false positives out of 990
49
What factors determine the effectiveness of
screening?
  • The prevalence (risk) of disease.
  • The effectiveness of screening in preventing
    illness or death.
  • Is the test any good at detecting
    disease/precursor (sensitivity of the test)?
  • Is the test detecting a clinically relevant
    condition?
  • Is there anything we can do if disease (or
    pre-disease) is detected (cures, treatments)?
  • Does detecting and treating disease at an earlier
    stage really result in a better outcome?
  • The risks of screening, such as false positives
    and radiation.

50
Positive predictive value
  • The probability that if you test positive for the
    disease, you actually have the disease.
  • Depends on the characteristics of the test
    (sensitivity, specificity) and the prevalence of
    disease.

51
Example Mammography
  • Mammography utilizes ionizing radiation to image
    breast tissue.
  • The examination is performed by compressing the
    breast firmly between a plastic plate and an
    x-ray cassette that contains special x-ray film.
  • Mammography can identify breast cancers too small
    to detect on physical examination.
  • Early detection and treatment of breast cancer
    (before metastasis) can improve a womans chances
    of survival.
  • Studies show that, among 50-69 year-old women,
    screening results in 20-35 reductions in
    mortality from breast cancer.

52
Mammography
  • Controversy exists over the efficacy of
    mammography in reducing mortality from breast
    cancer in 40-49 year old women.
  • Mammography has a high rate of false positive
    tests that cause anxiety and necessitate further
    costly diagnostic procedures.
  • Mammography exposes a woman to some radiation,
    which may slightly increase the risk of mutations
    in breast tissue.

53
Example
  • A 60-year old woman has an abnormal mammogram
    what is the chance that she has breast cancer?
    E.g., what is the positive predictive value?

54
Calculating PPV and NPV from a 2x2 table
ac
bd
Among those who test positive, how many truly
have the disease?
Among those who test negative, how many truly do
not have the disease?
55
Hypothetical Example
118
882
PPV9/1187.6
NPV881/88299.9
Prevalence of disease 10/1000 1
56
What if disease was twice as prevalent in the
population?
20
980
sensitivity18/20.90
specificity872/980.89
Sensitivity and specificity are characteristics
of the test, so they dont change!
57
What if disease was more prevalent?
126
874
PPV18/12614.3
NPV872/87499.8
Prevalence of disease 20/1000 2
58
Conclusions
  • Positive predictive value increases with
    increasing prevalence of disease
  • Or if you change the diagnostic tests to improve
    their accuracy.
Write a Comment
User Comments (0)
About PowerShow.com