18: Cross-Tabulated Counts - PowerPoint PPT Presentation

About This Presentation
Title:

18: Cross-Tabulated Counts

Description:

Chapter 18 Cross-Tabulated Counts Part A Chapter 18, Part A: 18.1 Types of Samples 18.2 Naturalistic and Cohort Samples 18.3 Chi-Square Test of Association Types of ... – PowerPoint PPT presentation

Number of Views:54
Avg rating:3.0/5.0
Slides: 30
Provided by: budger
Learn more at: https://www.sjsu.edu
Category:

less

Transcript and Presenter's Notes

Title: 18: Cross-Tabulated Counts


1
Chapter 18Cross-Tabulated CountsPart A
2
Chapter 18, Part A
  • 18.1 Types of Samples
  • 18.2 Naturalistic and Cohort Samples
  • 18.3 Chi-Square Test of Association

3
Types of Samples
  • I. Naturalistic Samples simple random sample
    or complete enumeration of the population
  • II. Purposive Cohorts select fixed number of
    individuals in each exposure group
  • III. Case-Control select fixed number of
    diseased and non-diseased individuals

4
Naturalistic (Type I) Sample
Random sample of study base
5
Naturalistic (Type I) Sample
Random sample of study base
  • How did we study CMV (the exposure) and
    restenosis (the disease) relationship via a
    naturalistic sample?
  • A population was identified and sampled
  • Sample classified as CMV and CMV-
  • Disease occurrence (restenosis) was studied and
    compared in the groups.

6
Purposive Cohorts (Type II sample)
Fixed numbers in exposure groups
  • How would we study CMV and restenosis with a
    purposive cohort design?
  • A population of CMV individuals would be
    identified.
  • From this population, select, say 38,
    individuals.
  • A population of CMV- individuals would be
    identified.
  • From this population, select, say, 38
    individuals.
  • Disease occurrence (restenosis) would be studied
    and compared among the groups.

7
Case-control (Type III sample)
Set number of cases and non-cases
  • How would I do study CMV and restenosis with a
    case-control design?
  • A population of patents who experienced
    restenosis (cases) would be identified.
  • From this population, select, say, 38,
    individuals.
  • A population of patients who did not restenose
    (controls) would be identified.
  • From this population, select, say, 38
    individuals.
  • The exposure (CMV) would be studied and compared
    among the groups.

8
Case-Control (Type III sample)
Set number of cases and non-cases
9
Naturalistic Sample Illustrative Example
Edu. Smoke? Smoke?
Edu. - Tot
HS 12 38 50
JC 18 67 85
JC 27 95 122
UG 32 239 271
Grad 5 52 57
Total 94 491 585
  • SRS, N 585
  • Cross-classify education level (categorical
    exposure) and smoking status (categorical
    disease)
  • Talley R-by-C table cross-tab

10
Cross-tabulation (cont.)
Educ. Smoke? Smoke?
Educ. - Tot
HS 12 38 50
JC 18 67 85
Some 27 95 122
UG 32 239 271
Grad 5 52 57
Total 94 491 585
Row margins
Total
Column margins
11
Cross-tabulation of counts
For uniformity, we will always put the exposure
variable in rows put the disease variable in
columns
12
Exposure / Disease relationship
Use conditional proportions to describe
relationships between exposure and disease

13
Conditional Proportions Exposure / Disease
Relationship
In naturalistic and cohort samples ? row percents!
R-by-2 Table R-by-2 Table R-by-2 Table R-by-2 Table
- Total
Grp 1 a1 b1 n1
Grp 2 a2 b2 n2
? ? ? ?
Grp R aR bR nR
Total m1 m2 N
14
Example
Prevalence of smoking by education
Lower education associated with higher prevalence
(negative association between education and
smoking)
15
Relative Risks
Let group 1 represent the least exposed group
16
Illustration RRs
Note trend
17
k Levels of Disease
Efficacy of Echinacea example. Randomized
controlled clinical trial echinacea vs. placebo
in treatment of URI Exposure Echinacea vs.
placebo Disease severity of illness
Source JAMA 2003, 290(21), 2824-30
18
Row Percents for Echinacea Example
Echinacea group fared slightly worse than placebo
group
19
Chi-Square Test of Association
  • A. H0 no association in population Ha
    association in population
  • B. Test statistic

20
Observed
Degree Smoke Smoke - Tot
HS 12 38 50
JC 18 67 85
JC 27 95 122
UG 32 239 271
Grad 5 52 57
Total 94 491 585
21
Expected
Smoke Smoke - Total
HighS (50 94) 585 8.034 (50 491) 585 41.966 50
JC 13.658 71.342 85
Some 19.603 102.397 122
UG 43.545 227.455 271
Grad 9.159 47.841 57
Total 94 491 585
22
Continuity Corrected Chi-Square
  • Pearsons (uncorrected) chi-square
  • Yates continuity-corrected chi-square

23
Chi-Square Hand Calc.
24
Chi-Square ? P-value
  • X2stat 13.20 with 4 df
  • Table E ? 4 df row ? bracket chi-square statistic
    ? look up right tail (P-value) regions
  • Example bracket X2stat between 11.14 (P .025)
    and 13.28 (P .01)
  • ?.01 lt P lt .025

  Right tail Right tail Right tail Right tail Right tail Right tail Right tail Right tail Right tail
0.975 0.25 0.20 0.15 0.10 0.05 0.025 0.01 0.01
df 4 0.48 5.39 5.99 6.74 7.78 9.49 11.14 13.28 14.86
25
Illustration X2stat 13.20 with 4 df
The P-value AUC in the tail beyond X2stat
26
WinPEPI gt Compare2 gt F1
Input screen row 5 not visible
Output
27
Chi-Square, cont.
  1. How the chi-square works. When observed values
    expected values, the chi-square statistic is 0.
    When the observed minus expected values gets
    large ? evidence against H0 mounts
  2. Avoid chi-square tests in small samples. Do not
    use a chi-square test when more than 20 of the
    cells have expected values that are less than 5.

28
Chi-Square, cont.
  1. Supplement chi-squares with descriptive stat.
    Chi-square statistics do not quantify effects
  2. For 2-by-2 tables, chi-square and z tests produce
    identical P-values.

29
Discussion and demo on power and sample size
  • For estimation
  • For testing
  • Power
  • Sample size
Write a Comment
User Comments (0)
About PowerShow.com