The sample size calculation for heterogeneous variances presentation

About This Presentation

Transcript and Presenter's Notes

Title: The sample size calculation for heterogeneous variances

1
The sample size calculation for heterogeneous
variances

Wei-ming Luh
???
National Cheng Kung University,
????
Tainan, Taiwan

Selecting an insufficient sample size yields a
study with inadequate sensitivity, whereas
selecting an excessive sample size wastes
resources.

3
Sufficient sample size is important.

Textbooks Mace (1974), Cohen (1988), Kraemer
Thiemann (1987), and Desu Raghavarao (1990).
Computer programs Gorman Primavera (1995),
Lenth (2000), Morse (1999), SAS Institute (1999)
Thomas (1998) even provided a comprehensive list
of power-analysis software
//www.forestry.ubc.ca/conservation/power/

4
One-sample t testGiven a, ß

The minimum sample size needed
is the standardized effect
size, which is the difference of the actual and
hypothesized means in standard deviation, s,
units

5
Guenther (1981)s modification
6
Caution!

Conventional formulas are based on the assumption
of normality and variance homogeneity.

7
Assumption violationHeavy-tailed and asymmetric
distribution
8
In the context of long-tailed distributions and
heterogeneous variances

Trimmed mean method to correct non-normality
Approximate test to take care of heterogeneity
(Behrens-Fisher problem )

9
1. Trimmed mean
Let be the order
statistics of random sample
Let be the proportion of trimming in each
tail of the distribution So the effective
sample size is
10
Yuens method (1974)
which is distributed approximately as the Student
t with degrees of freedom
11
Winsorized Variance(replacing)
12
Effective sample size
13
Original sample size
Always rounding up to the next highest integer.
14
Monte Carlo simulation

1. We generated data by using SAS RANNOR function
to create the standard normal observations (Z)
2. We used the g-and-h distributions (Hoaglin,
1985) to transform Z to reflect the target
distribution shapes

15
Five distribution shapes

The corresponding skew and kurtosis Normal
(g0, h0) (0, 0)
heavy-tailed (g0, h.1) (0, 5.5)
(g0, h.2) (0, 36.22)
Asymmetrical (g0.5, h0) (1.75, 8.9)
(g0.5, h.2) (13.16, 42895)

16
Sample size table (for d1, one-sided test,
power.80)
17
Simulation results (a.05)
18
Conclusion

The heavier the distribution tail, the fewer the
subjects needed for the trimmed mean method.
The trimmed sample size can achieve the desired
statistical power while the conventional sample
size formulas result in over-sampling.

19
2. unequal variances
Yuens method (1974)
20
Schouten (1999),
21
Monte Carlo simulation
22
(No Transcript)
23

Generalized Behrens-Fisher problem
Schwertman (1987)
The largest mean difference can be tested as

24
More questions

How much to trim?
When variances are unknown and possibly unequal,
the usual unbiased estimate can be used.
How about confidence interval?
How about cost constraint?

25
More sample size formulas should be developed for
robust statistics.

Thank you for your listening.

luhwei_at_mail.ncku.edu.tw

Write a Comment

User Comments (0)

About PowerShow.com

The sample size calculation for heterogeneous variances PowerPoint PPT Presentation