ANOVA - PowerPoint PPT Presentation

1 / 33

About This Presentation

Title:

ANOVA

Description:

ANOVA Determining Which Means Differ in Single Factor Models – PowerPoint PPT presentation

Number of Views:104

Avg rating:3.0/5.0

Slides: 34

Provided by: JohnLa63

Category:

more less

Transcript and Presenter's Notes

Title: ANOVA

1

ANOVA
Determining Which Means Differ in Single Factor
Models

2
Single Factor ModelsReview of Assumptions

Recall that the problem solved by ANOVA is to
determine if at least one of the true mean values
of several different treatments differs from the
others.
For ANOVA we assumed
The distribution of the population for each
treatment is normal.
The standard deviations of each population,
although unknown, are equal.
Sampling is random and independent.

3
Determining Which Means DifferBasic Concept

Suppose the result of performing a single factor
ANOVA test is a low p-value, which indicates that
at least one population mean does, in fact,
differ from the others.
The natural question is, Which differ?
The answer is that we conclude that two
population means differ if their two sample means
differ by a lot.
The statistical question is, What is a lot?

4
Example

The length of battery life for notebook computers
is of concern to computer manufacturers.
Toshiba is considering 5 different battery models
(A, B, C, D, E) that have different costs.
The question is, Is there enough evidence to
show that average battery life differs among
battery types?

5
Data

A B C D E
130 90 100 140 160
115 80 95 150 150
130 95 110 150 155
125 98 100 125 145
120 92 105 145 165
110 85 90 130 125
?x 121.67 90 100 140 150

Grand Mean ?x 120
6
(No Transcript)
7
OUTPUT

8
Motivation for The Fisher Procedure

Fishers Procedure is a natural extension of the
comparison of two population means when the
unknown variances are assumed to be equal
Recall this is an assumption in single factor
ANOVA
Testing for the difference of two population
means (with equal but unknown ss) has the form
H0 µ1 µ2 0
HA µ1 µ2 ? 0

9
Best Estimate for s2 and the Appropriate Degrees
of Fredom

Recall that when there were only 2 populations,
the best estimate for s2 is sp2 and the degrees
of freedom is (n1-1) (n2-1) or n1 n2 - 2.
For ANOVA, using all the information from the k
populations the best estimate for s2 is MSE and
the degrees of freedom is DFE.

Two populations With Equal
Variances ANOVA Best estimate for s2 sp2
MSE Degrees of Freedom n1 n2 2
DFE
10
Two Types of Tests

There are two types of tests that can be applied
A test or a confidence interval for the
difference in two particular means
e.g. µE and µB
A set of tests which determine differences among
all means.
This is called a set of experiment wise (EW)
tests.
The approach is the same.
We will illustrate an approach called the Fisher
LSD approach.
Only the value used for a will be different.

11
Determining if µi Differs From µjFishers LSD
Approach

H0 µi µj 0
HA µi µj ? 0

LSD stands for Least Significant Difference
12
When Do We Conclude Two Treatment Means (µi and
µj) Differ?

We conclude that two means differ, if their
sample means,?xi and?xj, differ by a lot.
A lot is LSD given by

13
Confidence Intervals for the Difference in Two
Population Means

A confidence interval for µi µj is found by

14
Equal vs. Unequal Sample Sizes

If the sample sizes drawn from the various
populations differ, then the denominator of the
t-statistic will be different for each pairwise
comparison.
But if the sample sizes are equal (n1 n2 n3
.) , we can designate the equal sample size by N
Then the t-test becomes

15
LSD For Equal Sample Sizes

16
What Do We Use For a?

Recall that a is
In Hypothesis Tests the probability of
concluding that there is a difference when there
is not.
In Confidence Intervals the probability the
interval will not contain the true difference in
mean values
If doing a single comparison test or constructing
a confidence interval,
For an experimentwise comparison of all means,
We will actually be conducting 10 t-tests
µE - µD, (2) µE - µC, (3) µE - µB, (4) µE - µA,
(5) µD - µC,
(6) µD - µB, (7) µD - µA, (8) µC µB, (9) µC -
µA, (10) µB - µA

select a as usual
Use aEW
17
aEW The probability ofMaking at least one Type
I Error

Suppose each test has a probability of concluding
that there is a difference when there is not
(making a Type I error) a.
Thus for each test, the probability of not making
a Type I error is 1-a.
So the probability of not making any Type I
errors on any of the 10 tests is (1- a)10
For a .05, this is (.95)10 .5987
The probability of making at least one Type I
error in this experiment, is denoted by aEW.
Here, aEW 1 - .5987 .4013 -- That is, the
probability we make at least one mistake is now
over 40!
To have a lower aEW, a for each test must be
significantly reduced.

18
The Bonferroni Adjustment for a

To make aEW reasonable, say .05, a for each test
must be reduced.
The Bonferroni Adjustment is as follows
NOTE decreasing a, increases ß, the probability
of not concluding that there is a difference
between two means when there really is. Thus,
some researchers are reluctant to make a too
small because this can result in very high ß
values.

19
What Should a for Each Test Be?
The required a values for the individual t-tests
for aEW .05 and aEW .10 are

For aEW .05 For aEW .05
Number of Treatments, k a for each test
3 0.01667
4 0.00833
5 0.00500
6 0.00333
7 0.00238
8 0.00179
9 0.00139
10 0.00111
For aEW .10 For aEW .10
Number of Treatments, k a for each test
3 0.03333
4 0.01667
5 0.01000
6 0.00667
7 0.00476
8 0.00357
9 0.00278
10 0.00222
20
LSDEWFor Multiple Comparison Tests

When doing the series of multiple comparison
tests to determine which means differ, the test
would be to conclude that µi differs from µj if
Where LSDEW is given by

21
Procedure for Testing Differences Among All Means

We begin by calculating LSDEW which we have shown
will not change from test to test if the sample
sizes are the same from each sample. That is the
situation in the battery example that we
illustrate here.
A different LSD would have to be calculated for
each comparison if the sample sizes are
different.

22
Procedure (continued)

Then construct a matrix as follows

23
Procedure (continued)

Fill in mean of each treatment across the top row
and down the left-most column (in our example,
XA 121.67, XB 90, XC 100,
XD 140, XE 150)

24
Procedure (continued)

For each cell below the main diagonal, compute
the absolute value of difference of the means in
the corresponding column and row

25
Procedure (continued)

Compare each difference with LSDEw(17.235 in our
case). If the
difference between and gt LSDEw. we can
conclude that there is difference in µi and µj.

26
Tests For the Battery Example

For the battery example,
Which average battery lives can we conclude
differ?
Give a 95 confidence interval for the difference
in average battery lives between
C batteries and B batteries
E batteries and B batteries

Use LSDEW Multiple Comparisons
Use LSD Individual Comparisons
27
Battery Example Calculations

Experimental error of ?EW .05
For k 5 populations, a aEW /10 .05/10
.005
From the Excel output
? xE 150, ? xD 140, ? xA 121.67,? xc
100, ? xB 90
MSE 94.05333, DFE 25, N 6 from each
population
Use TINV(.005,25) to generate t.0025,25
3.078203

28
Analysis of Which Means Differ

We conclude that two population means differ if
their sample means differ by more than LSDEW
17.2355.
Construct a matrix of differences,
Compare with LSDEW

29
Conclusion of Comparisons
30
LSD For Confidence Intervals

Confidence intervals for the difference between
two mean values, i and j, are of the form
(Point Estimate) t?/2,DFE(Standard Error)

31
LSD for Battery Example

For the battery example

32
The Confidence Intervals

95 confidence interval for the difference in
mean battery lives between batteries of type C
and batteries of type B.
95 confidence interval for the difference in
mean battery lives between batteries of type E
and batteries of type B.

33
REVIEW