Title: Power: 1
1Hypothesis Testing Type II Error and Power
2Type I and Type II Error Revisited
NULL HYPOTHESIS Actually True Actually
False
? 1-a Type II error b
Type I error a ? 1-b
Fail to Reject DECISION Reject
Either type error is undesirable and we would
like both a and b to be small. How do we control
these?
3- A Type I error, or an a-error is made when a true
hypothesis is rejected. - The letter a (alpha) is used to denote the
probability related to a type I error - a also represents the level of significance of
the decision rule or test - You, as the investigator, select this level
4- A Type II error, or an b-error is made when a
false hypothesis is NOT rejected. - The letter b (beta) is used to denote the
probability related to a type II error - 1-b represents the POWER of a test
- The probability of rejecting a false null
hypothesis - The value of b depends on a specific alternative
hypothesis - b can be decreased (power increased) by
- increasing sample size
5Computing Power of a Test
- Example Suppose we have test of a mean with
- Ho mo 100 vs. Ha mo ?100
- s 10
- n 25
- a .05
- If the true mean is in fact m 105,
- what is b, the probability of failing to reject
Ho when we should ? - What is the power (1-b) of our test to reject Ho
when we should reject it?
6In this example, the standard error is s/?n
10/52, so that
a/2 .025
a/2 .025
mo100
100 -1.96(2) 96.08
100 1.96(2) 103.92
We will reject Ho if (x ? 96.08) or if (x ?
103.92)
7- We will reject Ho
- if x is greater than 103.92
- or x is less than 96.08
- Lets look at these decision points relative to
our specific alternative. - Suppose, in fact, that ma 105.
Distribution based on Ha
96.08
ma105
103.92
8ma105
103.92
96.08
z
- 4.46 - 0.96 0
9- note
- a is fixed in advance by the investigator
- b depends on
- the sample size ? se (s / ?n)
- the specific alternative, ma
- we assume that the variance s2 holds for both the
null and alternative distributions
a/2
a/2
b
ma 105
m0 100
100-1.96(se) 96.08
1001.96(se) 103.92
10Again, looking at our specific alternative ma
105
b area where we fail to reject Ho even though Ha
is correct
a/2 area where we reject Ho for Ha Good!
a/2
ma 105
m0 100
100-1.96(se) 96.08
1001.96(se) 103.92
11- We define power as 1-b
- power Pr(rejecting Ho Ha is true)
- In our example,
- power 1-b 1 .1685 .8315
- That is,
- with a .05
- a sample size of n25
- a true mean of ma 105,
- the power to reject the null hypothesis (mo100)
is 83.15.
12Example 2
Suppose we want to test, at the a .05 level,
the following hypothesis Ho m 67 vs. Ha m
? 67 We have n25 and we know s 3.
To test this hypothesis we establish our critical
region.
a/2
a/2
? 67 ?
13Here, we reject Ho, at the a.05 level when or
a/2 Rejection region
a/2 Rejection region
65.82 67 68.18
14Now, select a specific alternative to compute
b Let Ha1 ma67.5
fail-to-reject region based on H0
65.82 67.5 68.18
z
2.80 0 1.13
or Power 1-b 13
15Now look at the same thing for different values
of ma
Type II Error (b) and Power of Test for a .05,
n25, mo 67, s 3
ma zlower zupper b Power 1-b
68.5 - 4.47 - .53 .29 .71
68 - 3.36 0.30 .62 .38
67.5 - 2.80 1.13 .87 .13
67 - 1.96 1.96 .95 .05
66.5 - 1.13 2.80 .87 .13
66 - 0.30 3.36 .62 .38
65.5 0.53 4.47 .29 .71
mo
16Let us plot Power (1-b) vs. alternative mean
(µa). This plot will be called the power curve.
Note at ma mo 1-b a
1.00
The farther the alternative is from m0, the
greater the power.
0.75
0.50
1 - b
0.25
0.00
65
66
67
68
69
m0
ma
17Suppose we want to test, the same hypothesis,
still at the a .05 level, s 3 Ho m
67 vs. Ha m ? 67 But we will now use n100.
We establish our critical region now with sx
s / ?n 3/10 .3
a/2
a/2
? 67 ?
18With n100, we reject Ho, at the a.05 level
when or
a/2 Rejection region
a/2 Rejection region
66.41 67 67.59
19Again, select a specific alternative to compute
b Let Ha ma67.5
fail-to-reject region based on H0
66.41 67.5 67.59
z
3.63 0 0.30
or Power 1-b 38
20Now look at the same thing for different values
of ma
Type II Error (b) and Power of Test for a .05,
n100, mo 67, s 3
ma zlower zupper b Power 1-b
68.5 - 6.97 - 3.04 .00 1.00
68 - 5.30 - 1.37 .09 .91
67.5 - 3.63 0.30 .62 .38
67 - 1.96 1.96 .95 .05
66.5 - 0.30 3.63 .62 .38
66 1.37 5.30 .09 .91
65.5 3.04 6.97 .00 1.00
mo
21Power Curves Power (1-b) vs. ma for n25,
100 a .05, mo 67
n 100 n 25
1 - b
For the same alternative ma, greater n gives
greater power.
ma
22- Clearly, the larger sample size has resulted in
- a more powerful test.
- However, the increase in power required an
additional 75 observations. - In all cases a .05.
- Greater power means
- we have a greater chance of rejecting Ho in favor
of Ha - even for alternatives that are close to the value
of mo.
23- We will revisit our discussion of power when we
discuss sample size in the context of hypothesis
testing. - Minitab allows you to compute power of a test for
a specific alternative - You must supply
- The difference between the null and a specific
alternative mean m0-ma - The sample size, n
- The standard deviation, s
24Using Minitab to estimate Sample Size Stat ?
Power and Sample Size ? 1-Sample Z
Sample size (to specify several, separate with a
space)
Difference between mo and ma ( to specify
several, separate with a space)
2-sided test
s
25Power and Sample Size 1-Sample Z Test Testing
mean null (versus not null) Calculating power
for mean null difference Alpha 0.05
Assumed standard deviation 10
Sample Difference Size Power 2
25 0.170075 2 100 0.516005
5 25 0.705418 5
100 0.998817