6'Hypothesis Testing and the Comparison of 2 or More Populations PowerPoint PPT Presentation

presentation player overlay
About This Presentation
Transcript and Presenter's Notes

Title: 6'Hypothesis Testing and the Comparison of 2 or More Populations


1
6. Hypothesis Testing and the Comparison of 2 or
More Populations
  • ASW Chapter 9 Chapter 10

2
A) Introduction
  • Estimating parameters of population ?
    hypothesis testing on our model.

3
Confidence Intervals and Hypothesis Testing
  • Confidence intervals ? range that µ falls into.
  • NOW is µ gt 0, or gt 1, etc.
  • OR is µ1 gt µ2?
  • Testing for specific values of µ.
  • We have a confidence interval for Saskatchewan
    female wages by age.
  • What could we test here?
  • We will have a confidence interval for Bachelors
    salaries in Saskatchewan.
  • What could we test here?
  • Others gasoline prices? Stock Market
    fluctuations?

4
B) Developing Null and Alternative Hypotheses
  • Start with a testable hypothesis.
  • Point of interest do older women get paid more?
  • Economic theory is 0 lt MPC lt 1 and constant?
  • Define its opposite
  • Older womens salaries are lt average.
  • MPC is gt 1.
  • One is the Null hypothesis, the other is the
    Alternative hypothesis.
  • Use sample data to test the Null hypothesis.
  • What if it is not that simple to have 2 opposites?

5
Which is the Null?
  • General rule the hypothesis with the sign or
    the lt or the gt sign is the Null.
  • OR the Null is something we assume is true
    unless contradicted by the sample.

6
1. Research hypotheses
  • Testing an exception to the general rule, so it
    goes in the alternative.
  • E.g, testing if older womens salaries (µ) gt
    average
  • H0 µ lt µ(average)
  • HA µ gt µ(average)
  • Results will tell us either
  • If testing shows H0 cannot be rejected
    (accepted) ? implies that older womens
    salaries are not higher, but we cannot be sure.
  • If testing shows H0 can be rejected ? we can
    infer HA is true, µ gt µ(average).

7
2. Testing the Validity of a Claim
  • Assume claim is true until disproven.
  • E.G. manufacturers claim of weight/container.
  • H0 µ gt 100 grams.
  • HA µ lt 100 grams.
  • Results will tell us either
  • If testing shows H0 cannot be rejected
    (accepted) ? manufacturers claim not
    challenged.
  • If testing shows H0 can be rejected ? we can
    infer manufacturer is lying.

8
3. Testing in Decision-Making
  • Here, if either too high or too low, need to do
    something.
  • E.G. is class length 75 minutes?
  • H0 µ 75 minutes.
  • HA µ ? 75 minutes.
  • If H0 not rejected (accepted), no change in
    behaviour.
  • If H0 rejected gt change behaviour.

9
C) Type I and Type II Errors
  • Sample data ? could have errors.

10
Type I and Type II Errors
Population Condition
H0 True
H0 False
Conclusion
Correct Decision
Type II Error
Accept H0
Correct Decision
Type I Error
Reject H0
11
False Positive or False Negative?
U.K. police defend shoot-to-kill after
mistake Blair said Menezes had emerged from an
apartment block in south London that had been
under surveillance in connection with Thursdays
attacks, and refused police orders to halt.
Menezes had also been wearing an unseasonably
heavy coat, further raising police
suspicions. MSNBC, July 24th.
12
Other Type I and Type II Errors
  • Sampling songs.
  • Health tests.
  • Pregnancy tests.
  • Jury decisions.

13
Level of Significance
  • Hypothesis testing is really designed to control
    the chance of a Type I error.
  • Probability of Type I error the level of
    significance.
  • Selecting ? ( level of significance ) ? select
    probability of Type I error.
  • What is the level of significance for Jury
    trials?
  • We do not control for Type II errors
  • gt except by our language of stating do not
    reject H0.

14
Level of Significance contd
  • ? is picked by researcher gt normally 5?
  • ? 5 ? type I error happens only 5 of the time.

15
D) The probability value (p-value) approach
  • Develop null and alternative hypothesis.
  • Select level of significance ?.
  • Collect data, calculate sample mean and test
    statistic.
  • Use test statistic to calculate p-value.
  • Compare reject H0 if p-value lt ?.
  • The sample implies that the alternative (your
    research hypothesis) is true.

16
Hypothesis Testing The Critical Value Approach
  • Develop null and alternative hypothesis.
  • Select level of significance ?.
  • Collect data, calculate sample mean and test
    statistic.
  • Use ? to determine critical value and rejection
    rule.
  • Compare if test statistic gt critical value,
    reject H0.
  • The sample implies that the alternative (your
    research hypothesis) is true.

17
Hypothesis Testing contd
  • This is essentially inverting our confidence
    index.
  • ? Is ? more than 2 standard deviations away from
    some benchmark?

18
E) Population Mean, s Known, One-Tailed Test
  • Same hypothesis? p-value method and critical
    value method.
  • Example a new employment program initiative has
    been introduced to reduce time spent being
    unemployed.
  • Goal 12 weeks or less unemployed.
  • Population standard deviation believed to be 3.2
    weeks.
  • Sample of 40 unemployed workers, average time
    unemployed 13.25 weeks.
  • Assuming a level of significance (?) of .05, is
    the program goal being met?

19
First (Common) Steps
  • Hypothesis H0 µ lt 12 HA µ gt 12
  • Clearly 13.25 gt 12.
  • This casts doubt on our program goal (the null),
    and whether we should continue it.
  • Key is it enough more, given sample size and
    standard deviation, or is it just a (small)
    random fluctuation?

20
Computing the Test Statistic
  • Under our assumptions use the standard normal.
  • Use sample mean to calculate test statistic
  • Is this z big enough to reject the null
    hypothesis?
  • Next, go our two routesCalculate p-value OR
    z-critical.

21
Calculating the p-value
  • Given the z-value, what is the corresponding
    probability?
  • This is the probability that 13.25 gt 12 by chance.

22
Calculating the p-value contd
  • Find 2.47 on the Standard Normal Distribution
    tables

23
Calculating the p-value contd
  • .4932 is the probability of being between 0 and
    z2.47.
  • p 0.5 0.4932 0.0038.

24
Should We Reject the Hypothesis?
  • This says that the probability of getting a
    sample mean of 13.25 when the true mean is 12
    .0038 or less than ½ of 1 percent.
  • Our significance level was only 5, so we reject
    the null.
  • We are 99.62 certain that the program has
    failed.

25
Rejection of the Null
0.0038
0
Z2.47
z
Z.05
  • Sometimes we say significant at the 0.38
    level.

26
Critical Value Approach
  • This is an alternative you often see in textbooks
    or articles.
  • Find the value of z.05, and compare it to the
    test value of z (2.47).
  • From the tables, z.05 1.645.
  • Because 2.47 gt 1.645, reject H0.

27
Rejection
0
Z2.47
z
Z.05 1.645
28
Excel
  • Lets do this example in Excel.
  • Look at Appendix 9.2 in text, especially Figure
    9.8

29
F) Population Mean, s Known, Two-Tailed Test
  • Null µ µ0 ? Alternative is µ ? µ0.
  • Must examine two areas of the distribution.
  • Example
  • Price/earnings ratios for stocks.
  • Theory stable rate of P/E in market 13.
  • If P/E (market) lt 13, you should invest in the
    stock market.
  • If P/E (market) gt 13, you should take your money
    out.

30
Estimate Steps
  • Can we estimate if the population P/E is 13 or
    not?
  • Common steps
  • Set hypothesis H0 µ 13 HA µ ? 13
  • Select ? .05.
  • Calculate standard error.
  • Calculate z-value.

31
Calculating Test Statistic
  • We have a sample of 50
  • ? 12.1.
  • Historical s 3.0456.

32
p-value Approach
  • Calculate the p-value.
  • We will calculate for the lower tail? then make
    an adjustment for the upper tail.

33
p-value, Two-Tailed Test
p(z gt 2.09) ??
p(z lt 2.09) ??
0
Z2.09
z
Z2.09
We can just calculate one value, and double it.
34
Calculating the p-value contd
  • Find 2.090 on the Standard Normal Distribution
    tables

35
p-value, Two-Tailed Test
0.4817
p(z gt 2.09) 0.5 -.4817 0.0183
p(z lt 2.09) ??
0
Z2.09
z
Z2.09
Doubling the value, we find the p-value 0.0366
36
Should We Reject the Null Hypothesis?
  • Yes!
  • p-value 0.0366 lt ? 0.05.
  • There is only a 3.66 chance that the measured
    price/earnings ratio sample mean of 12.1 is not
    equal to the stable rate of 13 by random chance.

37
Critical Value Approach
  • Reject the null hypothesis iftest z-value gt
    critical value or if test z-value lt critical
    value
  • Two tailed test ? 0.05 ? need critical value
    for ?/2 0.025.
  • The tables tell us that this is 1.96.

38
G) Population Mean, s Unknown
  • s unknown ? must estimate it with our sample too
    ? use t-distribution, n 1 degrees of freedom.

39
One-tailed Test, p-value Approach
  • Steps1. Set up hypothesis.2. Decide on level
    of significance (?)3. Collect data, calculate
    sample mean and test statistic.4. Use test
    statistic t-table/Excel to calculate
    p-value.5. Compare reject H0 if p-value lt ?.

40
Example Highway Patrol
The RCMP periodically samples vehicle speeds
at various locations on a particular roadway.
The sample of vehicle speeds is used to test the
hypothesis
H0 m lt 65
The locations where H0 is rejected are
deemed the best locations for radar traps.
41
Example Highway Patrol
Outside LumsdenA sample of 64 vehicles gt
average speed 65.5 mph gt standard deviation
4.2 mph. Use a .05 to test the hypothesis.
42
Common to Both Approaches
1. Determine the hypotheses.
H0 ? lt 65 Ha m gt 65
a .05
2. Specify the level of significance.
3. Compute the value of the test statistic.
43
4. Estimate the p-value From t-Distribution Table
  • Must interpolate the value of t 0.9524, df
    65

44
p Value Approach
5. Determine whether to reject H0.
Because pvalue gt a .05, we do NOT reject H0.
We are at least 95 confident that the mean
speed of vehicles outside Lumsden is LESS than OR
EQUAL TO 65 mph.
45
Critical Value Approach
4. Determine the critical value and rejection
rule.
For a .05 and d.f. 64 1 63, t.05 1.669
Reject H0 if t gt 1.669
5. Determine whether to reject H0.
Because 0.9524 lt 1.669, we do NOT reject H0.
46
H) Introduction Comparing Population Differences
  • Do men get paid more than women?
  • 46,452 for men vs. 35,122 for women
    (bachelors).
  • Do more 100-level Economics courses help you in
    Econ 201?
  • Has the crime rate risen?
  • Are there more hurricanes recently?

47
Key point the role of standard deviation
48
Comparing 2 Populations
  • True population means ?1 and ?2.
  • Random sample of n1 gt ?1.
  • Versus random sample of n2 gt ?2.
  • Transform into problem is ?1 ?2 0?
  • Assuming ?1 and ?2 known gt use z-test.
  • If unknown gt estimate ?s from sample ss, and
    use t-test.

49
I) Confidence Intervals, 2 Means ?s Unknown
  • How important is an extra introductory course in
    determining your grade in Economics 201?
  • Data
  • Natural experiment.
  • 59 students.
  • 43 had only one 10x course.
  • 16 had two 10x courses.
  • Final exam grades
  • One 10x average 61.69, s1 22.65, n1 43.
  • Two 10x average 75.11, s2 12.80, n2 16.

50
Confidence Interval Estimation
  • Point estimator ?1 ?2.
  • Standard error of ?1 ?2 is

51
Confidence Interval contd
  • Confidence interval of difference in means ?1
    ?2 Margin of Error
  • Typically use a 0.05.
  • Margin of error

52
Degrees of Freedom
  • One UGLY formula
  • In this example df 47.36 ? round down to 47.
  • 95 confidence interval ? t0.025.
  • For 47 degrees of freedom, table says 2.012.

53
Confidence Interval contd.
54
Confidence Interval contd.
55
Confidence Interval
  • We are 95 confident that students with only one
    10x course scored between 3.775 and 22.725
    lower than students with two 10x courses.
  • Next step would be why, how??

56
J) Hypothesis Tests, 2 Means ?s Unknown
  • Two datasets gt is the mean value of one larger
    than the other?
  • Is it larger by a specific amount?
  • µ1 vs. µ2 gt µ1 µ2 vs. D0.
  • Often set D0 0 gt is µ1 µ2?

57
Example Female vs. Male Salaries
  • Saskatchewan 2001 Census data- only Bachelors
    degrees- aged 21-64- work full-time- not in
    school
  • Men ?M 46,452.48, sM 36,260.1, nM 557.
  • Women ?W 35,121.94, sW 20,571.3, nW 534.
  • ?M ?W 11,330.44 our point estimate.
  • Is this an artifact of the sample, or do men make
    significantly more than women?

58
Hypothesis, Significance Level, Test Statistic
  • We will now ONLY use the p-value approach, and
    NOT the critical value approach.
  • Research hypothesis men get paid more
  • H0 µM µW lt 0H1 µM µW gt 0
  • Select ? 0.05
  • Compute test t-statistic

59
4. a. Compute the Degrees of Freedom
  • Can compute by hand, or get from Excel

60
4. b. Computing the p-value
The p-value ltltlt 0.005.
61
5. Check the Hypothesis
  • Since the p-value is ltltlt 0.05, we reject H0.
  • We conclude that we can accept the alternative
    hypothesis that men get paid more than women at a
    very high level of confidence (greater than 99).

62
Excel
63
Summary
  • Hypothesis tests on comparing two populations.
  • Convert to a comparison of the difference to a
    standard.
  • More complex standard deviation and degrees of
    freedom.
  • Same methodology as comparing other hypothesis
    tests.

64
K) Statistical vs. Practical Significance
  • Our tests statistically significantly
  • Real world interestpractical significance.
  • Men vs. women the difference is statistically
    significant AND practically
  • 46,452.48 vs 35,121.94
  • Saskatchewan, full-time, BachelorsWomen make
    only 75.6 of men, same average education level.

65
Source Leader-Post, Oct. 31, 2008
66
L) Matched Samples
  • Controlled experiment gt match individuals in
    each group.
  • Matched samples gt each individual tries each
    method in turn.
  • Variation between samples not a problem.
  • Focus on difference data.
  • Independent samples gt the norm in economics.
  • Regression analysis.

67
M) Introduction to ANOVA
  • What if we want to compare 3 or more sample means
    (treatment means)?
  • Example total income, Saskatchewan females
    employed full-time and full-year, by age, 2003
    (Source See Oct. 8th lectures)

68
ANOVAs Hypotheses
69
N) Steps of ANOVA
1. Set up the Hypothesis Statements H0 µ1 µ2
µ3 µ4 µk HA Not all population means
are equal 2. Collect your sample data Means
?1, ?2, ?3, ?4, ?k Variances s21, s22,
s23, s24 , s2k Sample Sizes n1, n2, n3, n4,
nk
70
Steps of ANOVA Continued
  • 4. Calculate the overall average

5. Create our two estimates of ?2.
71
Step 5 a) Estimating ?2 via SSTR
  • Between-treatments estimate of ?2 or sum of
    squares due to treatments (SSTR).
  • This compares to , and constructs
    an estimate of ?2 based on the assumption the
    Null Hypothesis is true

72
Step 5 b) Estimating ?2 via SSE
  • Within-treatments estimate of ?2 or the sum of
    squares due to error (SSE).
  • This takes the weighted average of the sample sj2
    as an estimate of ?2 and is a good estimate
    regardless of whether the Null is true

73
Step 6 Testing The Null
  • If Null true, both estimates should be similar,
    andSSTR 1.SST
  • If ratio gtgtgt 1 ? reject the Null, accept the
    Alternative that there is multiple population
    distributions.

74
Steps of ANOVA
  • Set up the Hypothesis Statement. (Null all
    means are equal)
  • Collect the sample data.
  • Select level of significance gt a 0.05.
  • Calculate the overall average.
  • a) Estimate ?2 via sum of squares due to
    treatments (SSTR).b) estimate of ?2 via sum of
    squares due to error (SSE).
  • If Null true, both estimates should be similar,
    and STR 1. SST

75
MSTR and MSE
  • MSTR sum of squares due to treatment numerato
    r degrees of freedom
  • sum of squares due to treatment no.
    of treatments 1
  • SSTR k-1
  • MSE Sum of squares due to error denominator
    degrees of freedom
  • Sum of squares due to error
  • total no. of obs. no. of treatments
  • SSE nT k

76
F-test
  • F-statistic MSTR ? k-1 degrees of freedom
    (df1) MSE ? nT k degrees of freedom
    (df2)
  • If H0 is true, MSTR MSE ? F-statistic 1.
  • If H0 is false? p-value is lt level of
    significance (a).? F-statistic is higher than
    critical value from the table/Excel.

77
F-Distribution
0
Ftest-value
78
O) Saskatchewan Female Wages Example
  • Example total income, Saskatchewan females
    employed full-time and full-year, by age, 2003
    (Source See Oct. 8th lectures)

79
Calculating the MSTR, MSE
80
Calculating F-Stat and p-value
  • Ftest-value MSTR 1148.5 2.59 MSE
    443.8

81
F-Table for df1 3 and df2 176
Clearly the p-value gt 0.05 gt accept the Null of
one distribution
176 degrees of freedom, F2.59 in here.
82
Excel F-test formula
  • FDIST(F-value, df1, df2) gt yields value of
    .0544.

0
2.59
83
P) Econometrics for Dummies
  • Instead of ANOVA, economists tend to use
    Regression analysis dummy variables.
  • Gives us the direction and size of the
    differences in mean values.
  • But ANOVA is a useful first step.
Write a Comment
User Comments (0)
About PowerShow.com