Final Review Session

About This Presentation

Title:

Final Review Session

Description:

Compares the proportion of successes in a sample to a ... F-test for Comparing the variance of two groups. Sample. Null hypothesis. The two populations ... – PowerPoint PPT presentation

Number of Views:27

Avg rating:3.0/5.0

Slides: 94

Provided by: lukeh4

Category:

more less

Transcript and Presenter's Notes

Title: Final Review Session

1
Final Review Session
2
Exam details

Short answer, similar to book problems
Formulae and tables will be given
You CAN use a calculator
Date and Time Dec. 7, 2006, 12-130 pm
Location Osborne Centre, Unit 1 (A)

3
(No Transcript)
4
Things to Review

Concepts
Basic formulae
Statistical tests

5
Things to Review

Concepts
Basic formulae
Statistical tests

6
Populations Samples Random sample
First Half
Null hypothesis Alternative hypothesis P-value
Parameters Estimates
Mean Median Mode
Type I error Type II error
Sampling distribution Standard error
Variance Standard deviation
Central limit theorem
Categorical Nominal, ordinal Numerical Discrete,
continuous
7
Second Half
Normal distribution Quantile plot Shapiro-Wilk
test Data transformations
Simulation Randomization Bootstrap Likelihood
Nonparametric tests
Independent contrasts
Observations vs. experiments Confounding
variables Control group Replication and
pseudoreplication Blocking Factorial design Power
analysis
8
Example Conceptual Questions

(youve just done a two-sample t-test comparing
body size of lizards on islands and the mainland)
What is the probability of committing a type I
error with this test?
State an example of a confounding variable that
may have affected this result
State one alternative statistical technique that
you could have used to test the null hypothesis,
and describe briefly how you would have carried
it out.

9
Randomization test
Null hypothesis Randomized data
Sample
Calculate the same test statistic on the
randomized data
Null distribution
Test statistic
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
10
Things to Review

Concepts
Basic formulae
Statistical tests

11
(No Transcript)
12
Things to Review

Concepts
Basic formulae
Statistical tests

13
Sample
Null hypothesis
Test statistic
Null distribution
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
14
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

15
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

16
Quick reference summary Binomial test

What is it for? Compares the proportion of
successes in a sample to a hypothesized value, po
What does it assume? Individual trials are
randomly sampled and independent
Test statistic X, the number of successes
Distribution under Ho binomial with parameters n
and po.
Formula

P 2 Prx?X
P(x) probability of a total of x successes p
probability of success in each trial n total
number of trials
17
Binomial test
Null hypothesis Prsuccesspo
Sample
Test statistic x number of successes
Null distribution Binomial n, po
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
18
Binomial test
19
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

20
Quick reference summary ?2 Goodness-of-Fit test

What is it for? Compares observed frequencies in
categories of a single variable to the expected
frequencies under a random model
What does it assume? Random samples no expected
values lt 1 no more than 20 of expected values lt
5
Test statistic ?2
Distribution under Ho ?2 with
df categories - parameters - 1
Formula

21
?2 goodness of fit test
Null hypothesis Data fit a particular Discrete
distribution
Sample
Calculate expected values
Test statistic

Null distribution
2 With
N-1-param. d.f.

compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
22
?2 Goodness-of-Fit test
23
Possible distributions
Prx n frequency of occurrence
24
Given a number of categories Probability
proportional to number of opportunities Days of
the week, months of the year
Proportional
Number of successes in n trials Have to know n, p
under the null hypothesis Punnett square, many
p0.5 examples
Binomial
Number of events in interval of space or time n
not fixed, not given p Car wrecks, flowers in a
field
Poisson
25
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

26
Quick reference summary ?2 Contingency Test

What is it for? Tests the null hypothesis of no
association between two categorical variables
What does it assume? Random samples no expected
values lt 1 no more than 20 of expected values lt
5
Test statistic ?2
Distribution under Ho ?2 with
df(r-1)(c-1) where r rows, c columns
Formulae

27
?2 Contingency Test
Null hypothesis No association between variables
Sample
Calculate expected values
Test statistic

Null distribution
2 With
(r-1)(c-1) d.f.

compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
28
?2 Contingency test
29
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

30
Quick reference summary One sample t-test

What is it for? Compares the mean of a numerical
variable to a hypothesized value, µo
What does it assume? Individuals are randomly
sampled from a population that is normally
distributed.
Test statistic t
Distribution under Ho t-distribution with n-1
degrees of freedom.
Formula

31
One-sample t-test
Null hypothesis The population mean is equal to
?o
Sample
Null distribution t with n-1 df
Test statistic
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
32
One-sample t-test

Ho The population mean is equal to ?o
Ha The population mean is not equal to ?o

33
Paired vs. 2 sample comparisons
34
Quick reference summary Paired t-test

What is it for? To test whether the mean
difference in a population equals a null
hypothesized value, µdo
What does it assume? Pairs are randomly sampled
from a population. The differences are normally
distributed
Test statistic t
Distribution under Ho t-distribution with n-1
degrees of freedom, where n is the number of
pairs
Formula

35
Paired t-test
Null hypothesis The mean difference is equal to ?o
Sample
Null distribution t with n-1 df n is the number
of pairs
Test statistic
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
36
Paired t-test

Ho The mean difference is equal to 0
Ha The mean difference is not equal 0

37
Quick reference summary Two-sample t-test

What is it for? Tests whether two groups have the
same mean
What does it assume? Both samples are random
samples. The numerical variable is normally
distributed within both populations. The
variance of the distribution is the same in the
two populations
Test statistic t
Distribution under Ho t-distribution with
n1n2-2 degrees of freedom.
Formulae

38
Two-sample t-test
Null hypothesis The two populations have the
same mean ?1??2
Sample
Null distribution t with n1n2-2 df
Test statistic
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
39
Two-sample t-test

Ho The means of the two populations are equal
Ha The means of the two populations are not equal

40
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

41
F-test for Comparing the variance of two groups
42
F-test
Null hypothesis The two populations have the
same variance ?21? ?22
Sample
Null distribution F with n1-1, n2-1 df
Test statistic
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
43
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

44
Welchs t-test
Null hypothesis The two populations have the
same mean ?1??2
Sample
Null distribution t with df from formula
Test statistic
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
45
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

46
Parametric
Nonparametric
One-sample and Paired t-test
Sign test
Mann-Whitney U-test
Two-sample t-test
47
Quick Reference Summary Sign Test

What is it for? A non-parametric test to compare
the medians of a group to some constant
What does it assume? Random samples
Formula Identical to a binomial test with po
0.5. Uses the number of subjects with values
greater than and less than a hypothesized median
as the test statistic.

P 2 Prx?X
P(x) probability of a total of x successes p
probability of success in each trial n total
number of trials
48
Sign test
Null hypothesis Median mo
Sample
Test statistic x number of values greater than
mo
Null distribution Binomial n, 0.5
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
49
Sign Test

Ho The median is equal to some value mo
Ha The median is not equal to mo

50
Quick Reference Summary Mann-Whitney U Test

What is it for? A non-parametric test to compare
the central tendencies of two groups
What does it assume? Random samples
Test statistic U
Distribution under Ho U distribution, with
sample sizes n1 and n2
Formulae

n1 sample size of group 1 n2 sample size of
group 2 R1 sum of ranks of group 1
Use the larger of U1 or U2 for a two-tailed test
51
Mann-Whitney U test
Null hypothesis The two groups Have the same
median
Sample
Test statistic U1 or U2 (use the largest)
Null distribution U with n1, n2
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
52
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

53
Quick Reference Guide - Correlation Coefficient

What is it for? Measuring the strength of a
linear association between two numerical
variables
What does it assume? Bivariate normality and
random sampling
Parameter ?
Estimate r
Formulae

54
Quick Reference Guide - t-test for zero linear
correlation

What is it for? To test the null hypothesis that
the population parameter, ?, is zero
What does it assume? Bivariate normality and
random sampling
Test statistic t
Null distribution t with n-2 degrees of freedom
Formulae

55
T-test for correlation
Null hypothesis ?0
Sample
Test statistic
Null distribution t with n-2 d.f.
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
56
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

57
Quick Reference Guide - Spearmans Rank
Correlation

What is it for? To test zero correlation between
the ranks of two variables
What does it assume? Linear relationship between
ranks and random sampling
Test statistic rs
Null distribution See table if ngt100, use
t-distribution
Formulae Same as linear correlation but based on
ranks

58
Spearmans rank correlation
Null hypothesis ?0
Sample
Test statistic rs
Null distribution Spearmans rank Table H
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
59
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

60
Assumptions of Regression

At each value of X, there is a population of Y
values whose mean lies on the true regression
line
At each value of X, the distribution of Y values
is normal
The variance of Y values is the same at all
values of X
At each value of X the Y measurements represent a
random sample from the population of Y values

61
OK
Non-normal
Unequal variance
Non-linear
62
Quick Reference Summary Confidence Interval for
Regression Slope

What is it for? Estimating the slope of the
linear equation Y ? ?X between an explanatory
variable X and a response variable Y
What does it assume? Relationship between X and
Y is linear each Y at a given X is a random
sample from a normal distribution with equal
variance
Parameter ?
Estimate b
Degrees of freedom n-2
Formulae

63
(No Transcript)
64
Quick Reference Summary t-test for Regression
Slope

What is it for? To test the null hypothesis that
the population parameter ? equals a null
hypothesized value, usually 0
What does it assume? Same as regression slope
C.I.
Test statistic t
Null distribution t with n-2 d.f.
Formula

65
T-test for Regression Slope
Null hypothesis ?0
Sample
Test statistic
Null distribution t with n-2 df
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
66
Statistical tests

Binomial test
Chi-squared goodness-of-fit
Proportional, binomial, poisson
Chi-squared contingency test
t-tests
One-sample t-test
Paired t-test
Two-sample t-test

F-test for comparing variances
Welchs t-test
Sign test
Mann-Whitney U
Correlation
Spearmans r
Regression
ANOVA

67
Quick Reference Summary ANOVA (analysis of
variance)

What is it for? Testing the difference among k
means simultaneously
What does it assume? The variable is normally
distributed with equal standard deviations (and
variances) in all k populations each sample is a
random sample
Test statistic F
Distribution under Ho F distribution with k-1
and N-k degrees of freedom

68
Quick Reference Summary ANOVA (analysis of
variance)

Formulae

mean of group i overall mean
ni size of sample i N total sample size
69
ANOVA
Null hypothesis All groups have the same mean
k Samples
Test statistic
Null distribution F with k-1, N-k df
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
70
ANOVA

Ho All of the groups have the same mean
Ha At least one of the groups has a mean that
differs from the others

71
ANOVA Tables
Source of variation Sum of squares df Mean Squares F ratio P
Treatment k-1
Error N-k
Total N-1
72
Picture of ANOVA Terms
SSTotal MSTotal
SSGroup MSGroup
SSError MSError
73
Two-factor ANOVA Table
Source of variation Sum of Squares df Mean Square F ratio P
Treatment 1 SS1 k1 - 1 SS1 k1 - 1 MS1 MSE
Treatment 2 SS2 k2 - 1 SS2 k2 - 1 MS2 MSE
Treatment 1 Treatment 2 SS12 (k1 - 1)(k2 - 1) SS12 (k1 - 1)(k2 - 1) MS12 MSE
Error SSerror XXX SSerror XXX
Total SStotal N-1
74
Interpretations of 2-way ANOVA Terms
75
Interpretations of 2-way ANOVA Terms
Effect of Temperature, Not pH
76
Interpretations of 2-way ANOVA Terms
Effect of pH, Not Temperature
77
Interpretations of 2-way ANOVA Terms
Effect of pH and Temperature, No interaction
78
Interpretations of 2-way ANOVA Terms
Effect of pH and Temperature, with interaction
79
Quick Reference Summary 2-Way ANOVA

What is it for? Testing the difference among
means from a 2-way factorial experiment
What does it assume? The variable is normally
distributed with equal standard deviations (and
variances) in all populations each sample is a
random sample
Test statistic F (for three different
hypotheses)
Distribution under Ho F distribution

80
Quick Reference Summary 2-Way ANOVA

Formulae

Just need to know how to fill in the table
81
2-way ANOVA
Null hypotheses (three of them)
Samples
Test statistic
Null distribution F
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
82
2-way ANOVA
Null hypotheses (three of them)
Samples
Treatment 1
Test statistic
Null distribution F
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
83
2-way ANOVA
Null hypotheses (three of them)
Samples
Treatment 2
Test statistic
Null distribution F
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
84
2-way ANOVA
Null hypotheses (three of them)
Samples
Interaction
Test statistic
Null distribution F
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
85
General Linear Models

First step formulate a model statement
Example

86
General Linear Models

Second step Make an ANOVA table
Example

Source of variation Sum of squares df Mean Squares F ratio P
Treatment k-1
Error N-k
Total N-1

87
Randomization test
Null hypothesis Randomized data
Sample
Calculate the same test statistic on the
randomized data
Null distribution
Test statistic
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
88
Which test do I use?
89
Methods for a single variable
1
How many variables am I comparing?
Methods for comparing two variables
2
90
Methods for a single variable
1
How many variables am I comparing?
Methods for comparing two variables
2
3
Methods for comparing three or more variables
91
Methods for one variable
Is the variable categorical or numerical?
Categorical
Comparing to a single proportion po or to a
distribution?
Numerical
po
distribution
One-sample t-test
?2 Goodness- of-fit test
Binomial test
92
Methods for two variables
X
Contingency analysis
Logistic regression
Y
Correlation Regression
t-test ANOVA
93
How many variables am I comparing?
1
2
Is the variable categorical or numerical?
Categorical
Contingency analysis
Logistic regression
Numerical
Comparing to a single proportion po or to a
distribution?
t-test ANOVA
Correlation Regression
One-sample t-test
po
distribution
Contingency analysis
?2 Goodness- of-fit test
Binomial test

Write a Comment

User Comments (0)