Psychology 315: Statistics I - PowerPoint PPT Presentation

1 / 26

About This Presentation

Title:

Psychology 315: Statistics I

Description:

... rats and injected them with an amphetamine then measured the time they spend on ... Conclude that the amphetamine injections had an effect. ... – PowerPoint PPT presentation

Number of Views:36

Avg rating:3.0/5.0

Slides: 27

Provided by: rickgu

Category:

more less

Transcript and Presenter's Notes

Title: Psychology 315: Statistics I

1
Introduction to the t-test
Aron, Aron Coups, Chapter 8
2
(No Transcript)
3
The t-distribution

We have made use of the z distribution (the
standard normal distribution) to do hypothesis
testing and to compute confidence intervals for
example
For hypothesis testing we can calculate
Z (M - PopulationM ) / PopulationSDM
and determine if Z exceeds our z-score cutoff
(e.g., 1.64)
For confidence intervals we can calculate
Confidence interval M 1.96(PopulationSDM)
These two calculations require knowing
PopulationSD so that we can calculate
PopulationSDM
What if we didnt know PopulationSDM and had to
estimate it from our sample?
What should we use to estimate PopulationSDM ?
Would we be able to substitute this estimate
(estimated PopulationSDM) into our formulas and
have everything work out the same way?

4
Sampling Distributions -- biased and unbiased
estimates

We know that if you repeatedly choose samples of
size N and compute the mean (M) for each sample,
we will create a distribution of sample means.
The mean of the distribution of sample means
(PopulationMM) will equal that of the original
distribution (PopulationM).
In this sense the sample mean (M) is an unbiased
estimate of PopulationM because on average M
PopulationM.

5
Sampling Distributions -- biased and unbiased
estimates

If we repeatedly choose samples of size N and
compute the variance (SD2) for each sample,
we will create a distribution of variances.
The mean of the distribution of variances will
not equal of the variances of original
distribution (PopulationSD2).
In this sense the sample variance (SD2) is a
biased estimate of PopulationSD2 because on
average SD2 ? PopulationSD2.

6
Sampling Distributions -- biased and unbiased
estimates

But, if we repeatedly choose samples of size N
and compute the the variance as
we will create a distribution of variances and
the mean of the distribution of variances
(computed with N-1) will equal of the variance of
original distribution (PopulationSD2).
We refer to N-1 as the number of degrees of
freedom (df) i.e., df N-1
In this sense the sample variance computed with
N-1 in the denominator (S2) is an unbiased
estimate of PopulationSD2 because on average S2
PopulationSD2.

7
Sampling Distributions -- biased and unbiased
estimates

Therefore, going back to our first question,
What should we use to estimate PopulationSDM ?
The answer is we should first compute an unbiased
estimate of PopulationSD2 as
And then we should compute an estimate of
PopulationSDM as
or
Explain the difference between S2 and SD2
Explain the difference between S and SM
Explain the difference between SM and
PopulationSDM

Web Demo
8
The t-distribution

What we know already
The distribution of sample means has the
following parameters
PopulationMM PopulationM
PopulationSDM PopulationSD/sqrt(N)
Thus any particular M can can be converted to a Z
score in the following way
Producing the standard normal distribution or,
in other words, a sampling distribution of
Z-scores.

9
The t-distribution

Gosset raised the following question
How would things change if we divided by SM
rather than PopulationSDM itself?
Would the resulting sampling distribution be
normal?
The answer is no in general t-scores are not
normally distributed.
But the distribution of t-scores is symmetric
and has a mean of zero
The exact form of the t-distribution depends on N
For large N the t-distribution approaches the
standard normal distribution (the z-distribution)
but for small N the t- distribution becomes
leptokurtic.

10
The t-distribution
The good news is that the t-distribution has a
definite form we know this because of smart
people like Gossett
You dont have to remember this!!!
11
The t-distribution
12
The t-distribution
We use the t-distribution in same way we use the
z-distribution we find t-critical in the same
way we find z-critical
t-critical (tcrit)
13
(No Transcript)
14
The one sample t-test

Consider our baby/supercharged vitamin example
again
Lets say you know that on average the first
steps are taken at 14 months
but you dont know PopulationSD

15
The one sample t-test

Assume youve chosen a sample of 16 babies and
given them the vitamins
H0 Pop1M gt Pop2M
H1 Pop1M lt Pop2M
Set ? .05 and perform a one tailed t-test.
The comparison distribution is the t-distribution
with df (N - 1) 15, and tcrit -1.753
Calculate t

16
The one sample t-test

The sample mean is M 8 and and standard
deviation S 12
Calculate SM S / sqrt(N) 12/4 3
Calculate t (M - PopulationM) / SM (8 - 14) /
3 -2
t -2, which is less than tcrit -1.753
Therefore, reject H0.
Conclude that the vitamins had an effect.

17
The one sample t-test

Consider our rat on the running wheel again
Lets say you know that on average rats spend 50
minutes out of 2 hrs on a running wheel but you
dont know PopulationSD

18
The one sample t-test

Assume youve chosen a sample of 25 rats and
injected them with an amphetamine then measured
the time they spend on the running wheel.
H0 Pop1M ??? Pop2M
H1 Pop1M gt Pop2M
Set ? .05 and perform a one tailed test.
The comparison distribution is the t-distribution
with df 24, therefore tcrit 1.711
Calculate t

19
The t-test for dependent means

The sample mean is M 60 and and standard
deviation S 16
Calculate SM S / sqrt(N) 16/5 3.2
Calculate t (M - PopulationM) / SM (60 - 50)
/ 3.2 3.125
t 3.125 which is greater than tcrit 1.711
Therefore, reject H0.
Conclude that the amphetamine injections had an
effect.

20
The t-test for dependent means

Consider our rats on the running wheel again
Repeated measures designs.

21
The t-test for dependent means

Assume youve chosen a sample of 10 rats and
measured their running time before and after an
amphetamine injection.
Before After Difference
12 13 1
14 14 0
15 16 1
13 13 0
12 13 1
11 13 2
14 14 0
13 13 0
0.625 mean Mdiff
0.744 S
0.263 SM S / sqrt(N)
2.38 t (t Mdiff / SM)

22
The t-test for dependent means

Our t score 2.38, which exceeds the cutoff
value of t (1.89) for a one tailed test.
Therefore we reject the null hypothesis.

23
The t-test for dependent means

Assumption the t-test is based on the assumption
that the population you sampled is normal
If this assumption does not hold then the
validity of the t-test may be questions
One way to assess this assumption is to see
whether the sample distribution appears to be
normal.
Estimating Effect Size
Because we dont know the population SD we cant
compute effect size but we can estimate it from
the mean difference and the sample standard
deviation
i.e., M / S

24
The t-test for dependent means

Power

25
The t-test for dependent means

Power when Planning an Experiment

26
Confidence intervals for means and mean
differences

Recall that when we knew the PopulationSD and
sample size, we were able to compute a confidence
interval about a sample mean
i.e., M zcrit (PopulationSDM)
When we have to estimate PopulationSD from S, we
can use SM and the t-distribution to calculate a
condidence interval about a sample mean
i.e., M tcrit (SM)
The same procedure can be used to place a
confidence interval around a mean difference (or
a mean) when PopulationSD is unknown
i.e., MDiff tcrit (SM) where SM is the standard
error of the difference scores.