Statistics Onesample ttest - PowerPoint PPT Presentation

1 / 39
About This Presentation
Title:

Statistics Onesample ttest

Description:

The standard error provides a measure of how well a sample mean approximates the ... The two random samples of dependant scores measure an interval or ratio variable ... – PowerPoint PPT presentation

Number of Views:33
Avg rating:3.0/5.0
Slides: 40
Provided by: wilo
Category:

less

Transcript and Presenter's Notes

Title: Statistics Onesample ttest


1
StatisticsOne-sample t-test
  • June 20, 2000

2
To Do
  • Take over world
  • Use statistics
  • One-sample t-test
  • End of chapter 8
  • Two-sample and related sample t-tests
  • Chapter 9

3
  • One-sample t-test
  • Compared to the z-test
  • The t-distribution and df
  • Examples of its use
  • Confidence intervals
  • Power
  • What it is
  • How to get it

4
Moving from the z-test to the t-test
  • The z-test may be summarized as follows
  • A sample mean is expected to approximate a
    population mu. This allows us to use the sample
    mean to test a hypothesis about the population
    mu.
  • The standard error provides a measure of how well
    a sample mean approximates the population mean.
  • To quantify our inferences about the population,
    we compare the obtained sample mean with the
    hypothesized population mu by computing a z-score
    test statistic

5
Why use the t-test then?
  • Usually, we dont know the population standard
    deviation
  • Therefore, we cant calculate the standard error
    of the sample means
  • Without the standard error of the sample means,
    we dont know how much difference to expect in
    our obtained difference
  • However, the t-test allows us to estimate the
    standard error using the sample standard
    deviation
  • So the t-test formula looks like the z-test
    formula

6
Characteristics of the t-test
  • The t statistic is used to test hypotheses about
    a population mu when the population standard
    deviation is not known.
  • The t statistic formula is similar to the z
    statistic formula, except the t statistic formula
    uses estimated standard error.
  • The estimated standard error is used as an
    estimate of the standard error of the sample mean
    when the population standard deviation is not
    known.
  • It is computed from the sample standard deviation
    and provides an estimate of the standard distance
    between a sample mean and the population mu.
  • The t distribution is effected by the degrees of
    freedom (sample size)

7
Characteristics of the t distribution
  • The shape of the t distribution changes by values
    of degrees of freedom (df)
  • df describes the number of scores in a sample
    that are free to vary. It is equal to the sample
    size minus one.
  • The larger the df (i.e., sample size) the better
    the sample represents its population
  • With a large df ( about 120) the t distribution
    is normal (like the z distribution)
  • With small df values the t distribution is
    flatter and more spread out
  • Not all df values are given in the t table - so
    use the larger critical t value (i.e., for the
    lower df value)

8
Critical values from the z-table at different df
values
  • The critical values of z for a two-tailed test
    with an ? .05 are -1.96 and 1.96
  • For a t with a sample size of 4 and df 3, the
    critical values are -3.182 and 3.182
  • For a t with a sample size of 31 and df 30,
    the critical values are -2.042 and 2.042
  • For a t with a sample size of 121 and df 120,
    the critical values are -1.980 and 1.980

9
Same basic experimental situation
Known population before treatment
Unknown population after treatment
Treatment
? 30
? ?
10
Procedures for conducting the t-test
  • Step 1 The null and alternative hypotheses are
    stated and the alpha level is set (typically to
    .05)
  • Step 2 The critical region is located using the
    df value, direction of the hypothesis, and the
    alpha value in the t distribution table.
  • Step 3 The sample data are collected and the test
    statistic is calculated
  • Step 4 The null hypothesis is evaluated.

11
Conclusions from t results
  • If the t statistic falls within the critical
    region (exceeds the value of the critical t) then
    the null hypothesis (Ho) is rejected.
  • We then conclude that a treatment effect exists.
  • If the t statistic does not fall within the
    critical region (is less than the value of the
    critical t) then we fail to reject the null
    hypothesis (Ho).
  • We then conclude that we failed to observe
    evidence for a treatment effect in our study.

12
Reporting results of t-tests
  • Proper format for reporting t-test results
  • t(8) -3.61, p lt .05
  • This says
  • We preformed the t-test
  • We had 8 degrees of freedom
  • Our tobt is -3.61
  • This probability of our obtained t value is less
    than 5, therefore the difference between the
    mean and the mu is significant and we reject the
    null hypothesis

13
Estimating the population mu
  • Estimate the population mu using the sample mean
  • Point estimation
  • Likely to be wrong (too specific)
  • Estimate a range of scores that may contain the
    population mu
  • Interval estimation
  • Accounts for sampling error
  • A confidence interval contains the highest and
    lowest values of the population mu that are not
    significantly different from our mean

14
Interpreting confidence intervals
  • The computational formula for confidence
    intervals is
  • Use the two-tailed critical values of t
  • The amount of confidence (such as 95) is related
    to the alpha value used when finding tcrit
  • Using an alpha of .05, you calculate a 95
    confidence interval
  • Using an alpha of .01, you calculate a 99
    confidence interval

(sx)(-tcrit) X ? (sx)(tcrit) X
15
Experimental Power
  • The power of a statistical test is the
    probability that the test will correctly reject a
    false null hypothesis.
  • Power is effected by
  • The alpha level, reduce alpha (i.e., from .05 to
    .01) you reduce your power
  • One-tailed versus two-tailed tests, one-tailed
    tests have more power
  • Sample size, increasing sample size increases
    your power

16
Power and Effect Size
  • Effect size (i.e., Cohens d) is an indication of
    how much the IV and DV are related
  • Hypothesis tests (i.e. the t-test) are an
    indication of how reliable the relationship
    between the IV and DV is
  • Power and effect size
  • The smaller the effect size, the more power
    required to find the effect
  • A larger effect size does not need as much power
  • Some non-significant results are due to a lack of
    power

17
Two-Sample t-tests
  • Transition from one to two sample tests
  • Two-sample t-tests
  • Independent Samples
  • Homogeneity of variance
  • Related Samples

18
Contrasting the z-test, one-sample and
two-samples t-test
  • In the z-test, we know the population mu and
    standard deviation
  • In the one-sample t-test, we know the population
    mu but do not know the population standard
    deviation
  • estimate the standard error using the sample
    standard deviation
  • In the two-samples t-test, we dont know either
    the population mu or standard deviation
  • estimate the differences in the population with
    the sample means and estimate the standard error
    of the differences using the pooled sample
    variances

19
Comparing the z-test, one-sample and two-samples
t-tests
  • The test equation has the same basic form

20
Two-sample research design
  • An experiment that uses a separate sample for
    each treatment condition (or each population) is
    call an independent-samples research design
  • We are interested in the mean difference between
    two populations

Population A
Population B
Unknown ? ?
Unknown ? ?
?A - ?B gt 0
Sample A
Sample B
21
Assumptions of the two-sample t-test
  • The two random samples of dependant scores
    measure an interval or ratio variable
  • The population of raw scores represented by each
    sample is normally distributed and best described
    by the mean.
  • We do not know the variance of either population
    and must estimate it from the sample data.
  • The populations represented by our samples have
    homogeneous variance.
  • Homogeneity of variance means that the true
    variance of the two population distributions are
    the same.
  • This is especially important when the sample
    sizes are not equal.

22
Null and alternative hypotheses tested by the
two-sample t-test
  • Two-tailed hypotheses predicting a mean
    difference of zero
  • Ha ?1 - ?2 ? 0
  • Ho ?1 - ?2 0
  • Two-tailed hypotheses predicting a non-zero mean
    difference
  • Ha ?1 - ?2 ? 10
  • Ho ?1 - ?2 10
  • One-tailed hypotheses predicting a difference of
    zero
  • Ha ?1 - ?2 gt 0
  • Ho ?1 - ?2 0

23
Steps in conducting a independent-samples t-test
  • Identify your experimental hypothesis (then your
    Ho and Ha) and select an alpha level (typically
    .05)
  • Collect data from samples that meet the
    assumptions of the two-sample t-test and
    calculate the means and variances
  • Calculate the pooled variance using the sample
    variances then calculate the standard error of
    the difference using the pooled variance then
    calculate tobt using the standard error of the
    difference
  • Compare your tobt with the tcrit in the tables
  • For two-sample t-tests, df (n1 - 1) (n2-1)
  • Report your results and graph the means
  • Calculate the confidence interval for the mean
    differences
  • Calculate the effect size (using either rpb or d)

24
Calculating the error term of the two-sample
t-test
  • In estimating two population mus we have two
    sources of error
  • x1 approximates ?1 with some error
  • x2 approximates ?2 with some error
  • We are interested in the combined error of the
    samples so we pool the variance
  • This gives us an average error of the two
    samples
  • Weigh variances by their df
  • Way of accounting for differences in sample sizes
  • Larger samples get more weight because they are
    better estimates of the population
  • Use the pooled variance to calculate the standard
    error of the difference

25
Sampling distribution of mean differences when ?1
- ?1 0
  • We are interested in the significance of the
    difference of our sample scores
  • So we have a sampling distribution of mean
    differences
  • We are comparing our differences in sample means
    to the differences in population mus given by the
    null hypothesis

26
Results of the t-test
  • Present the results of your t-test
  • t(30) 2.94, p lt .05
  • df 30
  • tobt 2.94
  • Difference is significant
  • Calculate the confidence interval of the mu
    difference
  • If we preformed the experiment on the population,
    we are 95 confident that the difference would be
    between about .90 and 5.08
  • Calculate the effect size
  • The strength of the relationship or how much the
    independent and dependant variable are related
  • rpb2 (.10, .30, .50)
  • d tobt (.20, .50, .80)

27
Power
  • How to enhance the power of you two-sample t-test
  • Maximize the difference produced by the two
    conditions
  • High impact manipulations
  • Very different conditions of the independent
    variable
  • Minimize the variability of the raw scores
  • Good experimental control
  • Eliminate extraneous variables
  • Maximize the sample ns
  • Smaller denominator when calculating tobt
  • Larger df resulting in a smaller value of tcrit

28
Related-sample t-test
  • t-Test experiments
  • Related-sample t-test
  • Designs
  • Dependent variable
  • Pros and cons

29
Types of designs using t-tests
  • Single-sample
  • One sample of subjects
  • Comparing the mean and the mu
  • Independent-samples
  • Two samples of subjects
  • Comparing two means
  • Repeated-measures
  • One sample of subjects measured twice
  • Looking at the difference between the means of
    the two measurements
  • Matched-subjects
  • Two samples of subjects that are paired on a
    certain variable
  • Looking at the difference between the two means

30
Designs for related samples
  • Matched-sample design
  • Each individual in one sample is matched with a
    subject in the other sample
  • The matching is done so that the two individuals
    are equivalent (or nearly equivalent) with
    respect to a specific variable that the
    researcher would like to control
  • Repeated-measures design
  • A single sample of subjects are used to compare
    two different treatment conditions
  • Each individual is measured in one treatment, and
    then the same individual is measured again in the
    second treatment. Thus, a repeated-measures study
    produces two sets of scores, but each is obtained
    from the same sample of subjects

31
Pros and Cons of Related Samples Designs
  • Matched-samples design
  • Pro More powerful control individual
    differences
  • Con Matched on wrong variable
  • Repeated-measures design
  • Pro Even more powerful better control of
    individual differences
  • Con Order effects
  • The first survey may effect performance on the
    second survey
  • Counter balance 50 get A then B 50 get B then
    A

32
Dependent variable in related-samples t-tests
  • In both cases (matched and repeated designs), we
    subtract one score from the other and do a
    one-sample-like t-test on the average difference
    (D)
  • Instead of the mean of x, we use the mean of D
  • D x2 - x1 after - before
  • Instead of a known mu value, we use a value given
    in the null hypothesis (i.e., set by the
    experimenter)

33
Hypotheses Tested
  • Interested in whether or not any difference
    exists between scores in the first treatment and
    scores in the second treatment.
  • Is the population mean difference (?D) equal to
    zero (no change) or has a change occurred?
  • Two-tailed hypotheses
  • Ha ?D ? 0 or Ha ?D ? 20
  • Ho ?D 0 Ho ?D 20
  • One-tailed hypotheses
  • Ha ?D gt 0
  • Ho ?D 0

34
Results of the t-test
  • Present the results of your t-test
  • t(30) 2.94, p lt .05
  • df 30
  • tobt 2.94
  • Difference is significant
  • Calculate the confidence interval of the mu
    difference
  • If we preformed the experiment on the population,
    we are 95 confident that the difference would be
    between about .90 and 5.08

35
Effect Size
  • Calculate the effect size
  • The strength of the relationship or how much the
    independent and dependant variable are related
  • rpb
  • Small - 10
  • Medium - 30
  • Large - 50
  • d tobt
  • Small - 20
  • Medium - 50
  • Large - 80

36
Power
  • How to enhance the power of your two-sample
    t-test
  • Maximize the difference produced by the two
    conditions
  • High impact manipulations
  • Very different conditions of the independent
    variable
  • Minimize the variability of the raw scores
  • Good experimental control
  • Eliminate extraneous variables
  • Maximize the sample ns
  • Smaller denominator when calculating tobt
  • Larger df resulting in a smaller value of tcrit

37
Learning Check
  • 1 What assumptions must be satisfied for the
    repeated-measures t-test to be valid?
  • Random samples, interval or ratio variables,
    population of differences (D) is normally
    distributed, homogeneous variance (always equal n
    sizes)
  • 2 Describe some situations for which a
    repeated-measure design is well suited.
  • When subjects are hard to find (requires less
    subjects) required by your research question
    (differences over time) individual differences
    are large (reduces error)

38
Learning Check
  • 3 How is a matched-subjects design similar to a
    repeated-measures design? How do they differ?
  • They both reduce the role of individual
    differences thereby increasing power. They differ
    in that there are two samples in the matched
    design and one in the repeated measures design.
  • 4 The data from a research study consist of 8
    scores for each of two different treatment
    conditions. How many individual subjects would be
    needed to produce these data.
  • a. For an independent-measures design?
  • 16, two separate sample with n 8 in each
  • b. For a repeated measures design?
  • 8 subjects, the same 8 subjects are measured in
    both treatments
  • c. For a matched-subjects design?
  • 16 subjects, 8 matched pairs

39
The End
Write a Comment
User Comments (0)
About PowerShow.com