Goodness of Fit Tests - PowerPoint PPT Presentation

1 / 15
About This Presentation
Title:

Goodness of Fit Tests

Description:

... number of males and females, and that 60% of the population should be mature. ... Mature. Male. Mature. Female. 381. Observed and Expected Frequencies ... – PowerPoint PPT presentation

Number of Views:239
Avg rating:3.0/5.0
Slides: 16
Provided by: cour92
Category:
Tags: fit | goodness | mature | tests

less

Transcript and Presenter's Notes

Title: Goodness of Fit Tests


1
Goodness of Fit Tests
  • QSCI 381 Lecture 41
  • (Larson and Farber, Sect 10.1)

2
Multinomial Experiments
  • A
    is a probability experiment consisting of a
    fixed number of trials in which there are more
    than two possible outcomes for each independent
    trial. The probability for each outcome is fixed
    and each outcome is classified into
  • .
  • Examples of multinomial experiments include
  • You sample 100 animals from a population. The
    categories could be age, length, maturity state.
  • You sample 1000 poppies in a field. The
    categories could be colour.
  • You sample 20 animals and calculate the frequency
    that each has a particular genetic haplotype.

multinomial experiment
categories
3
Goodness-of-fit Tests
  • A
    is used to test whether an observed
    frequency distribution fits an expected
    distribution.
  • We need to specify a null and an alternative
    hypothesis. Generally the null hypothesis is that
    the observed frequency distribution (the data)
    fits the expected distribution. The alternative
    hypothesis is that this is not the case.

chi-square goodness-of-fit test
4
Example-I
  • We expect that a healthy marine mammal
    population should consist of an equal number of
    males and females, and that 60 of the population
    should be mature. We sample 150 animals and
    assess the fraction in each of four categories to
    be

5
Observed and Expected Frequencies
  • The
    of a category is the frequency for the category
    observed in the data.
  • The
    of a category is the calculated frequency for the
    category. Expected frequencies are obtained by
    assuming the specified (or hypothesized)
    distribution is correct. The expected frequency
    for the i th category is
  • Where n is the number of trials, and pi is the
    assumed probability for the i th category.

observed frequency O
expected frequency E
6
Observed and Expected Frequencies(Example)
7
The Chi-square goodness-of-fit Test-I
  • IF
  • the observed frequencies are obtained from a
    random sample, and
  • the expected frequencies are greater than or
    equal to 5 (pool categories if this is not the
    case).
  • then the sampling distribution for the
    goodness-of-fit test is a chi-square distribution
    with k-1 degrees of freedom where k is the number
    of categories. The test statistic is

8
The Chi-square goodness-of-fit Test-II
  • Identify the claim and state the null and
    alternative hypotheses.
  • Specify the level of significance, ?.
  • Determine the degrees of freedom, d.fk-1.
  • Find the critical value of the chi-square
    distribution and hence define the rejection
    region for the test.
  • Calculate the test statistic.
  • Check whether or not the value of the test
    statistic is in the rejection region.

9
Example (Test using ?0.01)
  • H0 the distribution of animals between sex and
    maturity classes equals that expected for a
    healthy population.
  • The degrees of freedomk-13.
  • The critical value of the chi-square distribution
    is 11.34 (CHIINV(0.01,3))

10
Example (Test using ?0.01)
  • We reject the null hypothesis at the
  • 1 level of significance.

11
Example-A-1 (?0.05)
  • The probability of a particular bird species
    utilizing each of five habitats is known. We
    collect data for a different species (n137) and
    wish to assess whether the two species differ in
    their habitat requirements.

12
Example-A-2 (?0.05)
The critical value is 9.49 we fail to reject
the null hypothesis
13
Testing for Normality
  • We can use the chi-square test in some cases to
    assess whether a variable is normally
    distributed.
  • The null and alternative hypotheses are that
  • The variable has a normal distribution.
  • The variable does not have a normal distribution.

14
Example
Can we assume that these data are normal (assume
?0.05)?
15
Calculating the Test Statistic
Eipi x 149
xi is the mid-point of each class
Write a Comment
User Comments (0)
About PowerShow.com