STATISTICS HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests - PowerPoint PPT Presentation

About This Presentation
Title:

STATISTICS HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests

Description:

HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University – PowerPoint PPT presentation

Number of Views:239
Avg rating:3.0/5.0
Slides: 35
Provided by: rslabntuN8
Category:

less

Transcript and Presenter's Notes

Title: STATISTICS HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests


1
STATISTICSHYPOTHESES TEST (III)Nonparametric
Goodness-of-fit (GOF) tests
  • Professor Ke-Sheng Cheng
  • Department of Bioenvironmental Systems
    Engineering
  • National Taiwan University

2
Description of nonparametric Problems
  • Until now, in the estimation and hypotheses
    testing problems, we have assumed that the
    available observations come from distributions
    for which the exact form is known, even though
    the values of some parameters are unknown. In
    other words, we have assumed that the
    observations come from a certain parametric
    family of distributions, and a statistical
    inference must be made about the values of the
    parameters defining that family.

3
  • In many situations, we do not assume that the
    available observations come from a particular
    family of distributions. Instead, we want to
    study inferences that can be made about the
    distribution from which the observations come,
    without making special assumptions about the form
    of that distribution.

4
  • For example, we might simply assume that
    observations form a random sample from a
    continuous distribution, without specifying the
    form of this distribution any further and we
    then investigate the possibility that this
    distribution is a normal distribution.

5
  • Problems in which the possible distributions of
    the observations are not restricted to a specific
    parametric family are called nonparametric
    problems, and the statistical methods that are
    applicable in such problems are called
    nonparametric methods.

6
Goodness-of-fit test
  • A very common statistical problem in hydrological
    frequency analysis or water resources planning is
    that whether the available observations (a random
    sample available to us) come from a particular
    type of distribution. For example, before we can
    estimate the magnitude of the 24-hour rainfall
    depth with 100-year return period, we must decide
    (identify) the type of probability distribution
    for the rainfall data (the annual maximum series)
    through statistical tests.

7
  • Lets consider statistical problems based on data
    such that each observation can be classified as
    belonging to one of a finite number of possible
    categories. If a large population consists of
    data of k different categories, and let pi denote
    the probability that an observation will belong
    to category i (i 1, 2, , k). Of course, for
    i 1, 2, , k and .

8
(No Transcript)
9
(No Transcript)
10
(No Transcript)
11
  • Therefore, it seems reasonable to base a test on
    the values of the differences
  • for i 1, 2, , k and reject Ho when the
    magnitudes of these differences are relatively
    large.

12
Chi-square GOF test
13
(No Transcript)
14
(No Transcript)
15
(No Transcript)
16
(No Transcript)
17
(No Transcript)
18
Number of categories
Sample size
19
(No Transcript)
20
Kolmogorov-Smirnov GOF test
  • The chi-square test compares the empirical
    histogram against the theoretical histogram.
  • In contrast, the K-S test compares the empirical
    cumulative distribution function (ECDF) against
    the theoretical CDF.

21
(No Transcript)
22
(No Transcript)
23
(No Transcript)
24
  • In order to measure the difference between Fn(X)
    and F(X), ECDF statistics based on the vertical
    distances between Fn(X) and F(X) have been
    proposed.

25
(No Transcript)
26
(No Transcript)
27
(No Transcript)
28
(No Transcript)
29
(No Transcript)
30
(No Transcript)
31
(No Transcript)
32
Values of for the Kolmogorov-Smirnov test
33
Goodness-of-fit tests using R
  • ?2 test for GOF test
  • chisq.test
  • The above test doesnt account for any parameters
    in determining the expected values.
  • The degree of freedom of the test statistic is
    k-1.
  • Kolmogorov-Smirnov GOF test
  • ks.test (one-sample test)

34
  • ks.test(x, y, parameters, alternative)
  • where x is the data vector to be tested, y is a
    string vector specifying the hypothesized
    distribution, parameters are the values of
    distribution parameters corresponding to y, and
    alternative represents a string vector (less,
    greater, or two.sided) for one-tail or
    two-tail test.
  • Examples
  • ks.test(x, pnorm, 30, 10, alternativetwo.side
    d)
  • ks.test(x, pexp, 0.2, alternativegreater)
Write a Comment
User Comments (0)
About PowerShow.com