Title: Nonparametric Methods III
1Nonparametric Methods III
Henry Horng-Shing Lu Institute of
Statistics National Chiao Tung University hslu_at_sta
t.nctu.edu.tw http//tigpbp.iis.sinica.edu.tw/cour
ses.htm
2PART 4 Bootstrap and Permutation Tests
- Introduction
- References
- Bootstrap Tests
- Permutation Tests
- Cross-validation
- Bootstrap Regression
- ANOVA
3References
- Efron, B. Tibshirani, R. (1993). An Introduction
to the Bootstrap. Chapman Hall/CRC. - http//cran.r-project.org/doc/contrib/Fox-Companio
n/appendix-bootstrapping.pdf - http//cran.r-project.org/bin/macosx/2.1/check/boo
tstrap-check.ex - http//bcs.whfreeman.com/ips5e/content/cat_080/pdf
/moore14.pdf
4Hypothesis Testing (1)
- A statistical hypothesis test is a method of
making statistical decisions from and about
experimental data. - Null-hypothesis testing just answers the question
of how well the findings fit the possibility
that chance factors alone might be responsible. - This is done by asking and answering a
hypothetical question. - http//en.wikipedia.org/wiki/Statistical_hypothesi
s_testing
5Hypothesis Testing (2)
- Hypothesis testing is largely the product of
Ronald Fisher, Jerzy Neyman, Karl Pearson and
(son) Egon Pearson. Fisher was an agricultural
statistician who emphasized rigorous experimental
design and methods to extract a result from few
samples assuming Gaussian distributions.
6Hypothesis Testing (3)
- Neyman (who teamed with the younger Pearson)
emphasized mathematical rigor and methods to
obtain more results from many samples and a wider
range of distributions. Modern hypothesis testing
is an (extended) hybrid of the Fisher vs.
Neyman/Pearson formulation, methods and
terminology developed in the early 20th century.
7Hypothesis Testing (4)
8Hypothesis Testing (5)
9Hypothesis Testing (6)
10Hypothesis Testing (7)
- Parametric Tests
- Nonparametric Tests
- Bootstrap Tests
- Permutation Tests
11Confidence Intervals vs.
Hypothesis Testing (1)
- Interval estimation ("Confidence Intervals") and
point estimation ("Hypothesis Testing") are two
different ways of expressing the same
information. - http//www.une.edu.au/WebStat/unit_materials/c5_in
ferential_statistics/confidence_interv_hypo.html
12Confidence Intervals vs.
Hypothesis Testing (2)
- If the exact p-value is reported, then the
relationship between confidence intervals and
hypothesis testing is very close. However, the
objective of the two methods is different - Hypothesis testing relates to a single conclusion
of statistical significance vs. no statistical
significance. - Confidence intervals provide a range of plausible
values for your population.
13Confidence Intervals vs.
Hypothesis Testing (3)
- Which one?
- Use hypothesis testing when you want to do a
strict comparison with a pre-specified hypothesis
and significance level. - Use confidence intervals to describe the
magnitude of an effect (e.g., mean difference,
odds ratio, etc.) or when you want to describe a
single sample. - http//www.nedarc.org/nedarc/analyzingData/advance
dStatistics/convidenceVsHypothesis.html
14P-value
- http//bcs.whfreeman.com/ips5e/content/cat_080/pdf
/moore14.pdf
15Achieved Significance Level (ASL)
- Definition
- A hypothesis test is a way of deciding whether
or not the data decisively reject the hypothesis
. - The archived significance level of the test
(ASL) is defined as . - The smaller ASL, the stronger is the evidence of
false. - The ASL is an estimate of the p-value by
permutation and bootstrap methods. - https//www.cs.tcd.ie/Rozenn.Dahyot/453Bootstrap/0
5_Permutation.pdf
16Bootstrap Tests
- Methodology
- Flowchart
- R code
17Bootstrap Tests
- Beran (1988) showed that bootstrap inference is
refined when the quantity bootstrapped is
asymptotically pivotal. - It is often used as a robust alternative to
inference based on parametric assumptions. - http//socserv.mcmaster.ca/jfox/Books/Companion/ap
pendix-bootstrapping.pdf
18Hypothesis Testing by a Pivot (1)
- Pivot or pivotal quantity a function of
observations whose distribution does not depend
on unknown parameters. - http//en.wikipedia.org/wiki/Pivotal_quantity
- Examples
- A pivot
-
- when and is known
19Hypothesis Testing by a Pivot (2)
- An asymptotic pivot
- when
- where , is unknown, and
20One Sample Bootstrap Tests
- T statistics can be regarded as a pivot or an
asymptotic pivotal when the data are normally
distributed. - Bootstrap T tests can be applied when the data
are not normally distributed.
21Bootstrap T tests
22Flowchart of Bootstrap T Tests
Bootstrap B times
23Bootstrap T Tests by R
24Bootstrap Tests by The Bca
- The BCa percentile method is an efficient method
to generate bootstrap confidence intervals. - There is a correspondence between confidence
intervals and hypothesis testing. - So, we can use the BCa percentile method to test
whether H0 is true. - Example use BCa to calculate p-value
25BCa Confidence Intervals
- Use R package boot.ci(boot)
- Use R package bcanon(bootstrap)
- http//qualopt.eivd.ch/stats/?pagebootstrap
- http//www.stata.com/capabilities/boot.html
26R package "boot.ci(boot)"
- http//finzi.psych.upenn.edu/R/library/boot/DESCRI
PTION
27An Example of "boot.ci" in R
28R package "bcanon(bootstrap)"
- http//finzi.psych.upenn.edu/R/library/bootstrap/D
ESCRIPTION
29An example of "bcanon" in R
30BCa
- http//qualopt.eivd.ch/stats/?pagebootstrap
31Two Sample Bootstrap Tests
32Flowchart of Two-Sample Bootstrap Tests
mnN
combine
Bootstrap B times
33Two-Sample Bootstrap Tests by R
34Permutation Tests
- Methodology
- Flowchart
- R code
35Permutation
- In several fields of mathematics, the term
permutation is used with different but closely
related meanings. They all relate to the notion
of (re-)arranging elements from a given finite
set into a sequence. - http//en.wikipedia.org/wiki/Permutation
36Permutation Tests (1)
- Permutation test is also called a randomization
test, re-randomization test, or an exact test. - If the labels are exchangeable under the null
hypothesis, then the resulting tests yield exact
significance levels.
37Permutation Tests (2)
- Confidence intervals can then be derived from the
tests. - The theory has evolved from the works of R.A.
Fisher and E.J.G. Pitman in the 1930s. - http//en.wikipedia.org/wiki/Pitman_permutation_te
st
38Applications of Permutation Tests (1)
- We can use a permutation test only when we can
see how to resample in a way that is consistent
with the study design and with the null
hypothesis. - http//bcs.whfreeman.com/ips5e/content/cat_080/pdf
/moore14.pdf
39Applications of Permutation Tests (2)
- Two-sample problems when the null hypothesis says
that the two populations are identical. We may
wish to compare population means, proportions,
standard deviations, or other statistics. - Matched pairs designs when the null hypothesis
says that there are only random differences
within pairs. A variety of comparisons is again
possible. - Relationships between two quantitative variables
when the null hypothesis says that the variables
are not related. The correlation is the most
common measure of association, but not the only
one.
40Inference by Permutation Tests (1)
- A traditional way is to consider some hypotheses
and , - and the null hypothesis becomes .
- Under , the statistic can be modeled
as a normal distribution with mean - 0 and variance .
- https//www.cs.tcd.ie/Rozenn.Dahyot/453Bootstrap/0
5_Permutation.pdf
41Inference by Permutation Tests (2)
- The ASL is then computed by
- when is unknown and has to be estimated from the
data by - We will reject if .
42Flowchart of The Permutation Test for Mean Shift
in One Sample
Partition 2 subset B times
(treatment group)
(treatment group)
(control group)
(control group)
43An Example for One Sample Permutation Test by R
(1)
44An Example for One Sample Permutation Test by R
(2)
- http//mason.gmu.edu/csutton/EandTCh15a.txt
45An Example for One Sample Permutation Test by R
(3)
46Flowchart of The Permutation Test for Mean Shift
in Two Samples
combine
mnN
Partition subset B times
47Bootstrap Tests vs. Permutation Tests
- Very similar results between the permutation test
and the bootstrap test. - is the exact probability when .
- is not an exact probability but is
guaranteed to be accurate as an estimate of the
ASL, as the sample size B goes to infinity. - https//www.cs.tcd.ie/Rozenn.Dahyot/453Bootstrap/0
5_Permutation.pdf
48Cross-validation
49Cross-validation
- Cross-validation, sometimes called rotation
- estimation, is the statistical practice of
partitioning a sample of data into subsets such
that the analysis is initially performed on a
single subset, while the other subset(s) are
retained for subsequent use in confirming and
validating the initial analysis. - The initial subset of data is called the training
set. - The other subset(s) are called validation or
testing sets. - http//en.wikipedia.org/wiki/Cross-validation
50Overfitting Problems (1)
- In statistics, overfitting is fitting a
statistical model that has too many parameters. - When the degrees of freedom in parameter
selection exceed the information content of the
data, this leads to arbitrariness in the final
(fitted) model parameters which reduces or
destroys the ability of the model to generalize
beyond the fitting data.
51Overfitting Problems (2)
- The concept of overfitting is important also in
machine learning. - In both statistics and machine learning, in order
to avoid overfitting, it is necessary to use
additional techniques (e.g. cross-validation,
early stopping, Bayesian priors on parameters or
model comparison), that can indicate when further
training is not resulting in better
generalization. - http//en.wikipedia.org/wiki/Overfitting
52R package crossval(bootstrap)
53An Example of Cross-validation by R
54Bootstrap Regression
- Bootstrapping pairs
- Resample from the sample pairs .
- Bootstrapping residuals
- 1. Fit by the original sample and
obtain the residuals. - 2. Resample from residuals.
55Bootstrapping Pairs by R (1)
- http//www.stat.uiuc.edu/babailey/stat328/lab7.ht
ml
56Bootstrapping Pairs by R (2)
57Bootstrapping Residuals by R
58ANOVA
- When random errors follow a normal distribution
- When random errors do not follow a Normal
distribution - Bootstrap tests
- Permutation tests
59An Example of ANOVA by R (1)
- Example
- Twenty lambs are randomly assigned to three
different diets. The weight gain (in two weeks)
is recorded. Is there a difference among the
diets? - http//mcs.une.edu.au/stat261/Bootstrap/bootstrap
.R
60An Example of ANOVA by R (2)
61An Example of ANOVA by R (3)
62An Example of ANOVA by R (4)
63An Example of ANOVA by R (5)
64An Example of ANOVA by R (6)
65An Example of ANOVA by R (7)
66An Example of ANOVA by R (1)
- Data source
- http//finzi.psych.upenn.edu/R/library/rpart/html/
kyphosis.html - Reference
- http//www.stat.umn.edu/geyer/5601/examp/parm.html
67An Example of ANOVA by R (2)
- Kyphosis is a misalignment of the spine. The data
are on 83 laminectomy (a surgical procedure
involving the spine) patients. The predictor
variables are age and age2 (that is, a quadratic
function of age), number of vertebrae involved in
the surgery and start the vertebra number of the
first vertebra involved. The response is presence
or absence of kyphosis after the surgery (and
perhaps caused by it).
68An Example of ANOVA by R (3)
69An Example of ANOVA by R (4)
70An Example of ANOVA by R (5)
71An Example of ANOVA by R (6)
72Exercises
- Write your own programs similar to those examples
presented in this talk. - Write programs for those examples mentioned at
the reference web pages. - Write programs for the other examples that you
know. - Practice Makes Perfect!