Power recap - PowerPoint PPT Presentation

1 / 47

About This Presentation

Title:

Power recap

Description:

p-values of 1 fake data is crap! Power recap. A 1000 simulation Power analyses ... Exam. www.bokfynd.nu. www.abebooks.com. Nice Books. Lunch? or. Dedication! ... – PowerPoint PPT presentation

Number of Views:32

Avg rating:3.0/5.0

Slides: 48

Provided by: didrikvan

Category:

more less

Transcript and Presenter's Notes

Title: Power recap

1
Power recap
2
Power recap

It is good to fake data
BUT
p-values of 1 fake data is crap!

3
Power recap

A 1000 simulation Power analyses is not crap!
BUT
Power depends on
Sample size
Effect size
Variation

4
Project considerations I

Make graphs
Check for outliers
Check assumptions
Decide if you want to transform y and or x
Check VIF
Are your assumtions still fkd up.
? Well, thats for today.

5
Project considerations II

Interpret interactions first!
If they are significant? Are main effects
still interpretable?
Distinguish between y x1 and y x1
given x2
Simplify your models!

6
2 ? 2 tables
Logistic regression
Categoric
Melica
1.0
0.8
0.6
Prob. of choosing Melica
0.4
0.2
0.0
Response variable
Luzula
4.5
5.5
6.5
7.5
Ant size
Regression
Anova
Continuous
-
-
Seed size
Continuous
Categoric
Explanatory variable
7
Response variable
Regression
Anova
Continuous
-
-
Seed size
Continuous
Categoric
Explanatory variable
8
Assumptions for parametric tests with continuous
response? i.e., also linear models!!

About the same variation in all groups or along a
continuous variable or along fitted values
Pretty normal residuals ( noice)

9
The residuals

are the noice that is not explained by the
explanatory variable(s)
In a regression the residuals are the distance
from the data points to the regression line
In an Anova the residual are the distance to the
group mean
In a linear model the residuals are the distance
from the data points to the fitted values.

10
Residuals
11
Residuals
12
Assumption check
13
Assumption check
14
Assumption check
15
Solutions

Poisson for counts (generalized linear model)
Non-parametric tests
Resampling methods
Permutation
Bootstrap
Binarize your response

16
quasipoisson fit on Xanthoria
17
Xanthoria reproduction again
18
Poisson distribution

Response Numbers (not true continuous)
Examples
Are there more maple seedlings close to a maple?
Response number per square
m1lt-glm(numberdistance,familyPoisson)

19
Poisson distribution

Usually log(y) also works fine.
Poisson excells
small means
many zeroes
Many zeroes ? Hurdle models

20
break?
21
Non-parametric tests

Based on ranked values instead of actual data.

22
Non-parametric tests

Still often in use.
Questionable with modern computers.
? In principle permutions of ranked values
? But worse than real permutations, because
information about actual data values is
discarded.

23
Non-parametric tests

Still often in use.
Questionable with modern computers.
? In principle permutions of ranked values
? But worse (than real permutations) because
information about actual data values is
discarded.

BENEFIT Calm dow outliers!
?
24
Response variable
Regression
Anova 2 groups also t-test
Continuous
-
-
Seed size
Continuous
Categoric
Explanatory variable
25
Response variable
Kruskal-Wallis also Mann-Whitney U-test Paired
Sign test(binomial)
Kendall rank correlation also Spearman rank
Continuous
-
-
Seed size
Continuous
Categoric
Explanatory variable
26
Permutations

Does not require normal distribution
BUT, does require distributions to be equal if
your hypothesis is not true.
? Example
If the lichens are equally large in the city as
they are at campus, they must have the same
variation and e.g., skewness.
In principle a test of if the distributions
differ.

27
Ash seed dispersal
28
Acer twigs plasticity
29
Birch cost of reproduction
30
bootstrap

to pull oneself up by one's bootstraps
to succeed only on one's own effort or
abilities.

31
shrimp-booting...
32
Rumex crispus Rumex longifolius
300
250
250
200
200
150
150
100
100
50
50
0
0
1.0
1.1
1.2
1.3
1.4
1.5
1.25
1.30
1.35
1.40
33
Confidence intervals

shows how sure we are of a group mean.
The confidence interval will contain the true
mean in 95 of the time.
The larger our sample size the more sure (
confident!) we are of our sample mean ? the
confidence interval decreases
And (of course), the more variation within
groups, the less sure we get ? confidence
interval increases

34
Bootstrap for tests
120
80
No. boot-samples
60
40
20
0
-5
0
5
10
15
20
25
boot.difference
35
Bootstrap

Does not require normal distribution of
residuals.
Does not require the same variation.
Only requirement is that what you bootstrap
(e.g., means) are the same if your hypothesis is
not correct.
And, in practice, a large, representative sample