Title: How do I know which distribution to use
1How do I know which distribution to use?
2Three discrete distributions
Proportional
Poisson
Binomial
3Given a number of categories Probability
proportional to number of opportunities Days of
the week, months of the year
Proportional
Number of successes in n trials Have to know n, p
under the null hypothesis Punnett square, many
p0.5 examples
Binomial
Number of events in interval of space or time n
not fixed, not given p Car wrecks, flowers in a
field
Poisson
4Proportional
Binomial
Binomial with large n, small p converges to the
Poisson distribution
Poisson
5Examples name that distribution
- Asteroids hitting the moon per year
- Babies born at night vs. during the day
- Number of males in classes with 25 students
- Number of snails in 1x1 m quadrats
- Number of wins out of 50 in rock-paper-scissors
6Proportional
Generate expected values
Calculate ?2 test statistic
Binomial
Poisson
7Sample
Null hypothesis
Test statistic
Null distribution
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
8Chi-squared goodness of fit test
Null hypothesis Data fit a particular Discrete
distribution
Sample
Calculate expected values
Chi-squared Test statistic
- Null distribution
- 2 With
- N-1-p d.f.
compare
How unusual is this test statistic?
P gt 0.05
P lt 0.05
Reject Ho
Fail to reject Ho
9The Normal Distribution
10Babies are normal
11Babies are normal
12Normal distribution
13Normal distribution
- A continuous probability distribution
- Describes a bell-shaped curve
- Good approximation for many biological variables
14Continuous Probability Distribution
15The normal distribution is very common in nature
Human body temperature
Human birth weight
Number of bristles on a Drosophila abdomen
16Discrete probability distribution
Continuous probability distribution
Normal
Poisson
Probability
Probability density
PrX2 0.22
PrX2 ?
Probability that X is EXACTLY 2 is very very small
17Discrete probability distribution
Continuous probability distribution
Normal
Poisson
Probability
Probability density
Pr1.5X2.5 Area under the curve
PrX2 0.22
18Discrete probability distribution
Continuous probability distribution
Normal
Poisson
Probability
Probability density
Pr1.5X2.5 0.06
PrX2 0.22
19Normal distribution
Probability density
x
20Normal distribution
Probability density
x
Area under the curve
21But dont worry about this for now!
22A normal distribution is fully described by its
mean and variance
23A normal distribution is symmetric around its mean
Probability Density
Y
24About 2/3 of random draws from a normal
distribution are within one standard deviation of
the mean
25About 95 of random draws from a normal
distribution are within two standard deviations
of the mean
(Really, its 1.96 SD.)
26Properties of a Normal Distribution
- Fully described by its mean and variance
- Symmetric around its mean
- Mean median mode
- 2/3 of randomly-drawn observations fall between
?-? and ?? - 95 of randomly-drawn observations fall between
?-2? and ?2?
27Standard normal distribution
- A normal distribution with
- Mean of zero. (m 0)
- Standard deviation of one. (s 1)
28Standard normal table
- Gives the probability of getting a random draw
from a standard normal distribution greater than
a given value
29Standard normal table Z 1.96
30Standard normal table Z 1.96
PrZgt1.960.025
31Normal Rules
PrXltx PrXgtx1
1
32Standard normal is symmetric, so...
33Normal Rules
- PrX gt x PrX lt -x
- PrX lt x 1- PrX gt x
34Sample standard normal calculations
- PrZ gt 1.09
- PrZ lt -1.09
- PrZ gt -1.75
- Pr0.34 lt Z lt 2.52
- Pr-1.00 lt Z lt 1.00
35What about other normal distributions?
- All normal distributions are shaped alike, just
with different means and variances
36What about other normal distributions?
- All normal distributions are shaped alike, just
with different means and variances - Any normal distribution can be converted to a
standard normal distribution, by
Z-score
37Z tells us how many standard deviations Y is from
the mean
The probability of getting a value greater than Y
is the same as the probability of getting a value
greater than Z from a standard normal
distribution.
38Z tells us how many standard deviations Y is from
the mean
PrZ gt z PrY gt y
39Example British spies
MI5 says a man has to be shorter than 180.3 cm
tall to be a spy. Mean height of British men is
177.0cm, with standard deviation 7.1cm, with a
normal distribution. What proportion of British
men are excluded from a career as a spy by this
height criteria?
40Draw a rough sketch of the question
41m 177.0cm s 7.1cm y 180.3 PrY gt y
42Part of the standard normal table
PrZ gt 0.46 0.32276, so PrY gt 180.3
0.32276
43Sample problem
- MI5 says a woman has to be shorter than 172.7 cm
tall to be a spy. - The mean height of women in Britain is 163.3 cm,
with standard deviation 6.4 cm. What fraction of
women meet the height standard for application to
MI5?