Univariate Statistics 2: the normal curve, z-scores, and relative frequencies

Provided by: sys84


1
Univariate Statistics 2: the normal curve,
z-scores, and relative frequencies

Monday 7th November 2005

2
Some Important Characteristics of the Normal
Curve

The normal curve is a symmetrical distribution of
scores with an equal number of scores above and
below the midpoint of the abscissa (horizontal
axis of the curve).
Since the distribution of scores is symmetrical
and single-peaked, the mean, median, and mode are
all at the same point on the abscissa. In other
words, the mean = the median = the mode.
If we divide the distribution up into standard
deviation units, a known proportion of scores
lies within each portion of the curve.
Tables exist so that we can find the proportion
of scores above and below any part of the curve,
expressed in standard deviation units. Scores
expressed in standard deviation units, as we will
see shortly, are referred to as Z-scores.
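
To make the "known proportion" point concrete, here is a minimal sketch, assuming Python 3.8+ and its standard-library `statistics.NormalDist` (neither is part of the original slides). The helper name `proportion_within` is hypothetical. It computes the share of scores lying within 1, 2, and 3 standard deviations of the mean.

```python
from statistics import NormalDist

# Standard normal curve: mean 0, standard deviation 1.
std_normal = NormalDist(mu=0.0, sigma=1.0)

def proportion_within(k):
    """Proportion of scores within k standard deviations of the mean."""
    # Area to the left of +k minus area to the left of -k.
    return std_normal.cdf(k) - std_normal.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} SD of the mean: {proportion_within(k):.4f}")
```

The three proportions come out at roughly 68%, 95%, and 99.7%, which is the same information a printed table gives.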

3
The Standard Normal Distribution.
Mean = 0; standard deviation = 1
Total area under the curve = 100%
4
Working out relative frequencies with a standard
normal distribution

Because the area under a histogram is equal to
100%
And because we are familiar with the shape of a
standard normal distribution
We can use the standard normal distribution to
work out the relative frequencies of different
events.

5
Question: What is the relative frequency of
observations below 1.18?
6
Question: What is the relative frequency of
observations below 1.18? That is, find the
relative frequency of the event Z < 1.18. (Here
small z is 1.18.)
Step 1: Sketch the curve. Identify, on the
measurement (horizontal/X) axis, the indicated
range of values.
The event z < 1.18 is shaded in green. Events and
possibilities are one and the same.
7
Question: What is the relative frequency of
observations below 1.18?
Step 2: The relative frequency of the event is
equal to the area under the curve over the range
of values that describes the event.
The blue area is the relative frequency of the
event z < 1.18. This area appears to be
approximately 85-90%. A good sketch will help
you verify your answer.
8
Question: What is the relative frequency of
observations below 1.18?

Step 3:
Look at the table that describes the area under
the standard normal curve. Some tables list
precise values. If so, you can look up 1.18.
Otherwise, since this is nearly 1.2, you can look
that up for an approximation.
Corresponding to a measurement value of z = 1.18
is an area of 0.8810.
This is exactly the answer to the question!
Notice that it agrees with the picture as well as
the original "guess." For any value z, the table
supplies the area under the curve over the region
to the left of z.
Again, area = relative frequency.
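
The table lookup in Step 3 can be reproduced in software. The following is a minimal sketch, assuming Python 3.8+ and `statistics.NormalDist` from the standard library (an assumption, not something the slides use); the variable names are illustrative only.

```python
from statistics import NormalDist

std_normal = NormalDist()  # mean 0, standard deviation 1

z = 1.18
# Area under the curve to the left of z, i.e. the relative
# frequency of the event Z < z.
area_below = std_normal.cdf(z)

# Coarser lookup for tables that only list one decimal place.
approx_area = std_normal.cdf(1.2)

print(f"area below {z}: {area_below:.4f}")
print(f"area below 1.2 (approximation): {approx_area:.4f}")
```

The first value matches the table entry of 0.8810; the second shows how far off the one-decimal approximation is.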

9
A note on the layout of tables.
Tables showing z-scores are laid out differently.
However, they will all tell you the same thing. A
common layout that is different from Field's is
shown below. If we go down to the row for 1.1 and
across to the column for 0.08, we find the value
for 1.18. It says 0.8810. This is what Field calls
the Larger Portion. This table does not give the
Smaller Portion. But since we know that the two
portions combined = 1.0 (or 100%), we can work it
out as 1 - 0.8810 = 0.1190 (which is what Field
lists as the Smaller Portion).
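
To illustrate the row-and-column layout described above, here is a small sketch that rebuilds one row of such a table. It assumes Python 3.8+ with `statistics.NormalDist`; the function name `table_row` is hypothetical, and the four-decimal rounding simply mimics a typical printed table.

```python
from statistics import NormalDist

std_normal = NormalDist()

def table_row(row_z):
    """Areas to the left of row_z + 0.00, 0.01, ..., 0.09 (4 decimals)."""
    return [round(std_normal.cdf(row_z + col / 100), 4) for col in range(10)]

row = table_row(1.1)

# Row 1.1, column 0.08 corresponds to z = 1.18.
larger_portion = row[8]
# The two portions sum to 1, so the other one follows by subtraction.
smaller_portion = round(1 - larger_portion, 4)

print("row 1.1:", row)
print("larger portion:", larger_portion, "smaller portion:", smaller_portion)
```

A table that lists only one portion therefore loses no information, since the other is always 1 minus the listed value.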
10
Question: What is the relative frequency of
observations below 1.18? Answer: 0.8810 or
88.10%. For a standard normal variable the
relative frequency of observations falling below
1.18 is 0.8810. (Also, for any normal
distribution, 0.8810 or 88.1% of the observations
fall below the point 1.18 standard deviations
above the mean.)
11
Question: What is the relative frequency of
observations below -0.63?
12
Question: What is the relative frequency of
observations below -0.63?

Identify the range of values described by "below
-0.63" (shaded green).
Identify the area you need to find (shaded blue).
Look up the appropriate area in your table. (In
some tables you must be careful to choose the
"negative" portion of your table--look up -0.63.)
That area is 0.2643. For a standard normal
variable the relative frequency of observations
falling below -0.63 is 0.2643. (Also, for any
normal distribution, 0.2643 or 26.43% of the
observations fall below the point 0.63 standard
deviations below the mean. Below because -0.63 is
negative.)

Answer: 0.2643 or 26.43%.
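
For tables that have no "negative" portion, the symmetry of the curve gives the same answer. The sketch below, assuming Python 3.8+ and `statistics.NormalDist` (not part of the slides), checks that the area below -0.63 equals the area above +0.63.

```python
from statistics import NormalDist

std_normal = NormalDist()

z = -0.63
# Direct lookup: area to the left of a negative z.
area_below = std_normal.cdf(z)

# Symmetry: the area below -0.63 is the same as the area above +0.63,
# which is 1 minus the area below +0.63.
area_above_mirror = 1 - std_normal.cdf(0.63)

print(f"area below {z}: {area_below:.4f}")
print(f"area above {-z}: {area_above_mirror:.4f}")
```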
13
However... a problem

So far we've been working with a standard normal
distribution.
However, this is not really that useful, since
most populations do not have a mean of 0 or a
standard deviation of 1.
In the real world, statistics needs to be able to
work out the likelihood of other types of events,
which have a wide range of values.
For instance: How likely is it to get a mark of
at least 75 in an exam, if the mean is 60 and the
standard deviation is 10?

14
z-scores

In order to address these types of problems we
use z-scores.
z-scores are calculated by subtracting the mean
from any value and dividing the result by the
standard deviation.
\[ z = \frac{X - \text{mean}}{s} \]
z-scores will always have a mean of 0 and a
standard deviation of 1.
We can quickly see that this is true of the
mean, since when X = mean, the numerator will
equal 0, and therefore z must = 0.
It may be a little less clear that it is true of
the standard deviation.
However, think about the instance when X
is one standard deviation bigger than the mean
(i.e. X = mean + s):
\[ z = \frac{(\text{mean} + s) - \text{mean}}{s} = \frac{s}{s} = 1 \]
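
The claim that z-scores have a mean of 0 and a standard deviation of 1 can be checked numerically. This is a minimal sketch using Python's standard-library `statistics` module; the list of marks is made up purely for illustration, and the helper `z_scores` is a hypothetical name. It assumes s is the sample standard deviation.

```python
from statistics import mean, stdev

# Made-up exam marks, for illustration only.
marks = [45, 52, 58, 60, 61, 67, 70, 75]

def z_scores(values):
    """Convert raw values to z-scores: (X - mean) / s."""
    m = mean(values)
    s = stdev(values)  # sample standard deviation
    return [(x - m) / s for x in values]

z = z_scores(marks)

# Up to floating-point rounding, these are 0 and 1.
print(f"mean of z-scores: {mean(z):.6f}")
print(f"standard deviation of z-scores: {stdev(z):.6f}")
```

Any list of values with some spread would do; the result does not depend on the particular marks chosen.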

15
Example conversion to z-scores

So, returning to the example of the test marks,
where the question was the likelihood of getting
a mark of 75 or better if mean = 60 and s = 10. We
can calculate the z-score for a mark of 75 as
follows:
\[ z = \frac{75 - 60}{10} = 1.5 \]
Once we know the z-score for a mark, we can work
out the relative frequency with which people will
score that amount or better in exactly the same
way as we did when working with the standard
normal distribution.

16
Example continued.
The likelihood of a mark of 75 or over is the
same as the likelihood of a z-score of 1.5 or over.
It is shown by the red part of the distribution.
If we look up 1.5 we find the area above it (the
smaller portion) is 0.0668.
Therefore there is about a 6.7% probability of
getting a mark of 75 or better.
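
The exam example can be worked through both ways: via the z-score and the standard normal curve, or directly on a normal distribution with mean 60 and standard deviation 10. The sketch assumes Python 3.8+ and `statistics.NormalDist`, and (as the slides do) that marks are normally distributed.

```python
from statistics import NormalDist

mean_mark, sd_mark = 60, 10
mark = 75

# Route 1: convert to a z-score, then use the standard normal curve.
z = (mark - mean_mark) / sd_mark
p_from_z = 1 - NormalDist().cdf(z)  # area to the right of z

# Route 2: ask the N(60, 10) distribution directly.
p_direct = 1 - NormalDist(mu=mean_mark, sigma=sd_mark).cdf(mark)

print(f"z = {z}")
print(f"P(mark >= {mark}) via z-score: {p_from_z:.4f}")
print(f"P(mark >= {mark}) directly:    {p_direct:.4f}")
```

Both routes give the same upper-tail area of roughly 6.7%, which is the point of standardising.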
17
You can also find the score that corresponds with
a certain percentage

For example, you want to find what mark you would
need to be in the top 20%.
Look at the tables in reverse, finding what
z-score corresponds to a smaller portion (p) of .20:
approximately z = .84
Then work out the mark that corresponds to z = .84:
\[ X = \text{mean} + (0.84 \times s) = 60 + (0.84 \times 10) = 68.4 \]
Therefore you would need a mark of 68.4 to be
in the top 20%.
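
Reading the table "in reverse" corresponds to the inverse of the cumulative area function. A minimal sketch, assuming Python 3.8+ and `statistics.NormalDist.inv_cdf` (the variable names are illustrative):

```python
from statistics import NormalDist

mean_mark, sd_mark = 60, 10
top_share = 0.20  # we want the cut-off for the top 20%

# The top 20% starts where 80% of the area lies to the left.
z_cut = NormalDist().inv_cdf(1 - top_share)

# Convert the z-score back to a mark: X = mean + z * s.
mark_cut = mean_mark + z_cut * sd_mark

print(f"z cut-off:    {z_cut:.2f}")
print(f"mark cut-off: {mark_cut:.1f}")
```

The software value of z carries more decimals than a printed table, so the mark may differ from a hand calculation in the second decimal place.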

Example: You want to find the scores people get
in the middle 30%.
This would correspond with scores ranging from 15%
below the mean to 15% above the mean.
Some tables give values between two points; the
one in Field does not.
So we work out that the small area to the right
of b is equal to (50% - 15%) = 35%.
We then look at the tables in reverse, and find
that a smaller portion of .35 occurs at the
approximate z-score .385.
Likewise, the z-score 15% below the mean (at a) is
-.385.
We can then convert these to marks:
60 + (-.385 × 10) = 56.15 and 60 + (.385 × 10) = 63.85
Therefore the middle 30% of marks fell between
56.15 and 63.85.
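
The middle-30% example can be checked the same way. This sketch assumes Python 3.8+ and `statistics.NormalDist`, with marks taken to be normally distributed with mean 60 and standard deviation 10 as in the slides; the variable names are illustrative.

```python
from statistics import NormalDist

marks = NormalDist(mu=60, sigma=10)
middle_share = 0.30

# The middle 30% runs from the 35th to the 65th percentile.
lower_p = 0.5 - middle_share / 2
upper_p = 0.5 + middle_share / 2

lower_mark = marks.inv_cdf(lower_p)  # point a
upper_mark = marks.inv_cdf(upper_p)  # point b

print(f"middle {middle_share:.0%} of marks: "
      f"{lower_mark:.2f} to {upper_mark:.2f}")

# Sanity check: the area between the two marks is the requested share.
area_between = marks.cdf(upper_mark) - marks.cdf(lower_mark)
print(f"area between a and b: {area_between:.2f}")
```

Because the curve is symmetrical, the two cut-offs sit the same distance either side of the mean of 60.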