Title: Basic Data Analysis
1Basic Data Analysis
2Measures of Central Tendency
3The Mode
- the value or property that occurs most frequently
in the data
4Find the mode
- 6, 7, 2, 3, 4, 6, 2, 6
- The mode is 6.
5Find the mode
- 6, 7, 2, 3, 4, 5, 9, 8
- There is no mode for this data.
6The Median
- the central value of an ordered distribution
7To find the median of raw data
- Order the data from smallest to largest.
- For an odd number of data values, the median is
the middle value. - For an even number of data values, the median is
found by dividing the sum of the two middle
values by two.
8Find the median
- Data 5, 2, 7, 1, 4, 3, 2
- Rearrange 1, 2, 2, 3, 4, 5, 7
The median is 3.
9Find the median
Data 31, 57, 12, 22, 43, 50 Rearrange 12, 22,
31, 43, 50, 57
The median is the average of the middle two
values
10The Mean
- The mean of a collection of data is found by
- summing all the entries
- dividing by the number of entries
11Find the mean
6, 7, 2, 3, 4, 5, 2, 8
12Sigma Notation
- The symbol ? means sum the following.
- ? is the Greek letter (capital) sigma.
13Notations for mean
- Population mean
- Greek letter (mu)
14Number of entries in a set of data
- If the data represents a sample, the number of
entries n. - If the data represents an entire population, the
number of entries N.
15Sample mean
16Population mean
17Quartiles
- Percentiles that divide the data into fourths
- Q1 25th percentile
- Q2 the median
- Q3 75th percentile
18Quartiles
Median Q2
Q1
Q3
Lowest value
Highest value
Inter-quartile range IQR Q3 Q1
19Computing Quartiles
- Order the data from smallest to largest.
- Find the median, the second quartile.
- Find the median of the data falling below Q2.
This is the first quartile. - Find the median of the data falling above Q2.
This is the third quartile.
20Find the quartiles
- 12 15 16 16 17 18 22 22
- 23 24 25 30 32 33 33 34
- 41 45 51
The data has been ordered. The median is 24.
21Find the quartiles
12 15 16 16 17 18 22 22 23 24 25 30
32 33 33 34 41 45 51
The data has been ordered. The median is 24.
22Find the quartiles
12 15 16 16 17 18 22 22 23 24 25 30
32 33 33 34 41 45 51
For the data below the median, the median is
17. 17 is the first quartile.
23Find the quartiles
12 15 16 16 17 18 22 22 23 24 25 30
32 33 33 34 41 45 51
For the data above the median, the median is
33. 33 is the third quartile.
24Find the interquartile range
- 12 15 16 16 17 18 22 22
- 23 24 25 30 32 33 33 34
- 41 45 51
IQR Q3 Q1 33 17 16
25Five-Number Summary of Data
- Lowest value
- First quartile
- Median
- Third quartile
- Highest value
26Box-and-Whisker Plots
- a graphical presentation of the five-number
summary of data
27Making a Box-and-Whisker Plot
- Draw a vertical scale including the lowest and
highest values. - To the right of the scale, draw a box from Q1 to
Q3. - Draw a solid line through the box at the median.
- Draw lines (whiskers) from Q1 to the lowest and
from Q3 to the highest values. -
28Construct a Box-and-Whisker Plot
12 15 16 16 17 18 22 22 23 24 25 30
32 33 33 34 41 45 51
Lowest 12 Q1 17 median 24 Q3 33 Highest
51
29Box-and-Whisker Plot
Lowest 12 Q1 17 median 24 Q3 33 Highest
51
30Resistant Measure
- a measure that is not influenced by extremely
high or low data values
31Which is less resistant?
- The mean is less resistant. It can be made
arbitrarily large by increasing the size of one
value.
32Measures of Variation
- Range
- Standard Deviation
- Variance
- Mean Absolute Deviation (MAD) in GPS
33The Range
- the difference between the largest and smallest
values of a distribution
34Find the range
- 10, 13, 17, 17, 18
- The range largest minus smallest
- 18 minus 10 8
35The standard deviation
- a measure of the average variation of the data
entries from the mean
36Standard deviation of a sample
mean of the sample
n sample size
37To calculate standard deviation of a sample
- Calculate the mean of the sample.
- Find the difference between each entry (x) and
the mean. These differences will add up to zero. - Square the deviations from the mean.
- Sum the squares of the deviations from the
mean. - Divide the sum by (n ? 1) to get the variance.
- Take the square root of the variance to get the
standard deviation.
38The Variance
- the square of the standard deviation
39Variance of a Sample
40Find the standard deviation and variance
x 30 26 22
4 0 ?4
16 0 16 ___
Sum 0
78
32
mean 26
41The variance
32 ? 2 16
42The standard deviation
s
43Find the mean, the standard deviation and variance
x 4 5 5 7 4
?1 0 0 2 ?1
1 0 0 4 1
mean 5
25
6
44The mean, the standard deviation and variance
Mean 5
45Population Mean and Standard Deviation
46MEAN ABSOLUTE DEVIATIONMAD (Added to GPS)
47To Find Mean, Median, Quartiles, and Standard
Deviation On TI-83/84
Calculator 1) Place the data in L1 (Press STAT
EDIT to see L1, L2 etc) 2) press the STAT
button 3) Move cursor to Calc 4) Press enter 5)
Press L1 and press enter For Mean read
Standard Deviation of Sample read
Standard Deviation of Population read
Scroll down to read Min, Q1, Med, Q3, and
Max