Title: Descriptive statistics
1Descriptive statistics
- Describing data with numbers
- measures of location
2What to describe?
- What is the location or center of the data?
- How do the data vary?
3Measures of Location
4Mean
- Denoted by x-bar ( )
- Seriously affected by unusual values
(outliers). - Appropriate for measurement data and
equally-spaced ordinal data.
5Median
- Robust to outliers, that is, not affected much
by unusual values. - Appropriate for measurement data and ordinal data.
6Mode
- The value that occurs most frequently.
- One data set can have many modes.
- Appropriate for all types of data.
7The most appropriate measure of location depends
on
the shape of the datas distribution.
8Most appropriate measure of location
- Depends on whether or not data are symmetric or
skewed. - Depends on whether or not data have one
(unimodal) or more (multimodal) modes.
9Symmetric and Unimodal
10Symmetric and Unimodal
Descriptive Statistics Variable N Mean
Median TrMean StDev SE Mean GPA 92
3.0698 3.1200 3.0766 0.4851 0.0506 Variable
Minimum Maximum Q1 Q3 GPA
2.0200 3.9800 2.6725 3.4675
11Symmetric and Bimodal
12Symmetric and Bimodal
13Symmetric and Bimodal
Descriptive Statistics Variable N Mean
Median TrMean StDev SE Mean Height 64
68.614 69.000 68.634 3.802 0.475 Variable
Minimum Maximum Q1 Q3 Height 61.000
76.000 65.125 72.000
14Skewed Right
15Skewed Right
Descriptive Statistics Variable N Mean
Median TrMean StDev SE Mean q39 92
61.04 46.50 52.93 62.90 6.56 Variable
Minimum Maximum Q1 Q3 q39
0.00 400.00 21.50 83.00
16Choosing Appropriate Measure of Location
- If data are symmetric, the mean, median, and mode
will be approximately the same. - If data are multimodal, report the mean, median
and/or mode for each subgroup. - If data are skewed, report the median.