What is Statistics - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

What is Statistics

Description:

Statistics is a collection of procedures and principles for gathering data and ... T-Shirt Size (S, M, L, XL. XXL) Categorical variable w/order. Ordinal ... – PowerPoint PPT presentation

Number of Views:29
Avg rating:5.0/5.0
Slides: 31
Provided by: linda430
Category:

less

Transcript and Presenter's Notes

Title: What is Statistics


1
  • What is Statistics?

2
  • Statistics is a collection of procedures and
    principles for gathering data and analyzing
    information in order to help people make
    decisions when faced with uncertainty.

3
7 Statistical Stories w/Morals
  • Speedy Drivers
  • Disaster in the Skies
  • Dating
  • Angry Women
  • Prayer Pressure
  • Heart Attacks
  • Internet Loneliness

4
Data are used to make a judgment about a
situation
  • What question needs to be answered?
  • How should we collect data how much?
  • How can we summarize the data?
  • What decisions or generalizations can be made in
    regards to the question based on the data
    collected?

5
Population Data vs. Sample Data
  • Everyoneeverything
  • Parameterssummary measurements denoted by greek
    letters (?, ?)
  • Representative smaller subset of population
  • Statisticssummary measurements denoted by
    standard letters (p, )

6
Data--Types of Variables
7
Summarizing Dataw/ Bar graph of Categorical Data
8
HistogramUnivariate quantitative data
9
Comparative BoxplotsUnivariate quantitative data
by category
Outliers
10
ScatterplotBivariate quantitative data
11
Summary Features of Quantitative Variables
LocationCenter(average) SpreadVariability Shape
Distribution pattern with data
12
LocationCenter
Mean(?, ) add up all data values and divide by
number of data values Medianlist data values in
order, locate middle data value
Data Set 19, 20, 20, 21, 22
Mean is
Median is 20 since it is the middle number of the
ranked data values.
13
Describing Spread Range, Quartiles, and
Interquartile Range
  • Range Maximum minimum
  • Q1 (Quartile 1) is the 25th percentile of ordered
    data or median of lower half of ordered data
  • Median (Q2) is 50th percentile of ordered data
  • Q3 (Quartile 3) is the 75th percentile of ordered
    data or median of upper half of ordered data
  • IQR(Interquartile Range) Q3 Q1
  • Any point that falls outside the interval
    calculated by Q1- 1.5(IQR) and Q3 1.5(IQR) is
    considered an outlier.

14
Boxplot5 Number Summary
Computersx1000 250 1000 2400 3500 5400 8600
1000
2950
5400
8600
250
Q3
Max
min
Q1
median
15
Describing Spread Standard Deviation
Roughly speaking, standard deviation is the
average distance values fall from the mean.
16
Population and SampleStandard Deviation
  • ?2 ?population variance
  • s2 ?sample variance

What is Variance?
17
Calculated Standard Deviations
18
Various Distributions
19
Empirical Rule
34
34
68
47.5
47.5
95
49.85
49.85
99.7
20
Empirical Rule
68
95
99.7
68-95-99.7 RULE
21
Empirical Rule-- Let HN(69, 2.52)
What is the likelihood that a randomly selected
adult male would have a height less than 69
inches?
P(h lt 69) .50
22
Empirical Rulerestated
  • 68 of the values fall within 1 standard
    deviation of the mean in either direction
  • 95 of the values fall within 2 standard
    deviation of the mean in either direction
  • 99.7 of the values fall within 3 standard
    deviation of the mean in either direction
  • Note If the range ? 6 doesnt roughly equal the
    standard deviation, the data may contain outliers
    or have a skewed shape.

23
Using the Empirical Rule
Let HN(69, 2.52)
What is the likelihood that a randomly selected
adult male will have a height between 64 and 74
inches?
P(64 lt h lt 74) .95
24
Using Empirical Rule-- Let HN(69, 2.52)
What is the likelihood that a randomly selected
adult male would have a height of less than 66.5
inches?
P(h lt 66.5) 1 (.50 .34) .50 - .34
.16
25
Using Empirical Rule-- Let HN(69, 2.52)
What is the likelihood that a randomly selected
adult male would have a height of greater than 74
inches?
P(h gt 74) 1 - (.50 .475) 1 -
.975 .025
26
Using Empirical Rule-- Let HN(69, 2.52)
What is the probability that a randomly selected
adult male would have a height between 64 and
76.5 inches?
P(64 lt h lt 76.5) .475 .4985
.9735
27
Standardized Z-Score
  • To get a Z-score, you need to have 3 things
  • Observed value of X
  • Population mean, ?
  • Population standard deviation, ?
  • Then follow the formula.

28
Empirical Rule Z-Score
  • About 68 of values in a normally distributed
    data set have z-scores between 1 and 1 95 of
    the values have z-scores between 2 and 2 and
    99.7 of the values have z-scores between 3 and
    3.

29
Z-Score Let HN(69, 2.52)
  • What would be the standardized score for an adult
    male who stood 71.5 inches?

HN(69, 2.52) ZN(0, 12)
30
Z-Score Let HN(69, 2.52)
What would be the standardized score for an adult
male who stood 65.25 inches?
h 65.25 as z -1.5
Write a Comment
User Comments (0)
About PowerShow.com