Title: Statistics Chapter
1Statistics Chapter
- Measures of Central Tendency
2Introduction
- Statistics
- Helps us better understand the data we work with.
- From www.bls.gov
- Earnings
- Median hourly earnings of welders, cutters,
solderers, and brazers were 14.72 in May 2004.
The middle 50 percent earned between 11.90 and
18.05. The lowest 10 percent had earnings of
less than 9.79, while the top 10 percent earned
over 22.20. - Median hourly earnings of machinists were 16.33
in May 2004. The middle 50 percent earned between
12.84 and 20.33. The lowest 10 percent earned
less than 10.08, while the top 10 percent earned
more than 24.34.
3Introduction
- Statistics
- Helps us better understand the data we work with.
Thermal eye watches weld quality    A steel
maker and a research centre have developed an
automated system for monitoring the quality of
welds in industrial seam-welding. An optical
pyrometer senses the temperature of the weld as
it emerges from the machine. The weld is
acceptable if the average and standard deviation
of the temperatures along its length both fall
within set values. Studies show excellent
correlation with traditional inspection
techniques. The system can also detect any
gradual deterioration in the weld quality,
indicating the need for preventative maintenance
of the welding machine.
4Introduction
- Statistics
- Helps us better understand the data we work with.
- What single number best represents this data?
- How spread out is the data? (How similar are
the s similar in size?) - What is the likelihood of a particular result to
happen based on this data?
5Central Tendency
- What 1 single number best represents a collection
of numbers (data)?
6Central Tendency Analogy
7Central Tendency
Machinists Pay
?
?
?
?
?
?
8Central Tendency
Machinists Pay
(May 2005) Mean annual earnings 35,350
(May 2005) Median Hourly Salary 16.51
Machinists Pay Data
9Central Tendency
10Mean
- Mean is the average of a set of numbers.
11Mean Sample Problem 1
- Average 1-way distance driven from home to SWTC
mean 25.0
12Mean Sample Problem 2
mean 883 bolts/day
13Statistics
14Median
15Median
mean 137,929
mean 83.8
16Median
Median 77.5
88
72
71
92
60
83
17Median
88
72
71
92
60
83
75
18Sample Problem
- Number of defects per shift (2 weeks of data)
Mean ? Median ?
4, 4, 3, 12, 8, 5, 5, 4, 4, 3
Mean 5.2 Median 4
19Statistics
20Mode
- Most common
- Most frequently occurring
- Most popular
Soft Drinks
0.75
21Mode
19 yrs
22Mode
19 yrs
22 yrs
23Homework
- Practice Set 4
- Pages 11 -12 1-3
24Statistics
25Introduction
Ave depth 4.2 ft
26Introduction
Introduction
Ave depth 4.2 ft
27(No Transcript)
28Data Set 1 1, 3, 5, 6, 6, 8, 8, 10, 10 mean
6.33
Data Set 2 5, 5, 5, 6, 7, 7, 7, 7, 8 Mean 6.33
29Range
- Range Highest - Lowest
- A small range means
- A large range means
30Range
Data 1, 3, 5, 6, 6, 8, 8, 10, 10
31Range
Data 5, 5, 5, 6, 7, 7, 7, 7, 8
32Data Set 1 1, 3, 5, 6, 6, 8, 8, 10, 10 mean
6.33 range 9
Data Set 2 5, 5, 5, 6, 7, 7, 7, 7, 8 mean
6.33 range 3
33Range Our Class
range 32
34Range Our Class
range 13.5 hrs/wk
35Statistics
- Measures of Variability Standard Deviation
36Standard Deviation
- Standard deviation is another way to measure
variation. - Standard Deviation tells you the average
distance a piece of data is from that groups
mean.
375.1 2 3.1
- Hours per week studying
- Mean 5.1 hrs/wk
5.1 4 1.1
7 5.1 1.9
15 5.1 9.9
etc
Standard Deviation 3.5 hrs
2
0
4
6
8
10
12
14
16
38Compute Standard Deviation
- Calculator
- Computer
- Excel
Sx
stdev(A1A7)
8.55
39Review
- Measures of variability tell you
- how similar a collection of numbers are to one
another - how different they are from one another.
Range 7 Standard Deviation 2.5
89, 90, 92, 95, 88, 90
Range 11.7 Standard Deviation 31
90, 71, 80, 93, 62, 84
40Comparison
Pro easy to compute Con only looks at the
high and low values nothing else. Can be
affected by flyers.
41Comparison
Pro gives a more accurate picture of
variability since it involves all of the data in
the computation. Con takes longer to
compute
42In-class Problems
- Question 1 Which group of data will have a
large standard deviation? Which will have a
small standard deviation? - Question 2 For which group of data will the
computed range be deceptive?
Set 1 7.85, 8.21, 7.15, 6.82, 5.95, 8.05, 8.72
Set 2 7.15, 12.98, 13.25, 12.01, 12.37, 11.95
43Questions (1 of 3)
- Traffic speeds of all vehicles moving along a
stretch of freeway are monitored for several
days. The standard deviation was found to be
relatively low, that means - A) the vehicles are moving at a relatively
uniform speed
44Questions (1 of 3)
- Traffic speeds of all vehicles moving along a
stretch of freeway are monitored for several
days. The standard deviation was found to be
relatively low, that means - A) the vehicles are moving at a relatively
uniform speed - B) the vehicles are moving at very dissimilar
speedssome are moving quite slow and others are
moving quite fast.
45Questions (2 of 3)
- Which machining shop has a wider range of ages in
its force? - A) Acme Metals
- Standard Deviation 5 yrs
- B) Emerald City Machining
- Standard Deviation 7.8 years
46Questions (3 of 3)
- Which measure range or standard deviation,
would give the most realistic indication of
variability for this data set?
Number of off-site welding jobs performed per
week.
11, 15, 12, 14, 18, 12, 12, 13, 1, 2, 10, 15, 13,
12, 16
47Homework
- Practice Set 6
- Pages 22 - 23 1-4
48Statistics
49Normal Distributions
- Data gets collected about a certain topic
- IQ Scores
- Heights
- Number of Arrests
- Age
- Size of Manufactured Parts
50Normal Distributions
- As you record the data, you note that numbers
tend to pile up in certain areas.
?
?
?
?
?
red blue yellow green orange
51Normal Distributions
- As you record the data, you note that numbers
tend to pile up in certain areas.
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
red blue yellow green orange
52Normal Distributions
red blue yellow green orange
53Normal Distribution
X X X X X
X X X X
X X X X
X X
X X
X
X
100
115
85
130
70
55
145
54IQ Info
- Genius IQ is generally considered to begin around
140 to 145, representing 0.25 of the population
(1 in 400). Here's a rough guide - 115-124 - Above average (e.g., university
students) - 125-134 - Gifted (e.g., post-graduate students)
- 135-144 - Highly gifted (e.g., intellectuals)
- 145-154 - Genius (e.g., professors)
- 155-164 - Genius (e.g., Nobel Prize winners)
- 165-179 - High genius
- 180-200 - Highest genius
- gt200 - "Unmeasurable genius"
55Normal Distribution
- Heights of U.S Women Age 18-25
- mean 65.5 inches
- standard deviation 2.5 inches
65.5
68
63
70.5
60.5
58
73
56Standard Deviation Characteristics
Weight of Candy Bars Produced
IQ Scores
What can be learned from the different curve
shapes?
Traffic Speeds
Age of Auto Thieves
57Standard Deviation Characteristics
15
15
100
115
85
58Standard Deviation Characteristics
15 15
15 15
100
115
85
130
70
59Standard Deviation Characteristics
15 15 15
15 15 15
100
115
85
145
55
70
130
60Normal Distributions
- For any data that is normally distributed
- 68 of the data is located within 1 standard
deviation from the mean.
?
?
61Normal Distributions
- For any data that is normally distributed
- 95 of the data is located within 2 standard
deviations from the mean.
? ?
? ?
62Normal Distributions
- For any data that is normally distributed
- 99.7 of the data is located within 3 standard
deviations from the mean.
? ? ?
? ? ?
63Practice
- Daily attendance at the annual county fair for
the past ten years.
What is the mean for this data?
What is the standard deviation for this data?
4500
4750
4250
5000
4000
3750
5250
64Homework
- Practice Set 7
- Page 28 1 - 5
65Statistics
- Normal Distribution Applications
66Using Normally Distributed Data
- Data on the Heights of U.S. Women
65.5
68
63
70.5
60.5
58
73
67Applications
68Applications
69Using Normally Distributed Data
Service Life (Months) of SureStart Car Batteries
- How long does the average SureStart Battery last?
- Would it be better for this company to set a 44
month warranty or a 56 month warranty?
52
54
48
56
46
44
58
70Applications
Warranties
71Sample Problems
- Data Lifespan of 60w Lightbulbs
As a manufacturer, what would be the safest claim
to make about the bulbs? a) Our bulbs are
guaranteed to last 2000 hrs. b) Our bulbs are
guaranteed to last 2300 hrs. c) Our bulbs are
guaranteed to last 1500 hours.
2000
1850
1600
2450
2150
1550
2300
72Sample Problems
- Establish ad campaigns to target products.
a) What is the average age of your customers?
b) What percent of your customers are between
the ages of 15 and 23?
c) What percent of your customers are over 25
years old?
19
21
17
23
15
13
25
73Sample Problems
- Establish ad campaigns to target products.
d) What percent of your customers are 15 years
old or less?
e) If you sold to 15,000 customers last year,
how many were between the ages of 17 and 21?
19
21
17
23
15
13
25
74Sample Problems
a) In a classroom of 20 people, how many will
have IQs between 85 and 115?
100
115
85
130
70
55
145
75Sample Problems
b) At SWTC, assuming around 1400 full time
students, how many will have an IQ below 85?
How many below 70?
100
115
85
130
70
55
145
76Sample Problems
c) In Platteville, assuming a population of
about 10,000 people, how many will have an IQ
above 130?
100
115
85
130
70
55
145
77Homework
- Practice Set 8
- Pages 34-35 1-3
78Statistics
79Quality Control
- Helps to understand what is happening with the
products that are being produced.
80Statistics
81Process Capability
This machines job is to fill 50 gallon
drums. What kind of results (gallons/barrel)
should I expect from this process?
If I know what is typical for this process I can
more easily spot when problems show up.
Drum-filling
FillingMachine
82Production
Goal Describe what is happening with the
drum-filling process. The intent is to fill each
drum with 50 gallons of fluid.
Drum-filling
FillingMachine
Mean 50.1 gallons St. Dev. 0.05 gallons
83Data from drum filling
Mean 50.1 gallons St. Dev. 0.05 gallons
99.7
50.1
50.15
50.05
50.2
50.0
49.95
50.25
Upper Control Limit 50.25 gal
UCL 50.25 gal
Lower Control Limit 49.95 gal
LCL 49.95 gal
84Once the UCL and LCL are computed, I now know the
capabilities of this process.
Drum-filling
FillingMachine
85Statistics
86Tolerance
- Tolerance information tells you how to judge
quality.
- Weight
- Length
- Volume
- Color
87Tolerance
Tolerance for Volume
50.0 gal ? 0.2 gal
Minimum Volume 50 0.2 49.8 gal Maximum
Volume 50 0.2 50.2 gal
Drum-filling
50.18 gal
49.75 gal
NO
OK
88Capability vs Tolerance
- Tolerance
- Minimum
- Maximum
89Capability vs Tolerance
- Tolerance
- Minimum
- Maximum
90Capability vs Tolerance
- Capability
- UCL 50.25 gal
- LCL 49.95 gal
- Tolerance
- Minimum 49.8 gal
- Maximum 50.2 gal
91Sample Problem
Tolerance for diameter 2.5000 ? 0.0015
Process mean diameter of 2.5003 standard
deviation of 0.0006
92Control Limits and Tolerance
- Control limits for this process
- mean 2.5003
- st. dev. 0.0006
- UCL __________
- LCL __________
- Tolerance limits
- 2.5000 ? 0.0015
- max dia __________
- min dia __________
93Capability vs Tolerance
- Capability
- UCL 2.5021
- LCL 2.4985
- Tolerance
- Minimum 2.4985
- Maximum 2.5015
94Control Charts
- Show what is happening with a process throughout
the day
---- g
240 g
242 g
239 g
240 g
238 g
95Control Chart
UCL
12.5 fl oz
12.0 fl oz
11.5 fl oz
LCL
7 am 12.4 12.5 12.3 12.3 12.5 62 12.4 0.2
8 am 12.5 12.4 12.5 12.3 12.3 62 12.4 0.2
9am etc etc etc etc etc etc 12.6 0.1
10am etc etc etc etc etc etc 12.4 0.4
11am etc etc etc etc etc etc 12.2 0.2
12 pm etc etc etc etc etc etc 12.4 0.5
1 pm etc etc etc etc etc etc 11.8 0.1
2 pm etc etc etc etc etc etc 12.0 0.3
3 pm
Measurements
96Control Chart
UCL
0.5
LCL
0
7 am 12.4 12.5 12.3 12.3 12.5 62 12.4 0.2
8 am 12.5 12.4 12.5 12.3 12.3 62 12.4 0.2
9am etc etc etc etc etc etc 12.6 0.1
10am etc etc etc etc etc etc 12.4 0.4
11am etc etc etc etc etc etc 12.2 0.2
12 pm etc etc etc etc etc etc 12.4 0.5
1 pm etc etc etc etc etc etc 11.8 0.1
2 pm etc etc etc etc etc etc 12.0 0.3
3 pm
Measurements
97Trends
UCL
0.5
LCL
0
98Statistics
99Introduction
- Correlation is there a relationship between two
sets of data? - Engine Size versus Fuel Economy
- Temperature versus Crime Rate
- Average temperature versus Home Heating Costs
100Procedure - Step 1
- Step 1 Create a scatter graph
101Procedure Step 2
Engine Size 4.0 L Mileage 20.5 mpg
102Procedure Step 2
Engine Size 2.0 L Mileage 23.5 mpg
103Procedure Step 3
104Procedure Step 3
I
II
IV
III
105Procedure Step 4
- Quadrant I _____
- Quadrant II _____
- Quadrant III _____
- Quadrant IV _____
I
II
IV
III
106Procedure Step 5
- Quadrant I Quadrant III _____
- Quadrant II Quadrant IV _____
I
II
- Positive Correlation
- Negative Correlation
- No Correlation
IV
III
107Procedure Step 6
?
?
?
Crime Rate
?
?
?
?
Temperature
108Procedure Step 6
?
?
Mileage
?
?
?
?
?
Engine Size
109Procedure Step 6
?
?
?
?
?
?
?
110Procedure Step 6
- Quadrant I Quadrant III __2___
- Quadrant II Quadrant IV __12___
I
II
- Positive Correlation
- Negative Correlation
- No Correlation
IV
III
111Correlation Criteria
- Correlation Rules (p. 55)
- Positive
- Sum of Quadrants I and III more than twice the
sum of Quadrants II and IV. - Negative
- Sum of Quadrants II and IV more than twice the
sum of Quadrants I and III. - No Correlation
- When neither of the above occur.
112Homework
- Practice Set 11
- Pages 59 60, 1 and 2