Where weve been - PowerPoint PPT Presentation

About This Presentation
Title:

Where weve been

Description:

FABRIC TYPE' VARIABILITY 'SUM OF SQUARES' IS WHAT GOES INTO ... Sqrt(MSE) describes standard deviation of burn time within each fabric type. ... – PowerPoint PPT presentation

Number of Views:52
Avg rating:3.0/5.0
Slides: 16
Provided by: johnstau
Category:
Tags: fabric | weve

less

Transcript and Presenter's Notes

Title: Where weve been


1
Where weve been where were going
  • We can use data to address following questions
  • 1. Question Is a mean some number? Large
    sample z-test and CI Small sample t-test and
    CI
  • 2. Question Is a proportion some
    ? Proportion version of large sample
    z-test and CI

2
Where weve been where were going
  • 3. Question Is a diff between two means some
  • Independent samples
  • large sample z test and CI
  • small sample t test and CI
  • paired samples
  • small sample paired t test and CI
  • 4. Question Is diff between 2 proportions some
  • Proportion version of large sample z test
    and CI

3
Topics to be covered in remaining 8 classes
(including today)
  • Analysis of Variance and Linear Regression
    (Chapters 11, 12 and 13)
  • response b0 b1 covariate 1 bp
    covariate p error
  • Categorical Data / Contingency Tables when
    response is discrete

4
Back to Fabric DataTried to light 4 samples of
4 different (unoccupied!) pajama fabrics on
fire.
18
Higher meanslessflamable
Mean16.85std dev0.94
17
16
15
e
m
i
14
T

n
13
r
u
B
12
Mean10.95std dev1.237
Mean11.00std dev1.299
11
Mean10.50std dev1.137
10
9
4
3
2
1
Fabric
5
Suppose we want to test
  • H0 m1m2m3m4
  • HA at least one mean is not equal.
  • at level a 0.05.

Note that this is the probability ofmaking a
false claim (if they are all equal).
First idea for how to do this do four tests at
level a (m1m2, m2m3 etc) and reject H0 if at
least one is rejected.
6
  • Test 1
  • H0 m1m2
  • HA not equal
  • Level a0.05
  • Test 2
  • H0 m2m3
  • HA not equal
  • Level a0.05

Reject all means equal if at least one test
fails. This will give you a decision, but whats
the overall probability of making a false claim
(if all means are equal) (a level) for this
procedure? gt,lt, or equal to a?
Test 3 H0 m3m4 HA not equal Level a0.05
Test 4 H0 m4m1 HA not equal Level a0.05
7
  • Overall a
  • Pr(Falsely reject H0 m1m2m3m4)
  • Pr( at least one test falsely rejects)
  • 1-Pr(none falsely reject)
  • 1-Pr( test 1 doesnt and and test 4 doesnt)
  • 1-(0.954) 0.19(last step uses independence)

Point We thought we were doing a level 0.05
test, but its actually level 0.18! Thats a
problem!
Name for this problem multiple testing
problem. Whats one solution?
8
Solution 1
  • Do the 4 tests each at a level less than a
  • Many methods to do this Bonferroni and Tukey
    are some common ones.
  • We wont go into much mathematical detail, but
    these test are often conservative. (True a is
    smaller than the planned a and power is lower
    than planned.)
  • For instance, divide a by of tests you do
  • 1-(1-(a/4))4 1-(1-0.05/4)4 0.0491

9
Solution 2 Analysis of Variance!
  • Idea
  • Variability in the fabric data occurs at two
    levels within fabric type and across fabric
    types.
  • If across fabric type variability is large
    relative to variability within each fabric type,
    then the means are not equal.

10
18
17
16
15
e
m
i
14
T

n
13
r
u
B
12
11
10
9
4
3
2
1
Fabric
Vertical spread of data points within each oval
is one type of variability. Vertical spread of
the ovals is another type of variability.
11
To use the idea to test, we need a fact about
variances
  • Suppose s12 s22
  • If s12 is estimated by s12 from n1 data points
    and s22 is estimated with s22 from n2 data points
    (and the data are normal and independent), then
  • s22 / s12 Fn2-1,n1-1

Another distribution. The F distribution. n2-1
numerator dfn1-1 denominator df
12
Use the test to define large
  • H0 s12 s22
  • HA s22 gt s12
  • Level a test reject H0 at level a if
  • s22 / s12 gt F1-a,n2-1,n1-1

13
  • Test for fabric
  • Formally
  • At least one of the means is different if
  • Variance among fabric types is greater than the
    variance within fabric types
  • Variance among fabric types / Variance within
    fabric types gt F1-a,3-1,16-3
  • When one does the test, one uses software that
    produce
  • Analysis of variance or ANOVA tables.

14
Suppose there are k treatments and n data
points.ANOVA table
ESTIMATE OF AMONG FABRIC TYPE VARIABILITY
  • Source Sum of Meanof Variation df Squares Squa
    re F P
  • Treatment k-1 SST MSTSST/(k-1) MST/MSE
  • Error n-k SSE MSESSE/(n-k)
  • Total n-1 total SS

ESTIMATE OF WITHIN FABRIC TYPE VARIABILITY
P-VALUEFOR TEST. (REJECT IF LESS THAN a)
SUM OF SQUARES IS WHAT GOES INTO NUMERATOR OF
s2 (X1-X)2 (Xn-X)2
15
One-way ANOVA Burn Time versus Fabric Analysis
of Variance for Burn Time Source DF SS
MS F P Fabric 3
109.81 36.60 27.15 0.000 Error 12
16.18 1.35 Total 15
125.99 Explaining why ANOVA is an analysis of
variance MST 109.81 / 3 36.60 Sqrt(MST)
describes standard deviation among the
fabrics. MSE 16.18 / 12 1.35 Sqrt(MSE)
describes standard deviation of burn time within
each fabric type. (MSE is estimate of variance
of each burn time.) F MST / MSE 27.15 It
makes sense that this is large and p-value
Pr(F4-1,16-4 gt 27.15) 0 is small because the
variance among treatments is much larger than
variance within the units that get each
treatment. (Note that the F test assumes the
burn times are independent and normal with the
same variance.)
Write a Comment
User Comments (0)
About PowerShow.com