Lecture 27: Analysis - PowerPoint PPT Presentation

1 / 38
About This Presentation
Title:

Lecture 27: Analysis

Description:

Targetted experiments (with analyses) can help guide design. Analysis of results can be used to support proposed design during reviews ... – PowerPoint PPT presentation

Number of Views:33
Avg rating:3.0/5.0
Slides: 39
Provided by: DD26
Category:
Tags: analysis | lecture

less

Transcript and Presenter's Notes

Title: Lecture 27: Analysis


1
Lecture 27 Analysis
09/10/99
2
Goals
  • Introduction to Data Analysis
  • How to use a statistical Package
  • Review of basic statistical concepts and methods

3
Steps in Data Analysis
  • Enter the Data
  • Choose the Analysis
  • Run the Analysis
  • Interpret the results
  • Explore more analyses
  • Learn and Communicate

4
Why Use Statistics?
  • Because Eyeballing data can be misleading
  • Because Decisions require objective proof
  • We need a consistent method for establishing
    proof
  • Statistics, combined with good experimental
    design, helps researchers overcome their biases,
    assumptions and expectations
  • e.g., the Canals on Mars problem
  • Statistics help us overcome
  • Experimental Error
  • Confusion of correlation with causation
  • complexity of effects studied

5
Types of Statistics
  • Inferential statistics
  • Concerned with making comparisons
  • e.g., A is bigger than B
  • e.g., There is a positive relationship (slope)
    between A and B
  • Inferential statistics emphasize tests of
    significance
  • Descriptive statistics
  • concerned with describing patterns and properties
    of data
  • e.g., there are three types of patient that
    differ on these dimensions
  • e.g., these three items in the personality test
    appear to be measuring the same underlying
    construct (e.g., extraversion)

6
Distributions
Yield Time order Method Yield 1 A 89.7 2 A
81.4 3 A 84.5 4 A 84.8 5 A 87.3 6 B 79.7 7 B
85.1 8 B 81.7 9 B 83.7 10 B 84.5
  • Data
  • Time Order X Yield
  • Method A vs. Method B
  • Does Method affect Result?

Method A Method B Difference 85.54 82.94 2..6
7
How the Distributions Look
  • A looks bigger
  • Lots of variation

8
Ways of Visualizing Distributions
  • Dot Diagrams
  • Histograms
  • Data Runs (previous slide)
  • Stem and Leaf Plots (Tukey)
  • and others

9
Ways of Describing Distributions
  • Measures of Location
  • Mean
  • Median
  • Mode
  • Measures of Variability
  • Standard Deviation (Variance)
  • Kurtosis (peakiness)

10
Randomization Test
  • Imagine that we had the 10 observations shown
    earlier and randomly shuffled them into two
    groups of five. What are the chances that the
    difference between the means of the two groups
    would be greater than or equal to the observed
    difference?
  • If we calculate this probability across all
    possible shuffles of the data, this is called a
    randomization test.

11
Significance Tests
  • Randomization test is a significance test
  • compute a statistic to test a hypothesis (e.g.,
    means are significantly different)
  • create reference distribution (for true
    hypothesis)
  • calculate probability of observed discrepancy
    occurring by chance
  • If probability is low enough, reject hypothesis
    and assert statistically significant difference

12
Normal and t-distributions
  • Normal distribution is bell-curve shaped
  • t-distribution reflects difference between 2
    Normal distributions (approximates Normal as
    number of observations increases)
  • Reasons why normal distribution is important
  • Central Limit Theorem
  • Robustness of statistical procedures to
    departures from normality

13
Characterizing Normal Distributions
  • mean
  • variance
  • probabilities under the distribution
  • z-values
  • z-values measure position in standard deviation
    units.
  • t-values are sample approximations of z values
  • probability that data point will differ by more
    than two standard deviations from the centre of a
    normal distribution is roughly 5

14
t-tests
  • See standard texts for calculation of t-statistic
  • divide difference in means by measure of
    variability
  • refer result to tables of t values
  • select entry based on degrees of freedom and
    check out assigned probability of difference
  • Paired vs. Independent samples t-test

15
Entering Data in SPSS
16
Putting in Numbers
17
Setting the Variable Type
18
Number with two decimal places
19
Data Matrix
20
Choosing an Analysis Method
21
Analysis Options
22
Sample Output
23
Choosing a Variable
24
Setting Options
25
Chart Output
26
Inferential Statistics and Analysis of Variance
  • Why use statistics?
  • Randomization and Randomization tests
  • Tests of Significance
  • Confidence Intervals
  • Accounting for Error Variation
  • Analysis of Variance

27
Sample Data for Paired (dependent) t-test
Mnths_6 Mnths_24 124 114 94 88 115 102 110 2 116 2
139 2 116 2 110 2 129 2 120 2 105 2 88 2 120 2 12
0 2 116 2 105 2 ... ... ... ... 123 132
28
T-test Results
29
Anova Example Data Matrix
30
Deviations from Grand Mean
31
Total Sums of Squares
32
One-way ANOVA
33
Two-way Data Matrix
34
Table of Means
35
Two-way Anova Summary
36
Application to Design
  • Targetted experiments (with analyses) can help
    guide design
  • Analysis of results can be used to support
    proposed design during reviews
  • Efficient experiments dont have to be expensive

37
Cautions
  • Formal experiments are often overkill
  • Many quick iterations are probably more effective
    than a few thorough iterations
  • Design sufficient power in the experiment

38
Summary
  • Design often requires observation
  • Methods of observation need to be carefully
    controlled (including sampling)
  • Formal experiments provide control
  • Careful analysis helps interpretation/insight
  • Descriptive Statistics
  • Inferential Statistics
  • T-tests and ANOVAs are typically used to analyze
    experiments
Write a Comment
User Comments (0)
About PowerShow.com