Title: Probability Sampling
1Probability Sampling
- uses random selection
- N number of cases in sampling frame
- n number of cases in the sample
- NCn number of combinations of n from N
- f n/N sampling fraction
2Variations
- Simple random sampling
- based on random number generation
- Stratified random sampling
- divide pop into homogenous subgroups, then simple
random sample w/in - Systematic random sampling
- select every kth individual (k N/n)
- Cluster (area) random sampling
- randomly select clusters, sample all units w/in
cluster - Multistage sampling
- combination of methods
3Non-probability sampling
- accidental, haphazard, convenience sampling ...
- may or may not represent the population well
4Measurement
- ... topics in measurement that we dont have
time to cover ...
5Research Design
- Elements
- Samples/Groups
- Measures
- Treatments/Programs
- Methods of Assignment
- Time
6Internal validity
- the approximate truth about inferences regarding
cause-effect (causal) relationships - can observed changes be attributed to the
program or intervention and NOT to other possible
causes (alternative explanations)?
7Establishing a Cause-Effect Relationship
- Temporal precedence
- Covariation of cause and effect
- if x then y if not x then not y
- if more x then more y if less x then less y
- No plausible alternative explanations
8Single Group Example
- Single group designs
- Administer treatment -gt measure outcome
- X -gt O
- assumes baseline of 0
- Measure baseline -gt treat -gt measure outcome
- 0 X -gt O
- measures change over baseline
9Single Group Threats
- History threat
- a historical event occurs to cause the outcome
- Maturation threat
- maturation of individual causes the outcome
- Testing threat
- act of taking the pretest affects the outcome
- Instrumentation threat
- difference in test from pretest to posttest
affects the outcome - Mortality threat
- do drop-outs occur differentially or randomly
across the sample? - Regression threat
- statistical phenomenon, nonrandom sample from
population and two imperfectly correlated measures
10Addressing these threats
- control group treatment group
- both control and treatment groups would
experience same history and maturation threats,
have same testing and instrumentation issues,
similar rates of mortality and regression to the
mean
11Multiple-group design
- at least two groups
- typically
- before-after measurement
- treatment group control group
- treatment A group treatment B group
12Multiple-Group Threats
- internal validity issue
- degree to which groups are comparable before the
study - selection bias or selection threat
13Multiple-Group Threats
- Selection-History Threat
- an event occurs between pretest and posttest that
groups experience differently - Selection-Maturation Threat
- results from differential rates of normal growth
between pretest and posttest for the groups - Selection-Testing Threat
- effect of taking pretest differentially affects
posttest outcome of groups - Selection-Instrumentation Threat
- test changes differently for the two groups
- Selection-Mortality Threat
- differential nonrandom dropout between pretest
and posttest - Selection-Regression Threat
- different rates of regression to the mean in the
two groups (if one is more extreme on the pretest
than the other)
14Social Interaction Threats
- Problem
- social pressures in research context can lead to
posttest differences that are not directly caused
by the treatment - Solution
- isolate the groups
- Problem in many research contexts, hard to
randomly assign and then isolate
15Types of Social Interaction Threats
- Diffusion or Imitation of Treatment
- control group learns about/imitates experience of
treatment group, decreasing difference in
measured effect - Compensatory Rivalry
- control group tries to compete w/treatment group,
works harder, decreasing difference in measured
effect - Resentful Demoralization
- control group discouraged or angry, exaggerates
measured effect - Compensatory Equalization of Treatment
- control group compensated in other ways,
decreasing measured effect
16Intro to Design/ Design Notation
- Observations or Measures
- Treatments or Programs
- Groups
- Assignment to Group
- Time
17Observations/Measure
- Notation O
- Examples
- Body weight
- Time to complete
- Number of correct response
- Multiple measures O1, O2,
18Treatments or Programs
- Notation X
- Use of medication
- Use of visualization
- Use of audio feedback
- Etc.
- Sometimes see X, X-
19Groups
- Each group is assigned a line in the design
notation
20Assignment to Group
- R random
- N non-equivalent groups
- C assignment by cutoff
21Time
- Moves from left to right in diagram
22Types of experiments
- True experiment random assignment to groups
- Quasi experiment no random assignment, but has
a control group or multiple measures - Non-experiment no random assignment, no
control, no multiple measures
23Design Notation Example
R O1 X O1,2
R O1 O1,2
Pretest-posttest treatment versus comparison
group randomized experimental design
24Design Notation Example
N O X O
N O O
Pretest-posttest Non-Equivalent
Groups Quasi-experiment
25Design Notation Example
X O
Posttest Only Non-experiment
26Goals of design ..
- Goalto be able to show causality
- First step internal validity
- If x, then y
- AND
- If not X, then not Y
27Two-group Designs
- Two-group, posttest only, randomized experiment
R X O
R O
Compare by testing for differences between means
of groups, using t-test or one-way Analysis of
Variance(ANOVA) Note 2 groups, post-only
measure, two distributions each with mean and
variance, statistical (non-chance) difference
between groups
28To analyze
- What do we mean by a difference?
29Possible Outcomes
30Measuring Differences
31Three ways to estimate effect
- Independent t-test
- One-way Analysis of Variance (ANOVA)
- Regression Analysis (most general)
- equivalent
32Computing the t-value
33Computing the variance
34Regression Analysis
Solve overdetermined system of equations for ß0
and ß1, while minimizing sum of e-terms
35Regression Analysis
36ANOVA
- Compares differences within group to differences
between groups - For 2 populations, 1 treatment, same as t-test
- Statistic used is F value, same as square of
t-value from t-test
37Other Experimental Designs
- Signal enhancers
- Factorial designs
- Noise reducers
- Covariance designs
- Blocking designs
38Factorial Designs
39Factorial Design
- Factor major independent variable
- Setting, time_on_task
- Level subdivision of a factor
- Setting in_class, pull-out
- Time_on_task 1 hour, 4 hours
40Factorial Design
- Design notation as shown
- 2x2 factorial design (2 levels of one factor X 2
levels of second factor)
41Outcomes of Factorial Design Experiments
- Null case
- Main effect
- Interaction Effect
42The Null Case
43The Null Case
44Main Effect - Time
45Main Effect - Setting
46Main Effect - Both
47Interaction effects
48Interaction Effects
49Statistical Methods for Factorial Design
- Regression Analysis
- ANOVA
50ANOVA
- Analysis of variance tests hypotheses about
differences between two or more means - Could do pairwise comparison using t-tests, but
can lead to true hypothesis being rejected (Type
I error) (higher probability than with ANOVA)
51Between-subjects design
- Example
- Effect of intensity of background noise on
reading comprehension - Group 1 30 minutes reading, no background noise
- Group 2 30 minutes reading, moderate level of
noise - Group 3 30 minutes reading, loud background noise
52Experimental Design
- One factor (noise), three levels(a3)
- Null hypothesis ?1 ?2 ?3
Noise None Moderate High
R O O O
53Notation
- If all sample sizes same, use n, and total N a
n - Else N n1 n2 n3
54Assumptions
- Normal distributions
- Homogeneity of variance
- Variance is equal in each of the populations
- Random, independent sampling
- Still works well when assumptions not quite
true(robust to violations)
55ANOVA
- Compares two estimates of variance
- MSE Mean Square Error, variances within samples
- MSB Mean Square Between, variance of the sample
means - If null hypothesis
- is true, then MSE approx MSB, since both are
estimates of same quantity - Is false, the MSB sufficiently gt MSE
56MSE
57MSB
- Use sample means to calculate sampling
distribution of the mean, - 1
58MSB
- Sampling distribution of the mean n
- In example, MSB (n)(sampling dist) (4) (1) 4
59Is it significant?
- Depends on ratio of MSB to MSE
- F MSB/MSE
- Probability value computed based on F value, F
value has sampling distribution based on degrees
of freedom numerator (a-1) and degrees of freedom
denominator (N-a) - Lookup up F-value in table, find p value
- For one degree of freedom, F t2
60Factorial Between-Subjects ANOVA, Two factors
- Three significance tests
- Main factor 1
- Main factor 2
- interaction
61Example Experiment
- Two factors (dosage, task)
- 3 levels of dosage (0, 100, 200 mg)
- 2 levels of task (simple, complex)
- 2x3 factorial design, 8 subjects/group
62Summary table
- SOURCE df Sum of Squares Mean Square
F p - Task 1 47125.3333
47125.3333 384.174 0.000 - Dosage 2 42.6667
21.3333 0.174 0.841 - TD 2 1418.6667
709.3333 5.783 0.006 - ERROR 42 5152.0000
122.6667 - TOTAL 47 53738.6667
- Sources of variation
- Task
- Dosage
- Interaction
- Error
63Results
- Sum of squares (as before)
- Mean Squares (sum of squares) / degrees of
freedom - F ratios mean square effect / mean square error
- P value Given F value and degrees of freedom,
look up p value
64Results - example
- Mean time to complete task was higher for complex
task than for simple - Effect of dosage not significant
- Interaction exists between dosage and task
increase in dosage decreases performance on
complex while increasing performance on simple
65Results