Title: How good are our measurements?
1How good are our measurements?
- The last three lectures were concerned with some
basics of psychological measurement What does it
mean to quantify a psychological variable? How do
we operationally define both observable and
latent variables? - The next important issue concerns the quality of
our measurements - How can we help make our measurements precise?
- How can we determine whether were measuring what
we think were measuring?
2Reliability
- Reliability the extent to which measurements are
free of random errors - Random error nonsystematic mistakes in
measurement - misreading a questionnaire item
- observer looks away when coding behavior
- nonsystematic misinterpretations of a behavior
3Reliability
- What are the implications of random measurement
errors for the quality of our measurements?
4Reliability
- O T E S
- O a measured score (e.g., performance on an
exam) - T true score (e.g., the value we want)
- E random error
- S systematic error
- O T E
- (well ignore S for now, but well return to it
later)
5Reliability
- O T E
- The error becomes a part of what were measuring
- This is a problem if were operationally defining
our variables using equivalence definitions
because part of our measurement is based on the
true value that we want and part is based on
error. - Once weve taken a measurement, we have an
equation with two unknowns. We cant separate
the relative contribution of T and E. - 10 T E
6Reliability Do random errors accumulate?
- Question If we sum or average multiple
observations, will random errors accumulate?
7Reliability Do random errors accumulate?
- Answer No. If E is truly random, we are just as
likely to overestimate T as we are to
underestimate T. - Height example
852 53 54 55 56 57 58 59 510 511 6 61 62 63 64 65 66 67 68 89
9Reliability Do random errors accumulate?
Note The average of the seven Os is equal to T
10Reliability Implications
- These demonstrations suggest that one important
way to help eliminate the influence of random
errors of measurement is to use multiple
measurements. - operationally define latent variables via
multiple indicators - use more than one observer when quantifying
behaviors
11Reliability Estimating reliability
- Question How can we estimate the reliability of
our measurements? - Answer Two common ways
- (a) test-retest reliability
- (b) internal consistency reliability
12Reliability Estimating reliability
- Test-retest reliability Reliability assessed by
measuring something at least twice at different
time points. - The logic is as follows If the errors of
measurement are truly random, then the same
errors are unlikely to be made more than once.
Thus, to the degree that two measurements of the
same thing agree, it is unlikely that those
measurements contain random error.
13(No Transcript)
14Reliability Estimating reliability
- Internal consistency Reliability assessed by
measuring something at least twice within the
same broad slice of time. - Split-half based on an arbitrary split (e.g,
comparing odd and even, first half and second
half) - Cronbachs alpha (?) based on the average of all
possible split-halves
15Less error
More error
Item A
4
3
Item B
5
5
Item C
6
7
Item D
5
5
Item E
4
3
Item F
5
5
Items A, B, C yield an average score of
(357)/3 5.
Items A, B, C yield an average score of
(456)/3 5.
Items D, E, F yield an average scores of (5, 3,
5)/3 4.3.
Items D, E, F yield an average scores of (5, 4,
5)/3 4.6.
These two estimates are off by only .4 of a point.
These two estimates are off by .7 of a point.
16Reliability Final notes
- An important implication As you increase the
number of indicators, the amount of random error
in the averaged measurement decreases. - An important assumption The entity being
measured is not changing. - An important note Common indices of reliability
range from 0 to 1 higher numbers indicate better
reliability (i.e., less random error).