The Determinants of Student Achievement: Different Estimates for Different Measures presentation

About This Presentation

Transcript and Presenter's Notes

Title: The Determinants of Student Achievement: Different Estimates for Different Measures

1
The Determinants of Student AchievementDifferent
Estimates for Different Measures
Tim Sass Department of Economics Florida State
University
CALDER Conference October 4, 2007
2
Different Measures

Types of Tests
Criterion Reference Tests
Test whether student has learned elements in
state established instructional standards
State specific
Nationally Normed Tests
Tests whether student has learned a set of
concepts and skills that may or may not
correspond to any particular states curriculum
benchmarks
Allows interstate comparisons

3
Different Measures

Scaling
Non-Vertically Aligned Scale Scores
Scale potentially different at each grade level
Cant compare learning gains
Criterion reference tests are typically not
vertically aligned
Vertical or Developmental Scales
A single equal-interval scale that spans all
grade levels
A one-unit change means the same at all levels
within and between grades
Some norm-referenced exams are of this type
Stanford Achievement Test

4
Non-Vertically Aligned Scores
Grade 10
Trigonometry
Grade 9
Grade 8
Grade 7
Grade 6
Grade 5
Single-Digit Addition
Grade 4
Grade 3
5
Vertically Scaled Scores
Trigonometry
Grade 10
Grade 9
Grade 8
Grade 7
Grade 6
If done right, vertically scaled exam ideal for
analyzing learning gains since one-point change
has same meaning everywhere on the scale.
Grade 5
Simple Addition
Grade 4
Grade 3
6
Different Measures

Scale Scores Normalized by Grade and Year
Frequently used by researchers to compare a
students performance on criterion referenced
tests over time
Compares a students performance relative to the
performance of other students taking the same
grade-level exam in the same year
Unit of measure is the standard deviation
If performance distribution changes from grade to
grade, normalized scores may not be comparable
Also sometimes used to try to equate performance
on different exams when a state changes their
test midstream

7
Normalized Scores
0
Normalized score sets mean to zero and rescales
score
Grade 5
8
Different Results

Analysis of the Effectiveness of NBPTS Certified
Teachers
Harris and Sass, The Effects of NBPTS-Certified
Teachers on Student Achievement (February, 2007)
Compares the effectiveness of NBPTS-certified
teachers (NBCTs) with the effectiveness of
non-NBCTs in Florida
In many cases, results vary whether use scores
from Floridas criterion reference test, the
FCAT-Sunshine State Standards exam (FCAT-SSS), or
the Stanford Achievement Test, a norm-referenced
test (FCAT-NRT)

9
Value-Added Estimates of Reading Achievement
Selected Explanatory Variables FCAT-SSS Developmental Scale FCAT-NRT Developmental Scale FCAT-SSS Normalized by Grade Year FCAT-NRT Normalized by Grade Year
NBPTS Certified 0.0163 0.0020 0.0186 0.0011
First-Year Teacher -0.0403 -0.0219 -0.0324 -0.0266
1-2 Years of Teaching Experience -0.0071 -0.0106 -0.0075 -0.0120
3-4 Years of Teaching Experience -0.0123 -0.0129 -0.0112 -0.0134
5-9 Years of Teaching Experience -0.0075 -0.0109 -0.0098 -0.0116
Advanced Degree -0.0128 0.0007 -0.0101 -0.0001
Class Size -0.0028 -0.0017 -0.0026 -0.0017
Note all coefficients expressed in standard
deviation units omitted experience
category is teachers with 10 years of
experience coefficients in green are
statistically significant at the 95 confidence
level
10
Different Results

More variation in estimated effects across exams
than in different scalings of same exam
Estimated effects of variables representing small
proportions of teachers most variable
NBPTS Certification
Advanced Degrees
Why are there differences across exams?
Differences in material covered
Differential ceiling effects

11
Vertically Scaled Scores With Ceiling
Trigonometry
Grade 10
Grade 9
Grade 8
Grade 7
Grade 6
Grade 5
Simple Addition
Grade 4
Grade 3
12
Conclusions

Not much difference between developmental scale
scores and non-vertically aligned scores that are
normalized by grade and year
Different tests can yield different results
Low-incidence variables seem to be most sensitive
to test instrument
Not clear whether differences due to material
tested or differential ceiling effects

Write a Comment

User Comments (0)

About PowerShow.com

The Determinants of Student Achievement: Different Estimates for Different Measures PowerPoint PPT Presentation