Research - PowerPoint PPT Presentation

1 / 43

About This Presentation

Title:

Research

Description:

you get a better understanding of what the results mean and provide you with ... concert with a substantive review of the actual item to decide if that item is ... – PowerPoint PPT presentation

Number of Views:29

Avg rating:3.0/5.0

Slides: 44

Provided by: uncg8

Category:

more less

Transcript and Presenter's Notes

Title: Research

1
Research

Dr. Terry Ackerman
Chair, Department of Educational
Research Methodology
University of North Carolina at Greensboro

2
Objectives of this Session In this session I
will walk you through a basic and thorough item
and test analysis to help you get a better
understanding of what the results mean and
provide you with suggestions about how this
information can be recycled to improve classroom
instruction
3

Topics that will be covered
Item/Test analysis using ITEMAL
Comparison of Groups using SPSS

4
An Overview of the ITEMAL5
5

When would you use this program?
ITEMAL5 is an item analysis program designed to
provide the researcher or practitioner with
psychometric information about the quality of
individual items. This program is designed to
analyze assessment data in which each item can be
scored as correct or incorrect. Once the output
is obtained the researcher or practitioner should
go through the statistics for each item in
concert with a substantive review of the actual
item to decide if that item is appropriate for
the assessment.

Why would you use this program?
Anytime you administer an assessment you must
perform an item analysis and evaluate both the
reliability and validity of the responses. This
should be done before you do any statistical
analysis with the results. Researchers should
always include a discussion of the reliability
and validity of their data as part of the
descriptive analysis before discussing any
statistical results. Educators can examine which
incorrect answers were most often chosen to help
identify errors in students reasoning.

How could you use this program?
If you have identifiable sub groups (e.g.,
gender, ethnicity, SES) you should probably run
the program for each group separate as well as
combined to see if there are any noticeable
differences. Remember, the item and test
statistics will change if you add or drop
examinees or add or drop items.
If an item has poor psychometric properties
(e.g., low point biserials, is too easy or too
difficult or correlates negatively with other
items) you need to carefully examine that item
and decide if you want to retain it. If you
decide to delete it then you would have to rerun
the analysis.

How would you run this program?
ITEMAL5 is an item analysis program designed to
read two input files the file containing your
response data and a key file. Both files must be
text files. It is easiest to create these in
either Notepad or Wordpad. The program will read
in the key, score and analyze the data and then
put the output into a file that you provide the
name for.

DATA File
The data file contains a row of item responses
for each examinee. If you had 36 examinees you
would have 36 rows. The length of each row should
equal the number of items. Convert each response
to an item to an integer using the same scheme
that is used in the key, (e.g., a 1, b 2, c
3, etc.) If an examinee does not answer an item
give them a zero for that item. Thus, if an
examinee had the responses abbcade_ab you would
enter 1223145012.

EXAMPLE INPUT RESPONSE FILE
DanC111323
MarK133142
SalM142322
JohP424232
AdrT513132

ID
Item responses
These data would be read in with the
following format (4x,6i1)
11

KEY File
The key file is simply a row of integers denoting
the correct responses. One integer for each item
(i.e., if you have 20 items your row would be 20
values long.) Use the following scheme a 1, b
2, c 3, d 4, e 5. Your key must be made up
of values from 1 to 5.

The user is prompted to answer the following
questions
What will appear on Type of Response Input
the screen by the user
Enter the file name containing item responses
mydata.txt
Enter response file format (e.g. (30I1)) (5i1)
Enter the file name containing the key
mykey.txt
Enter key file format (e.g. (30I1)) (5i1)

The user is prompted to answer the following
questions
What will appear on Type of Response Input
the screen by the user
Enter the number of items 5
Enter the number of people 15
Enter in the output file name output.txt

How do you interpret the output?

KEY
22232

The key is printed out so that the user can check
to make sure the program is reading the key file
correctly.
16

Response Vectors for the
First Five Examinees
22234
23221
22211
40121
23231

The response vectors for the first five examinees
are printed out so that the user can make sure
that the program is reading the dataset properly.
17
ITEM 1 0
1 2 3 4 5 PBIS .5817 UPP N
3 0 3 0 0 0 0 BIS
.9450 MID N 3 0 3 0 0 0
0 DIFF .8750 LOW N 2 1 1 0
0 0 0 IREL .5090 TOT N 8
1 7 0 0 0 0 DIS .4000
For skewed or nonnormal distributions the
frequencies may not be symmetrical
Subjects are divided into three groups based
upon their total score.
Middle Group Middle 46
Lower Group Bottom 27
Upper Group Top 27
Total score
18
For each item the possible responses are listed.
There is a maximum of 5 categories. Note
that the categories are numerical and are
designed to represent the choices in a multiple
choice item a1, b2,c3 etc. 0no response.
The asterisked value() represents the correct
answer.

Individual Item Statistics
ITEM 1 0 1 2
3 4 5
PBIS .8427 UPP N 9 0 0 9
0 0 0
BIS 1.1392 MID N 1 0 0 1
0 0 0
DIFF .7333 LOW N 5 0 1 1
1 2 0
IREL .3727 TOT N 15 0 1 11
1 2 0
DIS -.2000

19
The program determines the number of examinees
from each group that selected each alternative.
You would expect more subjects from the Upper
group to have the largest number of people
selecting the correct alternative. Note in the
Upper group below all 9 selected 2 or b. In the
Lower group of the 5 examinees, one selected 1
(a) one selected 2 (b) one selected 3 (c) and
two selected 4 (d).

Individual Item Statistics
ITEM 1 0 1 2
3 4 5
PBIS .8427 UPP N 9 0 0 9
0 0 0
BIS 1.1392 MID N 1 0 0 1
0 0 0
DIFF .7333 LOW N 5 0 1 1
1 2 0
IREL .3727 TOT N 15 0 1 11
1 2 0
DIS .8000

The total number of subjects selecting each
alternative is listed in the bottom row.
20
The PBIS is the point biserial. It is the
Pearson correlation between the scored responses
for an item and the total test score. It is a
measure of discrimination. Acceptable values are
between .3 and .8. Values less than .2 you
probably should edit.
Items Ss 1 2 3 4 5 Tot 1
1 0 1 1 0 3 2 0 0 0 1 0 1 3
1 1 0 0 0 2 4 1 1 1 1 0 4

Individual Item Statistics
ITEM 1 0 1 2
3 4 5
PBIS .8427 UPP N 9 0 0 9
0 0 0
BIS 1.1392 MID N 1 0 0 1
0 0 0
DIFF .7333 LOW N 5 0 1 1
1 2 0
IREL .3727 TOT N 15 0 1 11
1 2 0
DIS .8000

21
The BIS is the biserial. It is a theoretically
computed measure of discrimination. It is always
larger than the PBIS. Acceptable values are
between .3 and .8.

Individual Item Statistics
ITEM 1 0 1 2
3 4 5
PBIS .8427 UPP N 9 0 0 9
0 0 0
BIS 1.1392 MID N 1 0 0 1
0 0 0
DIFF .7333 LOW N 5 0 1 1
1 2 0
IREL .3727 TOT N 15 0 1 11
1 2 0
DIS -.2000

22
The DIFF is the p-value or proportion correct.
Acceptable values are between .3 and .8. The
larger the value the easier the item. It is
equal to the total number of correct answers
divided by the total number of subjects.

Individual Item Statistics
ITEM 1 0 1 2
3 4 5
PBIS .8427 UPP N 9 0 0 9
0 0 0
BIS 1.1392 MID N 1 0 0 1
0 0 0
DIFF .7333 LOW N 5 0 1 1
1 2 0
IREL .3727 TOT N 15 0 1 11
1 2 0
DIS -.2000

11/15.7333
23
The IREL is the item reliability. It is the
standard deviation of the scored responses of
the item times the point biserial. The item
standard deviation is

Individual Item Statistics
ITEM 1 0 1 2
3 4 5
PBIS .8427 UPP N 9 0 0 9
0 0 0
BIS 1.1392 MID N 1 0 0 1
0 0 0
DIFF .7333 LOW N 5 0 1 1
1 2 0
IREL .3727 TOT N 15 0 1 11
1 2 0
DIS -.2000

24
The DIS is another measure of item discrimination
called the item discrimination index. It is the
calculated as

Individual Item Statistics
ITEM 1 0 1 2
3 4 5
PBIS .8427 UPP N 9 0 0 9
0 0 0
BIS 1.1392 MID N 1 0 0 1
0 0 0
DIFF .7333 LOW N 5 0 1 1
1 2 0
IREL .3727 TOT N 15 0 1 11
1 2 0
DIS .8000

25
The Inter-item Correlations are the Pearson
correlations between the scored responses for
each pair of items. Items which correlate
negatively with other items cause the reliability
to decrease. These items need to be reviewed
substantively to see if they are measuring the
same skills as the other items.
Items Ss 1 2 3 4 5 Tot 1 1 0 1 1
0 3 2 0 0 0 1 0 1 3 1 1 0 0
0 2 4 1 1 1 1 0 4

Inter-item Correlations
1 2 3 4 5
ITEM 1 1.00 .36 .66 .49
.30
ITEM 2 .36 1.00 .36 .43
-.30
ITEM 3 .66 .36 1.00 .49
.30
ITEM 4 .49 .43 .49 1.00
-.07
ITEM 5 .30 -.30 .30 -.07 1.00

OVERALL TEST SCORE DISTRIBUTION
CUMULATIVE
SCORE FREQUENCY FREQUENCY
0 3 3
1 2 5
2 1 6
3 5 11
4 4 15
5 0 15

The Frequency and Cumulative Frequency
distributions are based upon the number correct
scores for all examinees
27
The Test Statistics represent the first four
moments of the raw scores. The KR20 value
represents the internal consistency measure of
reliability.

Test Statistics
MEAN 2.333
STANDARD DEVIATION 1.491
SKEWNESS -.461
KURTOSIS -1.284
KR20 .695

Remember, negative skewness means you have more
high scores than low (i.e., the test was not very
difficult).
KR-20 is an estimate of internal consistency
reliability
28
Introduction to SPSS (Statistical package for the
Social Sciences)

SPSS is a complete user-friendly software that
will allow researchers to do both descriptive and
inferential univariate and multivariate
statistical analysis.

29
Spearman-Brown formula a. A formula used to
estimate how much a test's reliability
will increase (or decrease when the length
of the test in increased (or decreased) by
adding (or removing) parallel items b.
Formula where L the number of
times longer the new test will be
30
Example of the Spearman-Brown FormulaGiven a
20-item test with 1. What would be the
reliability if the test length were tripled
(i.e., made three times longer by adding 40
parallel items)
31
What would the reliability be if I reduced the
test length to only 10 items?
32
How much longer (i.e., number of times longer)
would I have to make the test to achieve
? is what you have, is what you
want
33
f. Standard error of measurement - (SEM) 1.
The formula to calculate the SEM is given
as 2. Confidence intervals
(bands) CI Observed Score 1.0(SEM)
34
g. Estimating true scores from observed
scores 1. Estimated true score where
2. Estimated true scores are always
closer to the mean test score than the
obtained test score because all tests
contain measurement error
35
Make sure your variable columns are labeled
SPSS ANOVA Handout
For one-way ANOVA you put your data into two
columns one for grouping (i.e., levels of the
independent variable) and one for values of
the dependent variable.
Note Subject 4 belongs to Group 2 and scored
9 on the dependent variable.
These are the data used in Example 1 for Chapter
11
36
To perform the ANOVA, after the data are
entered click on STATISTICS (v.8) or ANALYZE
(v.9,10) COMPARE MEANS, ONE-WAY ANOVA.
37
Dependent variables are highlighted and brought
into the Dependent List.
Independent variables are highlighted and brought
into the FACTOR box.
OPTIONS includes the computation of means and
standard deviations for each of the levels of
the independent variable.
To have Post Hoc tests performed click on POST
HOC. These are discussed in Chapter 12.
Apriori identified contrasts can be performed.
38
Always ask for the Descriptive statistics
(means, standard deviations) to be calculated
Click on Continue to return to the ANOVA box
To perform Levenes test for the Homogeneity of
Variance assumption click here.
If you click on OPTIONS this box will appear.
If a subjects data are missing that subject or
case can be deleted from the entire
analysis (TOP choice) or that subject can
be deleted from only the levels for which data is
missing (BOTTOM choice).
The means for each level of the
independent variable can be plotted.
39
This is the box that appears when you click on
Post Hoc.
These post hoc tests apply only when the
homogeneity of variance assumption is met. These
tests are discussed in Chapter 12.
These post hoc tests apply when the
assumption for homogeneity of variance assumption
(Levenes Test) is NOT met.
Click on CONTINUE to return to the main ANOVA
box.
This is where you select the alpha level for the
post hoc tests.
40
These are the Means, Standard Deviations, Standard
Errors, 95 Confidence Intervals, and Minimum
and Maximum values for each level of the
independent variable.
If this Significance value is less than .05,
then the assumption of homogeneity of variance is
NOT met.
Levenes test of Homogeneity of Variance is an
F-test and thus has two degrees of freedom.
41
The Treatment effect corresponds to Between
Groups and the Error corresponds to Within Groups.
The Significance value represents the proportion
of area in the F-distribution from the F-value
on out. The Fcrit is based on two degrees of
freedom values, dfB and dfW.
The F ratio equals MSB/MSW
Sum of Squares, SS, refers to the numerator of
the variance. SST SSBSSW
Mean Square values are variances. They
equal the Sum of Squares values divided by the
degrees of freedom
df refers to your Degrees of Freedom dfT dfB
dfW
42
The Y-axis is for the dependent variable (i.e.,in
terms of the mean metric)
The graph shows the mean values of your
dependent variable for each level of independent
variable.
The X-axis is for the independent variable and
lists the values for the three groups.
43
(No Transcript)

Write a Comment

User Comments (0)