Language Assessment - PowerPoint PPT Presentation

1 / 23

About This Presentation

Title:

Language Assessment

Description:

(2) stays within appropriate time constraints (3) is relatively easy ... A reliable test is ... street noise, temperature, desks and chairs, the amount ... – PowerPoint PPT presentation

Number of Views:3063

Avg rating:5.0/5.0

Slides: 24

Provided by: ccw2Oc

Category:

more less

Transcript and Presenter's Notes

Title: Language Assessment

1
Language Assessment

Chap. 2 Principles of Language Assessment

2
Practicality

Practical means
(1) is not excessively expensive
(2) stays within appropriate time constraints
(3) is relatively easy to administer, and
(4) has a scoring/evaluation procedure that is
specific and time-efficient

3
Reliability

A reliable test is consistent and dependable.
On two different occasions or by different
people, the test should yield similar results.
Student-Related Reliability may be caused by
temporary illness, fatigue, bad day, anxiety,
and other physical or psychological factors.

Rater Reliability
Human error, subjectivity, and bias may enter
into the scoring process.
Inter-rater reliability occurs when two/more
scorers yield inconsistent scores of the same
test (scoring criteria, inexperience,
inattention,preconceived biases).

Intra-rater reliability occurs because of unclear
scoring criteria, fatigue, bias toward good and
bad students, or carelessness.
Test Administration Reliability
Unreliability may also result from the conditions
in which the test is administered.
Examples street noise, temperature, desks and
chairs, the amount of light.

6
Reliability Validity

Test Reliability
The test itself can cause measurement errors.
Examples a long test, a timed test, ambiguous
test items, or a test item with more than one
answer.
Validity the degree to which a test measures
what it is supposed to measure or can be used
successfully for the purposes for which it is
intended.

7
Validity

For example, a valid test of reading ability
actually measures reading ability.
Five types of validity content validity,
criterion-related validity, construct validity,
consequential validity, and face validity.
Content validity A test adequately and
sufficiently measures the particular
skills/behavior it sets out to measure.

8
Validity

Examples A test that requires the learner
actually to speak within an authentic context
(T).
An oral test asks students to answer
multiple-choice questions requiring grammatical
judgments (F).
Direct testing involves the test-taker in
actually performing the target task. e.g.
producing target words orally.

9
Validity

Indirect testing tests the learner with a task
that is related to the target task.
For example, in a test of oral production, the
mark of stressed syllables in a list of written
words is only indirect testing.
Criterion-related validity a form of validity in
which a test is compared or correlated with an
outside criterion measure.

10
Criterion-Related Validity

Concurrent validity A test has concurrent
validity if its results are supported by other
concurrent performance beyond the assessment
itself.
For example, a high score on the final exam. will
be substantiated by actual proficiency in the
language.
Predictive validity A test accurately predicts
future performance. e.g. a language aptitude test
predicts second/foreign language ability

11
Construct Validity

A construct is any theory, hypothesis, or method
that attempts to explain observed phenomena in
our universe of perceptions.
For example, proficiency and communicative
competence are linguistic constructs.
Construct validity The test items can reflect
the essential aspects of the theory on which the
test is based. (e.g. the relationship between a
test of communicative competence and the theory
of c. c.)

12
Construct Validity

The scoring analysis for the interview includes
pronunciation, fluency, grammatical accuracy,
vocabulary use, and socio-linguistic
appropriateness.
If an proficiency interview includes only
pronunciation and grammar being evaluated, the
construct validity is questionable. (TOEFL)

13
Consequential Validity

Consequential validity includes all the
consequences of a test, including the accuracy in
measuring intended criteria, the impact on the
preparation of test-takers, the effect on the
learner, and the social consequences of a tests
interpretation and use.

14
Face Validity

Face validity refers to the degree to which a
test looks right, and appears to measure the
knowledge or abilities it claims to measure,
based on the subjective judgment.
Face validity means that the students perceive
the test to be valid. (Does the test, on the face
of it, appear from the learners perspective to
test what it is designed to test?)

15
Authenticity

The language is as natural as possible.
Items are contextualized rather than isolated.
Topics are meaningful (relevant, interesting).
Thematic organization to items is provided.
Tasks represent, or close to, real-world tasks.

16
Washback

Washback is the effect of testing on teaching and
learning.
It generally refers to the effects the tests have
on instruction in terms of how students prepare
for the test.
Ss incorrect responses/correct
responses/strategies for success/ can be served
as learning devices.
Comment generously and specifically on Ss test
performance.

17
Washback

In reality, letter grades and numerical scores
give no information of intrinsic interest to the
student. Instead, give praise for strengths and
offer constructive criticism of weaknesses.
Formative tests provide washback with information
to the learner on progress toward goals. Teachers
tend to offer no means of washback except grades
in summative tests.

18
Applying Principles to the Evaluation

(1). Are the test procedures practical?
(administrative details, time frame, smooth
administration, materials and equipment,
cost, scoring system, reporting results)
(2). Is the test reliable?
(clean test sheet, audible sound
amplification,
equally visible video input,
lighting,temperature,
objective scoring procedures)

Intra-rater reliability guidelines
(consistent sets of criteria, uniform attention,
double check consistency, the same standards to
all, avoidance of fatigue)
(3) Does the procedure demonstrate content
validity? (two steps)
A Are classroom objectives identified and
appropriately framed?

B Are lesson objectives represented in the form
of test specification?
(4) Is the procedure face valid and biased for
best?
Conditions for face valid
a. Directions are clear.
b. The structure of the test is organized
logically.
c. Its difficulty level is appropriately
pitched.

d. The test has no surprises.
e. Timing is appropriate.
(5). Are the test tasks as authentic as possible?
a. as natural as possible
b. as contextualized as possible
c. interesting, enjoyable, and/humorous
d. thematic organization
e. real-world tasks

(6) Does the test offer beneficial washback to
the learner?
(content validity, preparation time before the
test, reviewing after the test, self-assessment,
and peer discussion of the test results)

23
(No Transcript)

Write a Comment

User Comments (0)