Title: CS376 Evaluation
1. Evaluation Methods
Jeffrey Heer 28 April 2009
2. Project Abstracts
- For final version (due online Fri 5/1 at 7am)
- Flesh out concrete details. What will you build? If running an experiment, what factors will you vary and what will you measure? What are your hypotheses and why? Provide rationale!
- Need to add study recruitment plan and related work sections (see http://cs376/project.html).
- Iterate more than once! Stop by office hours to discuss.
3. What is Evaluation?
- Something you do at the end of a project to show it works, so you can publish it.
- Part of the design-build-evaluate iterative design cycle.
- A way a discipline validates the knowledge it creates.
- A reason papers get rejected.
4. Establishing Research Validity
- "Methods for establishing validity vary depending on the nature of the contribution. They may involve empirical work in the laboratory or the field, the description of rationales for design decisions and approaches, applications of analytical techniques, or proof of concept system implementations."
- CHI 2007 Website
5. Evaluation Methods
- http://www.usabilitynet.org/tools/methods.htm
7. What to evaluate?
- Enable previously difficult/impossible tasks
- Improve task performance or outcome
- Modify/influence behavior
- Improve ease-of-use, user satisfaction
- User experience
- Sell more widgets
- What is the motivating research goal?
11. UbiFit (Consolvo et al.)
12. Momento (Carter et al.)
13. Evaluation Methods in HCI
- Inspection (Walkthrough) Methods
- Observation, User Studies
- Experience Sampling
- Interviews and Surveys
- Usage Logging
- Controlled Experimentation
- Fieldwork, Ethnography
- Mixed-Methods Approaches
14. Proof by Demonstration
- Prove feasibility by building prototype system
- Demonstrate that the system enables task
- Small user study may add little insight
15. Inspection Methods
- Often called discount usability techniques
- Expert review of user interface design
- Heuristic Evaluation (Nielsen, useit.com/papers/heuristic)
- Visibility of system status
- Match between system and real world
- User control and freedom
- Consistency and standards
- Error prevention
- Recognition over recall
- Flexibility and efficiency of use
- Aesthetic and minimalist design
- Help users recognize, diagnose, and recover from errors
- Help and documentation
16. How many evaluators?
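One hedged way to answer this (not on the original slide): Nielsen and Landauer model the proportion of usability problems found by i independent evaluators as 1 - (1 - L)^i, where L is the average per-evaluator detection rate, commonly cited as about 0.31. A minimal Python sketch under that assumption:

def problems_found(i, detection_rate=0.31):
    # Expected fraction of usability problems found by i independent evaluators,
    # per the Nielsen & Landauer model: 1 - (1 - L)^i.
    # detection_rate ~ 0.31 is a commonly cited average; it varies by study.
    return 1 - (1 - detection_rate) ** i

for n in (1, 3, 5, 10, 15):
    print(f"{n:2d} evaluators -> ~{problems_found(n):.0%} of problems found")

With these numbers, roughly five evaluators find on the order of 85% of the problems, which is the usual rationale for small expert review panels.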
17. Usability Testing
- Observe people interacting with prototype
- May include
- Providing tasks (e.g., easy, medium, hard)
- Talk-aloud protocol (users' verbal reports)
- Usage logging (see the sketch after this list)
- Pre/post study surveys
- NASA TLX (Task Load Index) workload assessment survey
- QUIS (Questionnaire for User Interaction Satisfaction)
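A minimal sketch of the usage-logging bullet above (illustrative only; the logger class, participant ID, and file names are hypothetical, not from the course): record timestamped events per participant and task, then compute completion times offline.

import csv, time

class SessionLogger:
    # Minimal usage logger for a usability session: writes timestamped
    # events (task start/end, clicks, errors) to a CSV for later analysis.
    def __init__(self, participant_id, path):
        self.participant_id = participant_id
        self.file = open(path, "w", newline="")
        self.writer = csv.writer(self.file)
        self.writer.writerow(["participant", "timestamp", "task", "event"])

    def log(self, task, event):
        self.writer.writerow([self.participant_id, time.time(), task, event])

    def close(self):
        self.file.close()

# Hypothetical usage during one session:
log = SessionLogger("P01", "p01_events.csv")
log.log("task_easy", "start")
log.log("task_easy", "click:search_button")
log.log("task_easy", "end")
log.close()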
18. Wizard-of-Oz Techniques
19. Controlled Experiments
- What are the important concerns?
20. Controlled Experiments
- Measure response of dependent variables to manipulation of independent variables.
- Within- or between-subjects design
- Change independent variables within or across subjects
- Randomization, replication, blocking
- Learning effects
- Choice of measure and statistical tests
- t-test, ANOVA, Chi-squared (χ²), non-parametric tests (see the sketch after this list)
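A sketch of how the design choice maps onto a test (illustrative numbers; assumes SciPy is available and the measure is task completion time): a between-subjects design compares independent groups, while a within-subjects design compares paired measurements from the same participants.

from scipy import stats

# Hypothetical task-completion times (seconds) under two interface conditions.
cond_a = [41.2, 38.5, 44.0, 39.8, 42.1, 40.3]
cond_b = [36.9, 35.2, 38.8, 34.5, 37.0, 36.1]

# Between-subjects: different participants per condition -> independent-samples t-test.
between = stats.ttest_ind(cond_a, cond_b)

# Within-subjects: each participant uses both conditions -> paired t-test.
# Counterbalance condition order to control for learning effects.
within = stats.ttest_rel(cond_a, cond_b)

print(f"between-subjects: t = {between.statistic:.2f}, p = {between.pvalue:.3f}")
print(f"within-subjects:  t = {within.statistic:.2f}, p = {within.pvalue:.3f}")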
21. Experimental Desiderata
- P-value: probability that the observed results are due to chance
- Type I error: accepting a spurious result (false positive)
- Bonferroni's principle: if you run enough significance tests, you'll eventually get lucky (see the sketch after this list)
- Type II error: mistakenly rejecting a real effect (false negative)
- Inappropriate measure or test?
- Statistical vs. practical significance
- N = 1000, p < 0.001, avg Δt = 0.12 sec.
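To make Bonferroni's principle concrete (a sketch, not from the slides): with many tests at α = 0.05 the chance of at least one Type I error grows quickly, and the Bonferroni correction compensates by shrinking the per-test significance threshold.

# Chance of at least one spurious "significant" result across 20 tests at alpha = 0.05:
alpha, n_tests = 0.05, 20
p_any_false_positive = 1 - (1 - alpha) ** n_tests   # ~0.64
print(f"P(at least one Type I error) ~= {p_any_false_positive:.2f}")

# Bonferroni correction: require p < alpha / n_tests for each individual test.
corrected_threshold = alpha / n_tests                # 0.0025
print(f"per-test threshold after Bonferroni: {corrected_threshold:.4f}")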
22. Internal Validity
- Internal validity: is a causal relation between two variables properly demonstrated?
- Confounds: is there another factor at play?
- Selection (bias): appropriate subject population?
- Experimenter bias: researcher actions
23. External Validity
- External validity: do the results generalize to other situations or populations?
- Subjects: do subjects' aptitudes interact with the independent variables?
- Situation: time, location, lighting, duration
24. Ecological Validity
- The degree to which the methods, materials, and setting of the study approximate the real-life situation under investigation.
- Flight simulator vs. flying a plane
- Simulated community activity vs. the open web
27. Next Time: Distributed Cognition
- "The Power of Representation," in Things That Make Us Smart, 1993, pp. 43-76. Donald Norman
- "On Distinguishing Epistemic from Pragmatic Action," Cognitive Science, 1994, pp. 513-549. David Kirsh and Paul Maglio