Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David Salesin, Michael Cohen

About This Presentation

Title:

Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David Salesin, Michael Cohen

Description:

Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David ... Lazy Snapping, Li et al., 2004. subject. background. Our Approach: cropping. Implement basic rules ... – PowerPoint PPT presentation

Number of Views:40

Avg rating:3.0/5.0

Slides: 57

Provided by: research5

Category:

more less

Transcript and Presenter's Notes

Title: Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David Salesin, Michael Cohen

1
Gaze-Based Interaction for Semi-Automatic Photo
Cropping

Anthony Santella, Maneesh Agrawala, Doug DeCarlo,
David Salesin, Michael Cohen

2
Problem

Crop images with minimal effort

Original
Crop
3
Application

Thumbnails

4
Application

Adaptive Documents (Jacobs et al., 2003)

5
Problem

Crop images with minimal interaction
Sub-problems
Identify important image content

6
Problem

Crop images with minimal interaction
Sub-problems
Identify important image content
Eye Tracking

7
Problem

Crop images with minimal interaction
Sub-problems
Identify important image content
Eye Tracking
Determine a goodcomposition

8
Identifying Image Content

What is the subject of this picture?

9
Identifying Image Content Interactive

Prior art draw box (traditional cropping)

10
Identifying Image ContentAutomatic

Prior art have the computer guess
Detect faces
Suh et al. 2002
Chen et al. 2003
Byers et al. 2003

11
Identifying Image ContentAutomatic

Prior art have the computer guess
Salience Identify prominent regions e.g. Itti
et al. 1998, Itti and Koch 2000
Suh et al. 2002
Chen et al. 2003
Setlur et al. 2004

12
Identifying Image ContentAutomatic

Prior art have the computer guess
Salience Identify prominent regions e.g. Itti
et al. 1998, Itti and Koch 2000
Suh et al. 2002
Chen et al. 2003
Setlur et al. 2004

13
Identifying Image Content

Prior art salience (Itti and Koch, 2000)

Salience estimate
14
Identifying Image Content

Prior art salience (Itti and Koch, 2000)

Salience estimate
15
Eye Tracking

Effortless
Someday (soonish) ubiquitous

16
Our Approach Eye Tracking
17
Our Approach Eye tracking
Fixation
18
Our Approach Eye tracking

Record of overt attention
For interaction and evaluation
Vertegaal, 1999
Duchowski, 2000
Crowe et al., 2000
DeCarlo and Santella, 2004

19
Our Approach
Our Approach

A viewer examines the image

20
Our Approach
Our Approach

A viewer examines the image

21
Our Approach

System segments image (Christoudias et al., 2002)

22
Our Goal Identifying Content

Relate fixations to segmentation
Label segments as content or background

Content
23
Our Approach Labeling Regions

Assign each region a value indicating how much it
was examined

24
Our Approach Partial Labeling

Most viewed 10 of regions are subject
Least viewed 50 are background

subject
unknown
background
25
Our Approach Graph Cut

Propagate partial labels to rest of image Lazy
Snapping, Li et al., 2004

subject
background
26
Our Approach cropping

Implement basic rules
Keep crop tight
Keep subject
Avoid cutting through subject
Avoid cutting background elements

27
Rules

Keep crop tight Minimize fraction ofarea
retained
Keep subject Minimize fraction of subject area
lost

28
Rules

Keep crop tight Minimize fraction of area
retained
Keep subject Minimize fraction of subject area
lost

29
Rules

Keep crop tight Minimize fraction of area
retained
Keep subject Minimize fraction of subject area
lost

30
Rules

Keep crop tight Minimize fraction ofarea
retained
Keep subject Minimize fraction of subject area
lost
Together result in tight inclusive crop

31
Rules

Avoid cutting subject Minimize amount of
subject on boundary
Avoid cutting background elements Minimize
crossings of segmentation

32
Rules

Avoid cutting subject Minimize amount of
subject on boundary
Avoid cutting background elements Minimize
crossings of segmentation

33
Rules

Avoid cutting subject Minimize amount of
subject on boundary
Avoid cutting background elements Minimize
crossings of segmentation

34
Optimization

Weights control relative importance of each rule
Exhaustive search minimizes weighted sum of rule
scores to find crop rectangle that best respects
rules

35
Results
Original
36
Results
Original
37
Results
Original
Crops
38
Results
Original
Crops
39
Results
Original
40
Results
Original
Crops
41
Results
Original
Crops
42
Results
Original
Crops
43
Evaluation
44
Evaluation
Original
Hand made
Salience(Suh et al. 03)
Gaze-Based
45
Experiment

Exhaustive forced choice
Eight subjects
50 images
350 trials per subject

Which image looks better?
46
Evaluation Results

Preference data not significant
Original Salience Gaze Hand
Original - .511 .439 .266
Salience .489 - .416 .339
Gaze .561 .581 - .325
Hand .734 .661 .675 -

47
Evaluation Results

Preference data not significant
Original Salience Gaze Hand
Original - .511 .439 .266
Salience .489 - .416 .339
Gaze .561 .581 - .325
Hand .734 .661 .675 -

48
Evaluation Results

Preference data not significant
Original Salience Gaze Hand
Original - .511 .439 .266
Salience .489 - .416 .339
Gaze .561 .581 - .325
Hand .734 .661 .675 -

49
Evaluation Results

Preference data not significant
Original Salience Gaze Hand
Original - .511 .439 .266
Salience .489 - .416 .339
Gaze .561 .581 - .325
Hand .734 .661 .675 -
Kendall analysis
hand gt gaze-based gt salience gt original (plt.01)

50
Future Work
Crops
51
Future Work

Eye tracking in context
Real environment
Range of real tasks

52
Future Work

Limitations of eye tracking small things

53
Future Work

Limitations of eye tracking small things

54
Future Work

Limitations of eye tracking small things

55
Future Work

Quantitative measures of composition and design
an interesting area of research
Strong future potential for implicit interaction
with images

56
Thank You

Deep thanks to B. Suh, H. Ling, B. Bederson and
D. Jacobs for access to their salience cropping
system
Thanks also to Eileen Kowler, Mary Czerwinski, Ed
Cutrell and to Phillip Greenspun for several
photos
This research is partially supported by the NSF
through grant HLC 0308121

Write a Comment

User Comments (0)