Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David Salesin, Michael Cohen - PowerPoint PPT Presentation

1 / 56
About This Presentation
Title:

Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David Salesin, Michael Cohen

Description:

Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David ... Lazy Snapping, Li et al., 2004. subject. background. Our Approach: cropping. Implement basic rules ... – PowerPoint PPT presentation

Number of Views:40
Avg rating:3.0/5.0
Slides: 57
Provided by: research5
Category:

less

Transcript and Presenter's Notes

Title: Anthony Santella, Maneesh Agrawala, Doug DeCarlo, David Salesin, Michael Cohen


1
Gaze-Based Interaction for Semi-Automatic Photo
Cropping
  • Anthony Santella, Maneesh Agrawala, Doug DeCarlo,
    David Salesin, Michael Cohen

2
Problem
  • Crop images with minimal effort

Original
Crop
3
Application
  • Thumbnails

4
Application
  • Adaptive Documents (Jacobs et al., 2003)

5
Problem
  • Crop images with minimal interaction
  • Sub-problems
  • Identify important image content

6
Problem
  • Crop images with minimal interaction
  • Sub-problems
  • Identify important image content
  • Eye Tracking

7
Problem
  • Crop images with minimal interaction
  • Sub-problems
  • Identify important image content
  • Eye Tracking
  • Determine a goodcomposition

8
Identifying Image Content
  • What is the subject of this picture?

9
Identifying Image Content Interactive
  • Prior art draw box (traditional cropping)

10
Identifying Image ContentAutomatic
  • Prior art have the computer guess
  • Detect faces
  • Suh et al. 2002
  • Chen et al. 2003
  • Byers et al. 2003

11
Identifying Image ContentAutomatic
  • Prior art have the computer guess
  • Salience Identify prominent regions e.g. Itti
    et al. 1998, Itti and Koch 2000
  • Suh et al. 2002
  • Chen et al. 2003
  • Setlur et al. 2004

12
Identifying Image ContentAutomatic
  • Prior art have the computer guess
  • Salience Identify prominent regions e.g. Itti
    et al. 1998, Itti and Koch 2000
  • Suh et al. 2002
  • Chen et al. 2003
  • Setlur et al. 2004

13
Identifying Image Content
  • Prior art salience (Itti and Koch, 2000)

Salience estimate
14
Identifying Image Content
  • Prior art salience (Itti and Koch, 2000)

Salience estimate
15
Eye Tracking
  • Effortless
  • Someday (soonish) ubiquitous

16
Our Approach Eye Tracking
17
Our Approach Eye tracking
Fixation
18
Our Approach Eye tracking
  • Record of overt attention
  • For interaction and evaluation
  • Vertegaal, 1999
  • Duchowski, 2000
  • Crowe et al., 2000
  • DeCarlo and Santella, 2004

19
Our Approach
Our Approach
  • A viewer examines the image

20
Our Approach
Our Approach
  • A viewer examines the image

21
Our Approach
  • System segments image (Christoudias et al., 2002)

22
Our Goal Identifying Content
  • Relate fixations to segmentation
  • Label segments as content or background

Content
23
Our Approach Labeling Regions
  • Assign each region a value indicating how much it
    was examined

24
Our Approach Partial Labeling
  • Most viewed 10 of regions are subject
  • Least viewed 50 are background

subject
unknown
background
25
Our Approach Graph Cut
  • Propagate partial labels to rest of image Lazy
    Snapping, Li et al., 2004

subject
background
26
Our Approach cropping
  • Implement basic rules
  • Keep crop tight
  • Keep subject
  • Avoid cutting through subject
  • Avoid cutting background elements

27
Rules
  • Keep crop tight Minimize fraction ofarea
    retained
  • Keep subject Minimize fraction of subject area
    lost

28
Rules
  • Keep crop tight Minimize fraction of area
    retained
  • Keep subject Minimize fraction of subject area
    lost

29
Rules
  • Keep crop tight Minimize fraction of area
    retained
  • Keep subject Minimize fraction of subject area
    lost

30
Rules
  • Keep crop tight Minimize fraction ofarea
    retained
  • Keep subject Minimize fraction of subject area
    lost
  • Together result in tight inclusive crop

31
Rules
  • Avoid cutting subject Minimize amount of
    subject on boundary
  • Avoid cutting background elements Minimize
    crossings of segmentation

32
Rules
  • Avoid cutting subject Minimize amount of
    subject on boundary
  • Avoid cutting background elements Minimize
    crossings of segmentation

33
Rules
  • Avoid cutting subject Minimize amount of
    subject on boundary
  • Avoid cutting background elements Minimize
    crossings of segmentation

34
Optimization
  • Weights control relative importance of each rule
  • Exhaustive search minimizes weighted sum of rule
    scores to find crop rectangle that best respects
    rules

35
Results
Original
36
Results
Original
37
Results
Original
Crops
38
Results
Original
Crops
39
Results
Original
40
Results
Original
Crops
41
Results
Original
Crops
42
Results
Original
Crops
43
Evaluation
44
Evaluation
Original
Hand made
Salience(Suh et al. 03)
Gaze-Based
45
Experiment
  • Exhaustive forced choice
  • Eight subjects
  • 50 images
  • 350 trials per subject

Which image looks better?
46
Evaluation Results
  • Preference data not significant
  • Original Salience Gaze Hand
  • Original - .511 .439 .266
  • Salience .489 - .416 .339
  • Gaze .561 .581 - .325
  • Hand .734 .661 .675 -

47
Evaluation Results
  • Preference data not significant
  • Original Salience Gaze Hand
  • Original - .511 .439 .266
  • Salience .489 - .416 .339
  • Gaze .561 .581 - .325
  • Hand .734 .661 .675 -

48
Evaluation Results
  • Preference data not significant
  • Original Salience Gaze Hand
  • Original - .511 .439 .266
  • Salience .489 - .416 .339
  • Gaze .561 .581 - .325
  • Hand .734 .661 .675 -

49
Evaluation Results
  • Preference data not significant
  • Original Salience Gaze Hand
  • Original - .511 .439 .266
  • Salience .489 - .416 .339
  • Gaze .561 .581 - .325
  • Hand .734 .661 .675 -
  • Kendall analysis
  • hand gt gaze-based gt salience gt original (plt.01)

50
Future Work
Crops
51
Future Work
  • Eye tracking in context
  • Real environment
  • Range of real tasks

52
Future Work
  • Limitations of eye tracking small things

53
Future Work
  • Limitations of eye tracking small things

54
Future Work
  • Limitations of eye tracking small things

55
Future Work
  • Quantitative measures of composition and design
    an interesting area of research
  • Strong future potential for implicit interaction
    with images

56
Thank You
  • Deep thanks to B. Suh, H. Ling, B. Bederson and
    D. Jacobs for access to their salience cropping
    system
  • Thanks also to Eileen Kowler, Mary Czerwinski, Ed
    Cutrell and to Phillip Greenspun for several
    photos
  • This research is partially supported by the NSF
    through grant HLC 0308121
Write a Comment
User Comments (0)
About PowerShow.com