Experimental Analysis of Algorithms: What to Measure - PowerPoint PPT Presentation

About This Presentation
Title:

Experimental Analysis of Algorithms: What to Measure

Description:

... CD file storage; stock cutting; iPod file storage; generalizations to 2D, ... How well does the FF heuristic perform? Experimental Study of First Fit Bin Packing ... – PowerPoint PPT presentation

Number of Views:66
Avg rating:3.0/5.0
Slides: 57
Provided by: informat1812
Category:

less

Transcript and Presenter's Notes

Title: Experimental Analysis of Algorithms: What to Measure


1
Experimental Analysis of Algorithms What to
Measure
  • Catherine C. McGeoch
  • Amherst College

2
Theory vs Practice
Predict puck velocities using the new composite
sticks.
3
Experiment
Control Factors
Isolate Components
Attach Probes
4
Experiments on Algorithms
Good news Bad news
Easy to probe. Nearly total experimental control. Simple mechanisms. Model validation not a problem. Fast experiments (often). Tons of data points (often). Unusual questions few techniques. Unusually precise questions need advanced techniques. Unusual data parametric methods are weak. Large and infinite sample spaces sampling difficulties. Generalization/abstraction from the computational artifact extrapolation. NP-hard problems problematic.
5
Standard statistical techniques
  • Comparison (estimation and hypothesis testing)
    same/different, bigger/smaller.
  • Interpolation (linear and nonlinear regression --
    fitting models to data.
  • 3. Extrapolation (??) -- building models of
    data, explaining phenomena.

cost
parameter
6
Standard statistical techniques
  • Comparison (estimation and hypothesis testing)
    same/different, bigger/smaller.
  • Interpolation (linear and nonlinear regression --
    fitting models to data.
  • 3. Extrapolation () -- building models of data,
    explaining phenomena.

cost
numerical parameter
7
Standard statistical techniques
  • Comparison (estimation and hypothesis testing)
    same/different, bigger/smaller.
  • Interpolation (linear and nonlinear regression --
    fitting models to data. Interpolation
  • 3. Extrapolation () -- building models,
    explaining phenomena.

cost
numerical parameter
8
Some Nonstandard Techniques
  1. Graphical analysis (GA) -- big data sets, unusual
    questions, interpolation, extrapolation.
  2. Exploratory data analysis (EDA) -- model
    building, unusual data sets.
  3. Variance reduction techniques -- simple
    mechanisms, complete control.
  4. Biased estimators -- NP-Hard problems, large
    sample spaces.

Today!
9
Case Study First Fit Bin Packing
Applications CD file storage stock cutting
iPod file storage generalizations to 2D,
3D... Bin packing is NP-Hard. How well does the
FF heuristic perform?
u
l
Consider weights in order of arrival pack each
into the first (leftmost) bin that can contain
it.
10
Experimental Study of First Fit Bin Packing
Input categories -- uni1 n reals drawn
uniformly from (0, 1 -- uni8 n reals drawn
uniformly from (0, .8 -- file n file
sizes (scaled to 0..1). -- dict n
dictionary word sizes (in 0..1).
Run First Fit on these inputs, analyze
results....
11
What to Measure?
What performance indicator to use for assessing
heuristic solution quality? The obvious
choice Number of Bins Other performance
indicators suggested by data analysis -
Graphical analysis (GA) - Exploratory data
analysis (EDA) - Variance reduction techniques
(VRT) - Biased estimators
Todays topic!
12
First Fit
input type n
Bins
uni1 30,000 15,270
uni1 60,000 30,446
uni1 120,000 60,809
uni8 30,000 12,217
uni8 60,000 24,385
uni8 120,000 48,965
file 30,000 9
file 60,000 13
file 124,016 27
dicto 60,687 23,727
dicto 61,406 22,448
dicto 81,520 28,767
Tabular data Good for comparisons. Which is
better? How much better? When is it better?
13
Graphical Analysis
Identify trends Find common scales Discover
anomalies Build models/explanations
14
Graphical analysis Locate Correlations
GA Look for trends.
15
GA (pairs plot) Look for correlations
16
y x
GA (Weight Sum vs Bins) Find a common scale
17
Conjectured input properties affecting FF packing
quality symmetry, discreteness, skew.
GA (distribution of weights in input) look for
explanations
18
Graphical Analysis some results...
File data FF packings are optimal! Due to
extreme skew in the weight distribution (few big
weights, many tiny weights). Dicto data
Sorting the weights (FFD) makes the packing
worse! Due to discrete weights in bad
combinations. (Bad FFD packings can be predicted
to within 100 bins. ) Uniform data Smaller
weight distributions (0 .8) give worse packings
(compared to optimal) than larger weight
distributions (0, 1). Probably due to
asymmetry.
(more)...
19
Top line 1-u Bottom line .55 - u/2
Conjecture The distribution of empty space per
bin has holes when u lt .85. These holes cause
bad FF packings.
GA (details) u vs distribution of es in bins
20
Graphical Analysis What to Measure
Input Number of weights Sum of weights
Number of weights gt 0.5 Weight
distribution Output Number of bins Empty
space Bins - Weight Sum Gaps Empty space
per bin Distribution of gaps Animations of
packings
Trends
Scale
Details
21
Exploratory Data Analysis
Smooth and the rough look at general trends, and
(equally important) deviation from trends.
Categories of data tune analysis for type of
data -- counts and amounts, proportions, counted
fractions, percentages ... Data transformation
adjust data properties using logarithms, powers,
square roots, ratios ...
No a priori hypotheses, no models, no estimators.
22
Number of bins is nearly equal to weight sum.
EDA The smooth ...
23
Deviation of bins from weight sum lt 8 of bins,
depending on input class. (Focus on 8 and 1 ...)
... and the rough Weight Sum vs Bins/Weight Sum
24
Focus on Uniform Weight Lists
Given n weights drawn uniformly from (0, u,
for 0 lt u lt 1. Consider FF packing quality as
f(u), for fixed n.
25
Number of bins is proportional to upper bound on
weights.
EDA (the smooth) u vs Bins, at n100,000
26
EDA (the Rough) u vs Bins/Weight Sum ( nu/2).
27
EDA Categories of Data Bin efficiency
Bins/Weight Sum Always gt 1, mean is in
1.0,1.7, variance large but decreasing in n.
Does it converge to 1 (optimal) or to 1c? Empty
space Bins - Weight Sum Always gt 0,
mean is linear or sublinear in nu, variance
constant in n.) Is it linear or sublinear in n?
a ratio
a difference
Convergence in n is easier to see
28
Packing efficiency n vs Bins/(un/2).
29
Empty Space (Bins - Weight)/nu
30
Power law fits y x.974 .. x.998
EDA (data transformation) Linear growth on a
log-log scale.
31
EDA Some Results
Number of Bins is near Weight Sum nu/2, with
largest deviations near u .8. Empty space (a
difference) has clearer convergence properties
than Bin Efficiency (a ratio). Empty space
appears to be asymptotically linear in n --
non-optimal -- for all u except 1.
Smooth rough
Data Categories
Transformation
32
Variance Reduction Techniques
y
y
0
x
x
Is y asymptotically positive or negative?
33
VRT Control Variates
Subtract a source of noise if its expectation is
known and it is positively correlated with
outcome. Expected number of bins b Bins
Weight Sum Empty Space EWS - nu/2 0
EB - (WS - nu/2) b VarB - (WS - nu/2)
VarB Var(WS-nu/2) - 2CovB,
(WS-nu/2). B - WS nu/2 ES nu/2. Weight
Sum is a Control Variate for Bins. ES nu/2 is
a better estimator of b.
34
Estimating b with B.
35
Estimating b with ES
36
More Variance Reduction Techniques
Common Random Variates Compare heuristics on
identical inputs when performance is correlated.
Antithetic Variates Exploit negative
correlation in inputs. Stratification Adjust
variations in output according to known
variations in input. Conditional Monte Carlo
More data per experiment, using efficient tests.
37
Biased Estimators
z
y
x
x
Bad estimator of mean(y) vs good estimator of
zlb(y).
38
Biased Estimators
z
y
x
x
Bad estimator of mean(y) vs cheap estimator of
zlb(y).
39
Biased Estimators of Optimal Packing Quality
Bounds on optimal number of bins U FF number
of bins used (or any heuristic) L Weight
sum L Number items gt 0.5 L FF/2 L
(FF -2)10/17 L (FFD - 4 )9/11
40
Summary What to Measure
First Fit Packings number of bins number of
weights sum of weights distribution of
weights number of weights gt .5 packing
efficiency empty space empty space per bin
bounds on bin counts
GA trends GA scale GA details EDA smooth and
rough EDA data categories EDA data
transformation VRT alternatives with same mean,
lower variance BE lower/upper bounds on the
interesting quantity
41
Another Example Problem
10
TSP Given graph G with n vertices and m
weighted edges, find the least-cost tour through
all vertices.
12
9
12
20
6
4
18
17
3

6
Applications well known.
42
TSP What to Measure
VRTs and BEs Mean edge weight is a control
variate for Tour Length. Beginning Tour is a
control variate for Final Tour, in iterative
algorithms with random starts. f(MST Matching)
is a biased estimator of Tour Length (lower
bound). Held-Karp Lower Bound is biased estimator
of Tour Length. Can you think of others?
43
TSP Graphical Analysis
Trends
Input Vertices n and Edges m ... can you
think of others? Output Tour Length ... can
you think of others?
Scale
Details
44
TSP Exploratory Data Analysis
SmoothRough
Any ideas?
Categories
Transformation
45
References
Tukey, Exploratory Data Analysis. Cleveland,
Visualising Data. Chambers, Cleveland, Kleiner,
Tukey, Graphical Methods for Data
Analysis. Bratley, Fox, Schrage, A Guide to
Simulation. C. C. McGeoch, Variance Reduction
Techniques and Simulation Speedups, Computing
Surveys, June 1992.
46
Upcoming Events in Experimental Algorithmics
January 2007 ALENEX (Workshop on Algorithm
Engineering and Experimentation), New
Orleans. Spring 2007 DIMACS/NISS joint workshop
on experimental analysis of algorithms, North
Carolina. (Center for Discrete Mathematics and
Theoretical Computer Science, and National
Institute for Statistical Sciences. ) June 2007
WEA (Workshop on Experimental Algorithmics),
Rome. (Ongoing) DIMACS Challenge on Shortest
Paths Algorithms.
47
Unusual Functions
48
EDA What To Measure
The Smooth and the Rough Bins used is near nu/2,
with large deviations in packing quality near u -
.8. Categories of Data Empty space (a
difference) has clearer convergence properties
than Bin Efficiency (a ratio). Data
Transformation Empty space appears to be
asymptotically linear in n -- implying
non-optimal convergence -- for all u except 1.
49
Theory and Practice
50
Hockey Science Predict the in-game puck
velocities generated by new stick technologies.
Practice
Theory
51
Theory, Experiment, Practice
52
What is experimental algorithmics?
Experiment
Practice
Theory
53
What is experimental algorithmics?
Practice
Real systems CPU times Measurements Real inputs,
benchmarks
Experiment
Theory
Abstraction Dominant operations Asymptotics Worst-
case, Average Case
54
First Fit
Consider weights in order of arrival pack each
into the first (leftmost) bin that can contain
it.
L rand_list(n, l, u)
S(L)Sum of weights in L
u
Opt(L) Optimal number of bins
l
B(L)Number of bins used in FF
ES(L) Total empty space in bins B(L) -S(L)
ESL(L) Empty space in all but the last bin.
55
First Fit Bin Packing
Given n weights drawn uniformly from the real
range (l, u, pack into unit-capacity bins
according to the First Fit (FF) heuristic.
u

l
n
1
0
56
Type n Bins, Bins/Sum
unif 30,000 15,270 / 1.081
unif 60,000 30,446 / 1.024
unif 120,000 60,809 / 1.033
file 30,000 9 / 1.021
file 60,000 13 / 1.015
file 124,016 27 / 1.013
dicto 60,687 23,727/ 1.037
dicto 61,406 22,448/1.047
dicto 81,520 28,767/1.038
Write a Comment
User Comments (0)
About PowerShow.com