Foundations of Software Design Fall 2002 Marti Hearst - PowerPoint PPT Presentation

About This Presentation
Title:

Foundations of Software Design Fall 2002 Marti Hearst

Description:

Specific, not abstract. Can only be done after code is written. 24 ... This is why we talk about the time in terms of a function of n. In the ArrayMax example, ... – PowerPoint PPT presentation

Number of Views:49
Avg rating:3.0/5.0
Slides: 36
Provided by: coursesIs8
Category:

less

Transcript and Presenter's Notes

Title: Foundations of Software Design Fall 2002 Marti Hearst


1
Foundations of Software DesignFall 2002Marti
Hearst
Lecture 10 Math Review, Intro to Analysis of
Algorithms    
2
Today
  • Math review
  • Exponents and logarithms
  • Functions and graphs
  • Intro to Analysis of Algorithms

3
Functions, Graphs of Functions
  • Function a rule that
  • Coverts inputs to outputs in a well-defined way.
  • This conversion process is often called mapping.
  • Input space called the domain
  • Output space called the range

4
Functions, Graphs of Functions
  • Function a rule that
  • Coverts inputs to outputs in a well-defined way.
  • This conversion process is often called mapping.
  • Input space called the domain
  • Output space called the range
  • Examples
  • Mapping of speed of bicycling to calories burned
  • Domain values for speed
  • Range values for calories
  • Mapping of people to names
  • Domain people
  • Range Strings

5
Example How many calroies does Bicycling burn?
  • Miles/Hour vs. KiloCalories/Minute

For a 150 lb rider. Green riding on a gravel
road with a mountain bike Blue paved road
riding a touring bicycle Red racing bicyclist.
From Whitt, F.R. D. G. Wilson. 1982. Bicycling
Science (second edition). http//www.frontier.iarc
.uaf.edu/cswingle/misc/exercise.phtml
6
Functions and Graphs of Functions
  • Notation
  • Many different kinds
  • f(x) is read f of x
  • This means a function called f takes x as an
    argument
  • f(x) y
  • This means the function f takes x as input and
    produces y as output.
  • The rule for the mapping is hidden by the
    notation.

7
  • Here f(x) 7x
  • A point on this graph can be called (x,y) or (x,
    f(x))

http//www.sosmath.com/algebra/logs/log1/log1.html
8
  • We also say yf(x)
  • A point on this graph can be called (x,y) or (x,
    f(x))
  • A straight line is defined by the function y ax
    b
  • a and b are constants
  • x is variable

http//www.sosmath.com/algebra/logs/log1/log1.html
9
Exponents and Logarithms
  • Exponents shorthand for multiplication
  • Logarithms shorthand for exponents
  • How we use these?
  • Difficult computational problems grow
    exponentially
  • Logarithms are useful for squashing them

10
  • The exponential function f with base a is denoted
    by
  • and x is any real number.
  • Note how much more quickly the graph grows than
    the linear graph of f(x) x
  • Example If the base is 2 and x 4, the
    function value f(4) will equal 16. A
    corresponding point on the graph would be (4,
    16).

http//www.sosmath.com/algebra/logs/log1/log1.html
11
http//www.sosmath.com/algebra/logs/log1/log1.html
12
  • Logarithmic functions are the inverse of
    exponential functions.
  • If (4, 16) is a point on the graph of an
    exponential function, then (16, 4) would be the
    corresponding point on the graph of the inverse
    logarithmic function.

http//www.sosmath.com/algebra/logs/log1/log1.html
13
Zipf Distribution(linear and log scale)
14
Rank Freq1 37 system2 32
knowledg3 24 base4 20
problem5 18 abstract6 15
model7 15 languag8 15
implem9 13 reason10 13
inform11 11 expert12 11
analysi13 10 rule14 10
program15 10 oper16 10
evalu17 10 comput18 10
case19 9 gener20 9 form
Zipf Curves for Term Frequency
15
43 6 approach44 5
work45 5 variabl46 5
theori47 5 specif48 5
softwar49 5 requir50 5
potenti51 5 method52 5
mean53 5 inher54 5
data55 5 commit56 5
applic57 4 tool58 4
technolog59 4 techniqu
Zoom in on the Knee of the Curve
16
Other Functions
  • Quadratic function
  • This is a graph of

(2,0)
(-2,0)
http//www.sosmath.com/algebra/
17
Other Functions
  • This one takes analysis to figure out
  • Graph of f(x) -0.3(x2)x(x-1)

http//www.sosmath.com/algebra/
18
Function Pecking Order
  • In increasing order

Adapted from Goodrich Tamassia
19
Summation Notation
20
Summation Notation
21
Summation Notation and Java
22
Iterative form vs. Closed Form
23
Why Analysis of Algorithms?
  • To find out
  • How long an algorithm takes to run
  • How to compare different algorithms
  • This is done at a very abstract level
  • This can be done before code is written
  • Alternative Performance analysis
  • Actually time each operation as the program is
    running
  • Specific to the machine and the implementation of
    the algorithm
  • Specific, not abstract
  • Can only be done after code is written

24
Counting Primitive Operations
2 steps 1 to initialize i
2 step each time (compare i to n, inc i)
2 steps
2 steps
1 step
Between 4(n-1) and 6(n-1) in the loop
It depends on the order the numbers appear in in
A
Adapted from Goodrich Tamassia
25
Algorithm Complexity
  • Worst Case Complexity
  • the function defined by the maximum number of
    steps taken on any instance of size n
  • Best Case Complexity
  • the function defined by the minimum number of
    steps taken on any instance of size n
  • Average Case Complexity
  • the function defined by the average number of
    steps taken on any instance of size n

Adapted from http//www.cs.sunysb.edu/algorith/le
ctures-good/node1.html
26
Best, Worst, and Average Case Complexity
Number of steps
Worst Case Complexity
Average Case Complexity
Best Case Complexity
N (input size)
Adapted from http//www.cs.sunysb.edu/algorith/le
ctures-good/node1.html
27
Doing the Analysis
  • Its hard to estimate the running time exactly
  • Best case depends on the input
  • Average case is difficult to compute
  • So we usually focus on worst case analysis
  • Easier to compute
  • Usually close to the actual running time
  • Strategy try to find upper and lower bounds of
    the worst case function.

Upper bound
Actual function
Lower bound
Adapted from http//www.cs.sunysb.edu/algorith/le
ctures-good/node2.html
28
Names of Bounding Functions
  • f(n) is ?(g(n)) means cg(n) is an upper bound on
    f(n)
  • f(n) is ?(g(n)) means cg(n) is a lower bound on
    f(n)
  • f(n) is ?(g(n)) means c1g(n) is an upper bound
    on f(n) and c2g(n) is a lower bound on f(n)
  • If f(n) is ?(g(n)) and f(n) is ?(g(n)) then f(n)
    is ?(g(n))
  • Here c, c1 and c2 are constants.

Adapted from http//www.cs.sunysb.edu/algorith/le
ctures-good/node2.html
29
Constants versus n
  • We usually focus on big-oh.
  • It is important to understand the difference
    between constants and n
  • A constant has a fixed value, doesnt change
  • Doesnt really matter what the value of the
    constant is.
  • n reflects the size of the problem
  • So n can get really really big
  • This is why we talk about the time in terms of a
    function of n
  • In the ArrayMax example,
  • We dont really need to pay attention to 4(n-1)
    versus 6(n-1)
  • They are both order n

Adapted from http//www.cs.sunysb.edu/algorith/le
ctures-good/node2.html
30
Problems that have large n
  • Put a list of all contributors to the 2000
    presidential campaigns into alphabetical order.
  • Run a photoshop-style filter across all the
    pixels of a high-resolution image.
  • Others?

31
Constraining Large Problems
  • Number of ways there are to fly roundtrip from
    the Bay Area to Washington DC.
  • Here n is the number of available flights on a
    given day throughout the country
  • But have to add lots of constraints too
  • Choose SFO, OAK, or SJ
  • Choose BWI, Dulles, or National
  • Choose airline
  • Direct, one stop, two stops?
  • Connect through Dallas or Denver or Chicago or
    LAX or
  • How long must be allowed for layovers?
  • Which combos are cheapest?
  • What about open-jaw?
  • If you try all possible combinations, it will
    take a very long time to run!!

32
The Crossover Point
One function starts out faster for small values
of n. But for n gt n0, the other function is
always faster.
Adapted from http//www.cs.sunysb.edu/algorith/le
ctures-good/node2.html
33
More formally
  • Let f(n) and g(n) be functions mapping
    nonnegative integers to real numbers.
  • f(n) is ?(g(n)) if there exist positive constants
    n0 and c such that for all ngtn0, f(n) lt cg(n)
  • Other ways to say this
  • f(n) is order g(n)
  • f(n) is big-Oh of g(n)
  • f(n) is Oh of g(n)

34
Plot them!
Both x and y linear scales
Convert y axis to log scale
(that jump for large n happens because the last
number is out of range)
Notice how much bigger 2n is than nk
This is why exponential growth is BAD BAD BAD!!
35
Summary Analysis of Algorithms
  • A method for determining, in an abstract way, the
    asymptotic running time of an algorithm
  • Here asymptotic means as n gets very large
  • Useful for comparing algorithms
  • Useful also for determing tractability
  • Meaning, a way to determine if the problem is
    intractable (impossible) or not
  • Exponential time algorithms are usually
    intractable.
  • Well revisit these ideas throughout the rest of
    the course.
Write a Comment
User Comments (0)
About PowerShow.com