minimax - PowerPoint PPT Presentation

1 / 18

About This Presentation

Title:

minimax

Description:

column player: what is the best payoff I can guarantee if row player knows/guesses my move ... Row player is guaranteed to get at least the maximin value ... – PowerPoint PPT presentation

Number of Views:400

Avg rating:3.0/5.0

Slides: 19

Provided by: Vic448

Category:

more less

Transcript and Presenter's Notes

Title: minimax

1
minimax
2
Outline

zero-sum games
pure strategy minimax
saddle (equilibrium) points
mixed strategy minimax
minimax theorem
non-zero sum games
minimax as optimization

3
Minimax Beginning of Game Theory

Long before Nash
"As far as I can see, there could be no theory of
games without that theorem I thought there was
nothing worth publishing until the 'Minimax
Theorem' was proved," remarked von Neumann.

4
Zero-Sum Games

2 players
R_1(i,j) M(i,j)
R_2(i,j) -M(i,j)
player 1 is maximizer
player 2 is minimizer
Matching Pennies
Rock-Paper-Scissors

5
Strategies

row player what is the best payoff I can
guarantee if column player knows/guesses my move
maxi (minj M(i,j)) - maximin value
column player what is the best payoff I can
guarantee if row player knows/guesses my move
- minj (maxi M(i,j))
minj (maxi M(i,j)) - minimax value

opponent chooses the worst move for me
I move first
6
Example
7
Min and Max

maxi (minj M(i,j)) minj (maxi M(i,j))
better to go second can react
Proof
for any i
M(i,j) maxi M(i,j) for all j
minj M(i,j) minj (maxi M(i,j))
i arg maxi (minj M(i,j))
maxi (minj M(i,j)) minj (maxi M(i,j))

8
minimax gt maximinMatching Pennies example

maximin -1
minimax 1

9
Pure Strategy Saddle Points
(2,2) is a saddle point

Def (i,j) is a saddle point if
it is
1. minimum of the row i
M(i,j) M(i,j)
2. maximum of the column j
M(i,j) M(i,j)
Equivalent to pure strategy Nash Equilibria in
2-player zero-sum games

10
Property of Saddle Points

Let (i1,j1), (i2,j2) be two saddle points. Then
1. M(i1,j1) M(i2,j2)
2. M(i1,j2) M(i2,j1)
same value at all saddle points (i,j)
v M(i,j) value of the game

(1,1), (2,1), (1,3), (2,3)
saddle points
11
Pure Strategy Saddle Points and Minimax

In games with pure strategies, saddle point
exists if and only if
maxi (minj M(i,j)) minj (maxi M(i,j))
Row player is guaranteed to get at least the
maximin value
Column player is guaranteed to pay at most the
minimax value
Proof (?) a saddle point (i,j) exists if
maxi minj M(i,j) minj maxi M(i,j)

i arg maxi minj M(i,j)
maxi minj M(i,j) minj M(i,j) M(i,j) for
all j
j arg minj maxi M(i,j)
minj maxi M(i,j) maxi M(i,j) M(i,j)
But maxi minj M(i,j) minj maxi M(i,j) gt
minj M(i,j) M(i,j) maxi M(i,j)
Or equivalently
M(i,j) minj M(i,j) M(i,j)
M(i,j) maxi M(i,j) M(i,j)
Prove the other direction

(i,,j) minimum of row i
(i,,j) maximum of column j
13
Pure Strategy Games with Saddle Points

called strictly determined games
the outcome is known
can tell the opponent what you are going to do
what is there are no pure strategy saddle points?

14
Mixed Strategies

do not know what you are going to play
p (p1,..,pm)
q (q1,..,qm)
?ipi 1, ?jqj 1
M(p,q) ?i?jM(i,j) piqj pRqT
Def (p,q) is a saddle point if
M(p,q) M(p,q) M(p,q) for all p,q

15
Minimax Theorem

Every m-by-n 2-person zero-sum game has a
solution. More precisely, there is a unique
number v, called the value of the game, and there
are optimal (pure or mixed) strategies p,q such
that
maxpminq M(p,q) minqmaxp M(p,q) v M(p,q)
i.e., we know whats rational

16
Non-Zero Sum Games

minimax (maximin) strategies are not optimal
Battle of the Sexes (modified)
3 Nash equilibria

husband
wife
17
Nash vs Minimax

Minimax rational in 2-player zero-sum games,
i.e. players know what to do
Same value at all saddle points
Nash applies to all n-player games
Not clear how to reach Nash (multiple equilibria)
Nash outcome may not be the best one (Prisoners
Dilemma)
players do not know what to do

18
Maximin Optimization Problem

maxpminq M(p,q)
maxpv s.t.
v minq M(p,q)
?ipi 1, ?jqj 1
pi 0 for all i
qj 0 for all j

Mj column j, Mi row i
maxp,v v s.t.
pTMj v for all j
?ipi 1
pi 0 for all i

when column player knows the strategy of the row
player, there is no need to randomize
player 2 cannot make the payoff lower than v
19
Maximin Optimization Problem