Temperature Discovery - PowerPoint PPT Presentation

About This Presentation
Title:

Temperature Discovery

Description:

global game tree search. DS: Divide-and-conquer approach. Idea: Divide game into sub-games ... Play local game 'Coupon stack' ... – PowerPoint PPT presentation

Number of Views:14
Avg rating:3.0/5.0
Slides: 22
Provided by: martinm98
Category:

less

Transcript and Presenter's Notes

Title: Temperature Discovery


1
Temperature Discovery
  • Martin Müller, Markus Enzenberger and Jonathan
    Schaeffer
  • Introduction local and global search
  • Local search algorithms
  • Temperature
  • Environments and coupon stacks
  • Temperature discovery search
  • First results

2
Local and Global Search
  • Local search
  • Partition game into sum of subgames
  • Local analysis
  • Problem how to evaluate local results?
  • Central questionwhich sums of games are wins?
  • Global search
  • Single, monolithic game state
  • Full board evaluation
  • Single game tree,minimax backup
  • Central questionwhat is the minimax score?

3
Why Local Search?
  • Global Alpha-beta Search time exponential in
    size of full problem
  • Local search time exponential in size of
    subproblems

4
(No Transcript)
5
Results of Local Searches
  • 1. Exact combinatorial game value(Winning Ways,
    my Ph.D. thesis on Go endgames)
  • 2. Inexact, but very good temperatures,
    thermographs(Go Berlekamp, Spight, Fraser,
    Müller,Amazons Theo Tegos)
  • 3. Even less exact heuristic search to estimate
    the temperature (This work, with Markus and
    Jonathan)

6
1. Decomposition Search
  • Usual global game tree search
  • DSDivide-and-conquer approach
  • Idea
  • Divide game into sub-games
  • Do a local search
  • Combine local resultsCombinatorial game theory

7
2. Temperatures, Thermographs
8
3. Temperature Discovery
  • Problem Thermographs computed bottom-up
  • Needs complete local game tree
  • Sometimes too expensive
  • Heuristic evaluation works well in global search
  • Idea use it in local search to estimate
    temperature

9
Temperature Discovery
  • A different way to compute temperatures
    (Berlekamp)
  • Play local game Coupon stack
  • Choose between play on the board and coupon
    (move of known value)
  • Temperature of coupon of value t is t. So can
    estimate temp of board!

10
Example
  • Coupon stack 3,2,1,0,-1
  • Amazons board
  • Search depth 4
  • 1. B Coupon(3)2. W C8-C7xC83. B Coupon(2)
  • 4. W Coupon(1)

9 . . X . 8 . . W . 7 X . . B 6 . X . . A B
C D
11
Example (contd)
  • Uses heuristic evaluation of board
  • Depth-limited search
  • Result
  • when does it change from taking coupons to board?
  • Estimate for the temperature

12
Experiments (1)
  • Run temp. discovery search on small areas
  • Compare estimated t against exact t from Theo
    Tegos Databases
  • Plot real t vs estimated t
  • Works OK, but still some problems/bugs?

13
Experiments (2)
  • Sample starting positions with 2, 4 and 6
    subgames
  • Subgame size 4x4, 5x5
  • Temperature discovery in each local game
  • Simple hotstrat player
  • Play 2x200 games against Arrow (full board search)

14
Two, four, six 4x4 subgames
  • Coupon player vs Arrow
  • About 10 sec./move

15
Two, four, six 4x4 subgames
16
Two and Four 5x5 subgames
17
5x5 subgames
  • 13.25 average over 200 pairs of games (stdDev
    11.5)

18
Control experiment
  • Arrow(10sec) vs Arrow on four 4x4
  • Different time limits for opponent

5s
1s
30s
10s
19
Sample 4x5x5 Game
20
To Do...
  • More experiments, e.g. 6x5x5, 6x6,...
  • Try on real games
  • Better sum game algorithm
  • Tune, fix temperature discovery search
  • Optimal solver? (Needs global search too)
  • The real goal - apply to Go!

21
Summary
  • Local search algorithm
  • Try to discover temperature by minimax search
  • Applications Amazons, future Go
  • First results it works...
  • Still lots of open questions
Write a Comment
User Comments (0)
About PowerShow.com