SMARTS:Accelerating Microarchitecture Simulation via Rigorous Statistical Sampling - PowerPoint PPT Presentation

1 / 16
About This Presentation
Title:

SMARTS:Accelerating Microarchitecture Simulation via Rigorous Statistical Sampling

Description:

The SFW plot shows the simulation rate when using functional warming to bound W. ... The right chart shows that U = 1000 is a reasonable choice across benchmarks and ... – PowerPoint PPT presentation

Number of Views:201
Avg rating:3.0/5.0
Slides: 17
Provided by: itKor
Category:

less

Transcript and Presenter's Notes

Title: SMARTS:Accelerating Microarchitecture Simulation via Rigorous Statistical Sampling


1
SMARTSAccelerating Microarchitecture Simulation
via Rigorous Statistical Sampling
  • Roland E. Wunderlich Thomas F. Wenisch Babak
    Falsafi James C. Hoe
  • Computer Architecture Laboratory
  • Carnegie Mellon University, Pittsburgh, PA
  • Joo hyung Kim

2
Contents
  • Introduction
  • Statistical sampling
  • The SMARTS framework
  • SMARTS in practice
  • Using SMARTS
  • Conclusion

3
Introduction
  • Current approaches
  • Abbreviated instruction execution streams
  • Fewer or smaller input sets
  • Shortcomings
  • On the efficiency front, large sampling units
  • On the accuracy front, no tight error bounds
  • The SMARTS approach
  • Statistical sampling theory
  • Exact and constructive procedure

4
Statistical sampling
  • Theory of sampling
  • Choosing a minimal
  • Representative sample to achieve a quantifiable
    accuracy and precision in the estimate
  • Not presume a normally distributed population
  • Our goal
  • Identify a minimal but representative sample from
    the population for microarchitecture simulation
  • Establish a confidence level for the error on
    sample estimates

Sampling variables
5
The SMARTS framework
  • Technique overview
  • Detailed mode
  • Functional mode
  • Detailed warming short-comings
  • Expensive
  • Difficult to derive analytically
  • Functional warming
  • The cache hierarchies and branch predictors are
    prime candidates

SMARTS variables
Systematic sampling in SMARTS
6
The SMARTS framework
  • Benchmarks
  • Demonstrate the effectiveness of SMARTS
  • CPI and EPI of the SPEC CPU2000 integer and
    floating-point
  • SimpleScalar 3.0sim-outorder simulator

Machine configurations
7
The SMARTS framework
  • Speedup opportunity

Coefficient of variation of CPI
VCPI decreases with increasing U because
Short-term CPI variations within a window of U
instructions are hidden by averaging over the
sampling unit.
8
The SMARTS framework
  • Simulation speedup model

Modeled SMARTS simulation rate The two SD plots
show the simulation rate without function
warming. The SFW plot shows the simulation rate
when using functional warming to bound W.
9
SMARTS in practice
  • SMARTSim
  • Sim-outorder
  • Support functional simulation prior to starting
    detailed simulation
  • SMARTSim
  • Repeated transitions back-and-forth between
    functional and detailed simulation modes
  • Accepts the systematic sampling parameters
  • Fast-forwarding options
  • Functional simulation
  • Functional simulation with warming

10
SMARTS in practice
  • Optimal sampling unit size

Optimal U
The left chart shows that the optimal U increases
with W. The right chart shows that U 1000 is a
reasonable choice across benchmarks and W.
11
SMARTS in practice
  • Effectiveness of detailed warming

Detailed warming requirements without functional
warming. (8-way) Choosing U 1000 and n
sufficient for a 99.7 confidence interval of 3
12
SMARTS in practice
  • Effectiveness of functional warming
  • All benchmarks have bias under 2.0
  • Only 6 benchmarks in each configuration exceed
    1.0

CPI bias achieved with functional warming and
minimal detailed warming
13
Using SMARTS
  • SMARTS procedure
  • W is selected to exceed the bounded history of
    the microarchitectural state
  • Setting U 1000
  • Determine n, and correspondingly k
  • Correct value for n
  • A sampling measurement is made using a generic
    initial value ninit
  • ntuned for a second run is calculated from the Vx
    of the initial run

14
Using SMARTS
  • Comparison to SimPoint
  • SimPoint advantages
  • Obviates the need for functional warming
  • Quick integration into a simulation
    infrastructure
  • Early termination of simulation
  • SimPoint shortcomings
  • High CPI error
  • Unquantifiable confidence in estimates

15
Using SMARTS
  • Comparison to SimPoint
  • SimPoint has a higher average error (3.7 vs.
    0.6)
  • higher worst-case error (-14.3 for gcc-2)
  • SimPoint estimate based on just a single instance
    of the basic block sequences yields a large
    error.
  • SMARTS uses the measured coefficient of variation
    to help gauge both the required sample size and
    the confidence in the estimates.

Comparison of SMARTS with SimPoint SimPoints
mean runtime per benchmark is 2.8 hours compared
to 5.0 hours for SMARTS.
16
Conclusion
  • Evaluation Results
  • Average error 0.64 on CPI and 0.59 on EPI
  • Average speedups of 35 and 60 times
  • Future simulator designs
  • Designers should focus on the simulators
    flexibility and realism
  • Designers should focus on techniques to speed up
    fast-forwarding and functional warming
Write a Comment
User Comments (0)
About PowerShow.com