Title: WebTraff: A GUI for Web Proxy Cache Workload Modeling and Analysis
1WebTraffA GUI for Web Proxy Cache Workload
Modeling and Analysis
- Nayden Markatchev
- Carey Williamson
- Department of Computer Science
- University of Calgary
2Introduction
- What is WebTraff?
- - An extended and improved version of ProWGen
(Proxy Workload Generator), including a GUI
interface to a useful set of tools for Web
traffic modeling and analysis - Purpose To facilitate the easy generation and
analysis of controllable and representative
workloads for Web caching simulations
3Talk Overview
- WebTraff General Information
- System Requirements, Data Formats,
Assumptions, Inputs, Outputs, Usage - Simple Demo
- Using WebTraff to generate and analyze a
workload, plus Web proxy cache simulation - Questions and Discussion
4System Requirements
- Software Requirements
- Unix based environment running X windows
- cc, gcc, g, tcl 8.0 or newer, tk 8.0 or newer,
wish, perl 5.0 or newer, gnuplot, gs
- Hardware Requirements
- 64 MB or more RAM
- 100 MB hard disk space (for storing long workload
traces)
Future Work Port to Windows (volunteers?)
5Example of the Web Workload Trace Format Used in
WebTraff
6Overview of WebTraff
- The WebTraff toolkit provides three main
functions - Web workload trace generation
- Web workload trace analysis
- Web proxy cache simulation
- Graphs displayed in PostScript format
7WebTraff GUI Interface
8Web Workload Generation
9Web Workload Generation
- This portion of the tool provides a GUI to
ProWGen Busari/Williamson 2001 - ProWGen models four key characteristics of Web
proxy workloads. - Zipf-like document popularity distribution
- High degree of one-time referencing
- Heavy-tailed file and transfer size distributions
- Temporal locality property in references
10Web Workload Generation (contd)
- Name of trace file being generated
- Sliding widgets for
- Number of references (lines) in a workload file
- Number of distinct Web objects in workload
- Percentage of objects that are one-timers
- Slope of Zipf-like document popularity profile
- Slope of Pareto tail for document size
distribution - Degree of statistical correlation (if any)
between size and popularity for Web objects
11Web Workload Generation (contd)
- The notion of temporal locality refers to
temporal correlation in referencing behaviour
(e.g., recent past good predictor of near future) - Four models for referencing behaviour
- Independent Reference Model (IRM)
- Static LRU Stack Model (SLRU)
- Dynamic LRU Stack Model (DLRU)
- New LRU Stack Model (NLRU)
12Web Workload Generation (contd)
- Popularity Bias parameter (hack!)
- This button was added to remedy a problem in
earlier version of ProWGen, which tended to
choose one-timers early in the trace and popular
documents late in the trace - Can now control this in workload generation
- Can visually check for stationarity of cache hit
ratio during simulations
13Web Workload Analysis
14Web Workload Analysis
- Two main categories of analysis functions
- Time series analysis (on the left)
- Web workload analysis (on the right)
- Radio buttons, slide bars and text boxes
available to control plotting characteristics
15Requests per Interval(time series plot)
16Bytes per Interval(time series plot)
17Popularity Distribution plot
18Document Size Distribution (zoomed)
19Log-Log Complementary Distribution (LLCD) plot
(size)
20LRU Stack Depth Analysis(time series plot)
21LRU Stack Depth Analysis(marginal distribution)
22Web Proxy Cache Simulation
23Web Proxy Cache Simulation
- Application-level caching simulation parameters
- Cache size
- Cache replacement policy
- Five replacement policies currently available
- Random replacement (RAND)
- First-In-First-Out (FIFO)
- Least-Recently-Used (LRU) (default setting)
- Least-Frequently-Used (LFU)
- Greedy-Dual-Size (GDS)
24DHR Results from Run Sizes
25DHR Results from Run Policies
26BHR Results from Run Policies
27Assessing Cache Steady State
28For More Information
- WebTraff toolkit
- http//www.cpsc.ucalgary.ca/carey/software.htm
- ProWGen A Synthetic Workload Generation Tool
for the Simulation Evaluation of Web Proxy
Caches - Busari/Williamson, Computer Networks, Vol 38, No
6, June 2002 - http//www.cpsc.ucalgary.ca/carey/publications.ht
m - Contact information
- Email carey,nayden_at_cpsc.ucalgary.ca