Admission Control and Request Scheduling in Dynamic ECommerce Web Sites - PowerPoint PPT Presentation

1 / 50

About This Presentation

Title:

Admission Control and Request Scheduling in Dynamic ECommerce Web Sites

Description:

The Gatekeeper Transparent Proxy. Transparently intercepts DB requests ... Pre-emption isn't free (context switch costs, cache affinity) Priorities and inheritance ... – PowerPoint PPT presentation

Number of Views:131

Avg rating:3.0/5.0

Slides: 51

Provided by: EricH66

Category:

more less

Transcript and Presenter's Notes

Title: Admission Control and Request Scheduling in Dynamic ECommerce Web Sites

1
Admission Control and Request Scheduling in
Dynamic E-Commerce Web Sites

Sameh Elnikety, Erich Nahum,
John Tracey, Willy Zwaenepoel

C.S. Dept. EPFL
IBM T.J.Watson Research Center
2
Dynamic Content
1
2
3
3
Increasing Online Commerce

11B in 3rd Quarter 2002 (up 37)
11B in last 2 months of 2002 (up 40)

(Source News.com)
4
Two Key Problems

Overloaded Web Sites
The Slashdot Effect
Unanticipated load causes site to crash
Unresponsive Web Sites
The Abandoned Shopping Cart
Unacceptable delays lead to reduced usage
Reduced usage leads to reduced

How can we address these problems for dynamic
sites?
5
Generating Dynamic Content
Database Server
Web Server
Dynamic Content Generator
http

Consists of 3 Components
Web Server static content
Dynamic Content Generator Java servlets
DB Server state of the business

6
Outline

Motivation Background
The Gatekeeper Proxy
Admission Control
Request Scheduling
Experimental Environment
Results
Summary and Conclusions

7
Admission Control

To prevent overload, perform admission control
Notion of capacity in the system
Identify the job ahead of time amount of work
generated
Only let jobs in if they wont overload system
Once you reach full capacity
Make jobs wait
Drop jobs

8
The Gatekeeper Transparent Proxy
Web Server
Dynamic Content Generator
Gate Keeper
Database Server
http

Transparently intercepts DB requests
connections to the DB via the JDBC interface
Maintains several measurement-based estimates
Total capacity of the database
Current estimate of DB load
Work generated by each query type

9
Estimating Work by Query Type
Web Server
Dynamic Content Generator
Gate Keeper
Database Server
http

Key Observations
Queries of the same type take (roughly) the same
time
Different queries differ greatly in execution
time
Any web site has a finite number of query types
Gatekeeper maintains per-query work estimates

10
Service Time Distributions
11
Service Time Distributions
12
TPC-W Execution Times
(note times are in log scale)
13
Estimating System Capacity
Web Server
Dynamic Content Generator
Gate Keeper
Database Server
http

Query execution time load or work units of a
job
Database capacity max work units before
overload
Rough approximation
Unit approximates resource usage
Use binary search to determine capacity
More elaborate methods (adaptive, control
theoretic, etc)

14
Admission Control - Example
15
Scheduling Theory and Practice

Theory SRPT scheduling is best
SRPT shortest remaining processing time
Proven to have minimum response time (Schrage 68)
Perfect prediction of work costs
Pre-emption has zero overhead, does not affect
service time
Practice not so simple
Pre-emption isnt free (context switch costs,
cache affinity)
Priorities and inheritance
Deadlock (e.g., Q1 is holding a lock when
pre-empted)
Gatekeeper
Use shortest job first (SJF) policy
Once a job (query) is admitted, it is never
pre-empted

16
Request Scheduling - Example

(0500) (50010)
1010 ? 505
(010) (10500)
520 ? 260

500
10
10
500
17
Outline

Motivation Background
The Gatekeeper Proxy
Experimental Environment
Software Hardware
Metrics Methodology
Results
Summary and Conclusions

18
Workload Generation
Requests
Responses

Workload generators typically used for
experimental server performance evaluation
Many available for use with static content
WebStone, SPECweb, SURGE, httperf, WaspClient
Only 1 available for e-Commerce TPC-W

19
TPC-W

Transaction Processing Council (TPC-W)
TPC more known for database workloads like TPC-D
Provides specification, not source
Use the implementation from Dynaserver project at
Rice
Models a large e-commerce site Amazon
Web serving, searching, browsing, shopping carts
Secure purchasing (SSL), best sellers, new
products
Customer registration, administrative updates
Persistent data
Static images on Web Server
All others on back-end database

20
TPC-W Snapshot
Image
Promo
Shopping Cart
Next Interaction
21
TPC-W Interactions

14 Interactions, e.g.
Home (read-only query)
Best sellers (complex)
Secure payment (ssl)
Shopping cart (update query)
Workload Mixes
Browsing (95 read-only)
Shopping (80 read-only)
Ordering (50 read-only)

22
TPC-W Queries

SELECT c_uname FROM customer WHERE c_id 10
SELECT i_id, i_title, a_fname, a_lname
FROM item, author, order_line
WHERE item.i_id order_line.ol_i_id
AND item.i_a_id author.a_id
AND order_line.ol_o_id
(SELECT MAX(o_id)-3333 FROM orders)
AND item.i_subject ARTS
GROUP BY i_id, i_title, a_fname, a_lname
ORDER BY SUM(ol_qty) DESC
FETCH FIRST 50 ROWS ONLY

3 ms
4000 ms
23
TPC-W Frequencies
24
Software
Database Server
Web Server
Dynamic Content Generator
http
25
Hardware
Apache Tomcat
MySQL DB2
http
sql
26
Emulated Clients
Emulated Clients
Apache Tomcat
MySQL DB2
http
sql

Remote Browser Emulator
Session duration
Think time
Markov model
Load is a function of the number of clients

27
Experiments

Performance Metrics
Throughput (interactions/minute)
Response time (msec, submission to completion)
Examine each as a function of load ( of clients)
Examine two locking approaches
Locking in the database (slower, more general)
Locking in the application server (faster, less
general)
Methodology
Average of 5 runs
Each run lasts 600 seconds
Measurement starts after 100 second warm-up
90 confidence intervals

28
Outline

Motivation Background
The Gatekeeper Proxy
Experimental Environment
Results
Admission Control
Request Scheduling
Summary and Conclusions

29
Admission Control - Throughput
30
Admission Control - Throughput
31
Admission Control - Explanation

(Captured using systat utility on Linux)
32
Admission Control - Explanation