Load%20Balancing%20in%20Web%20Clusters - PowerPoint PPT Presentation

About This Presentation
Title:

Load%20Balancing%20in%20Web%20Clusters

Description:

The State of the Art in Locally Distributed Web-server ... L. Zhao, Y. Luo, L. Bhuyan and R. Iyer, 'A Network Processor ... LARD (Locality aware request ... – PowerPoint PPT presentation

Number of Views:120
Avg rating:3.0/5.0
Slides: 43
Provided by: Zhiy7
Learn more at: http://www.cs.ucr.edu
Category:

less

Transcript and Presenter's Notes

Title: Load%20Balancing%20in%20Web%20Clusters


1
Load Balancing in Web Clusters
  • CS 213
  • LECTURE 15
  • From IBM Technical Report

2
References
  1. The State of the Art in Locally Distributed
    Web-server Systems by Valeria Cardellini,
    Emiliano Casalicchio, Michele Colajanni and
    Philip S. Yu
  2. L. Zhao, Y. Luo, L. Bhuyan and R. Iyer, A
    Network Processor Based, Content Aware Switch,
    IEEE Micro, Special Issue on High-Performance
    Interconnects, May/June 2006, pp. 72-84.

3
Concepts
  • Web server System
  • Providing web services
  • Trend
  • 1. Increasing number of clients
  • 2. Growing complexity of web applications
  • Scalable Web server systems
  • The ability to support large numbers of
    accesses and resources while still providing
    adequate performance

4
Architecture Solutions
5
Single Node Solution
  • Hardware scale-up
  • expanding a system by adding more resources
  • Software scale-up
  • specific operating system and web server
    software

6
Multiple Nodes Solution
  • Local scale-out
  • Locally distributed Web Systems
  • nodes are deployed at a single network
    location
  • Global scale-out
  • nodes are located at different geographical
    locations

7
Model Architecture
8
Locally Distributed Web System
  • Cluster Based Web System
  • the server nodes mask their IP addresses to
    clients, using a Virtual IP address corresponding
    to one device (web switch) in front of the set of
    the servers Web switch receives all packets and
    then sends them to server nodes
  • Distributed Web System
  • the IP addresses of the web server nodes are
    visible to clients. No web switch, just a layer 3
    router may be employed to route the requests

9
Cluster based Architecture
10
Distributed Architecture
11
Two Approaches
  • Depends on which OSI protocol layer at which the
    web switch routes inbound packets
  • layer-4 switch Determines the target server
    when TCP SYN packet is received. Also called
    content-blind routing because the server
    selection policy is not based on http contents at
    the application level
  • layer-7 switch The switch first establishes a
    complete TCP connection with the client, examines
    http request at the application level and then
    selects a server. Can support sophisticated
    dispatching policies, but large latency for
    moving to application level Also called
    Content-aware switches or Layer 5 switches in
    TCP/IP protocol.

12
(No Transcript)
13
Cluster based architecture Taxonomy
14
Layer-4 two-way architecture
15
Two-Way Routing
  • Both inbound packets and outbound packets to the
    cluster pass through the web switch
  • Each server in the cluster has a unique private
    IP address, visible to the web switch but not to
    clients
  • The web switch rewrites inbound packets by
    changing the VIP address to the target servers
    IP address
  • For the outbound packets, the web switch rewrites
    the source address to its VIP

16
Layer-4 one-way architecture
17
Layer-4 one-way mechanisms
  • The requests pass through the web switch, but
    replies from the servers are sent directly to the
    clients through a separate path. Routing to the
    target server is done in one of following ways.
  • Packet single-rewriting
  • The web switch replaces its VIP address with
    selected servers IP address in each inbound
    packet
  • Packet Tunneling (Packet Encapsulation)
  • Encapsulate IP datagrams within IP datagrams
    (Read from paper)
  • Packet Forwarding
  • web switch rewrite the layer-2 destination
    address to the MAC address of the server (See
    paper)

18
Web Switch or Layer 5/7 Switch or Content Aware
Switch
www.yahoo.com
Internet
Image Server
APP. DATA
TCP
IP
Application Server
Switch
GET /cgi-bin/form HTTP/1.1 Host www.yahoo.com
HTML Server
  • Layer 4 switch
  • Content blind
  • Storage overhead
  • Difficult to administer
  • Content-aware (Layer 5/7) switch
  • Partition the servers database over different
    nodes
  • Increase the performance due to improved hit rate
  • Server can be specialized for certain types of
    request

19
Layer-7 two-way architecture
20
Layer-7 two-way mechanisms
  • TCP gateway
  • An application level proxy running on the web
    switch mediates the communication between the
    client and the server makes separate TCP
    connections to client and server
  • TCP splicing
  • reduce the overhead in TCP gateway. For
    outbound packets, packet forwarding occurs at
    network level by rewriting the client IP address

21
Layer-7 Two-way Mechanisms
  • TCP gateway Application level proxy on the web
    switch mediates the communication between the
    client and the server
  • TCP splicing Reduce the overhead in TCP
    gateway by forwarding directly by OS

user
kernel
user
kernel
22
TCP Splicing
  • Establish connection with the client
  • Three-way handshake
  • Choose the server
  • Establish connection with the server
  • Splice two connections
  • Map the sequence for subsequent packets

Time
Client
Switch
Server
23
Latency on a Linux-based switch
  • Latency is reduced by TCP splicing

24
Layer-7 one-way architecture
25
Layer-7 one-way mechanisms
  • TCP handoff
  • Handoff protocol is layered on top of TCP.
  • The switch hands off the TCP connection
    endpoint to the server
  • TCP connection hop
  • encapsulating the IP packet in an RPX packet
    and sending it to the server.
  • Details of TCP handoff will be covered in
    next class

26
Summary
27
Layer-4 Products
28
Layer 7 products
29
Design Options
  • Option (a) Linux-based switch
  • Overhead of moving data across PCI bus
  • Interrupt or polling still needed
  • Option (b) Put a control processor (CP) in the
    interface to setup connections, and execute
    complicated applications. Data Procesors (DPs)
    process packets for forwarding, classification
    and simple processing
  • But, the CP may have its own protocol stack Ex.
    embedded Linux!
  • Option (c) DPs handle connection setup, splicing
    forwarding But large Code Size is a huge
    problem due to limited instruction memory size of
    the DPs!

30
Latency
31
Throughput
32
Dispatching Algorithms
  • Strategies to select the target server of the web
    clusters
  • Static Fastest solution to prevent web switch
    bottleneck, but do not consider the current state
    of the servers
  • Dynamic Outperform static algorithms by using
    intelligent decisions, but collecting state
    information and analyzing them cause expensive
    overheads
  • Requirements (1) Low computational complexity
    (2) Full compatibility with web standards (3)
    state information must be readily available
    without much overhead

33
(No Transcript)
34
Content blind approach
  • Static Policies
  • Random
  • distributes the incoming requests uniformly
    with equal probability of reaching any server
  • Round Robin (RR)
  • use a circular list and a pointer to the
    last selected server to make the decision
  • Static Weighted RR (For heterogeneous
    severs)
  • A variation of RR, where each server is
    assigned a weight Wi depending on its capacity

35
Content blind approach (Cont.)
  • Dynamic
  • Client state aware
  • static partitioning the server nodes and to
    assign group
  • of clients identified through the clients
    information, such
  • as source IP address
  • Server State Aware
  • Least Loaded, the server with the lowest
    load.
  • Issue Which is the server load index?
  • Least Connection
  • fewest active connection first

36
Content blind approach (Cont.)
  • Server State Aware Contd.
  • Fastest Response
  • responding fastest
  • Weighted Round Robin
  • Variation of static RR, associates each server
    with a dynamically evaluated weight that is
    proportional to the server load
  • Client and server state aware
  • Client affinity
  • instead of assigning each new connection to a
    server only on the basis of the server state
    regardless of any past assignment, consecutive
    connections from the same client can be assigned
    to the same server

37
Considerations of content blind
  • Static approach is the fastest, easy to
    implement, but may make poor assignment decision
  • Dynamic approach has the potential to make better
    decision, but it needs to collect and analyze
    state information, may cause high overhead
  • Overall, simple server state aware algorithm is
    the best choice, least loaded algorithm is
    commonly used in commercial products

38
(No Transcript)
39
Content aware approach
  • Sever state aware
  • Cache Affinity
  • the file space is partitioned among the
    server nodes.
  • Load Sharing
  • . SITEA (Size Interval Task Assignment with
    Equal Load)
  • switch determines the size of the requested
    file and select the target server based on this
    information
  • . CAP (Client-Aware Policy)
  • web requests are classified based on their
    impact on system resources such as I/O bound,
    CPU bound

40
Content aware approach (Cont.)
  • Client state aware
  • Service Partitioning
  • employ specialized servers for certain type
    of requests.
  • Client Affinity
  • using session identifier to assign all web
    transactions from the same client to the same
    server

41
Content aware approach (Cont.)
  • Client and server state aware
  • LARD (Locality aware request distribution)
  • direct all requests to the same web object to
    the same server node as long as its utilization
    is below a given threshold.
  • Cache Manager
  • a cache manager that is aware of the cache
    content of all web servers.

42
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com