ASM 2003, slide 1 - PowerPoint PPT Presentation

About This Presentation
Title:

ASM 2003, slide 1

Description:

ASM 2003, 1. Communication Pattern Based Node Selection for Shared Networks ... communication pattern. synchronization pattern. CPU usage. memory usage ... – PowerPoint PPT presentation

Number of Views:43
Avg rating:3.0/5.0
Slides: 13
Provided by: camp203
Learn more at: https://www2.cs.uh.edu
Category:
Tags: asm | pattern

less

Transcript and Presenter's Notes

Title: ASM 2003, slide 1


1
Communication Pattern Based Node Selection for
Shared Networks
Srikanth Goteti Interactive Data Corp Jaspal
Subhlok University of Houston AMS Symposium 2003
2
Resource Selection for Network/Grid Applications
Model
Data
GUI
Sim 1
Pre
Stream
Application
?
where is the best performance
Network
3
Current Approaches to Node Selection
Model
Data
GUI
Sim 1
Pre
Stream
  • Measure and model network properties, such as
    available bandwidth and CPU loads (with tools
    like NWS)
  • Find best nodes for execution based on network
    status
  • But expected application performance based on
    measured network status may not be accurate
  • depends on application characteristics
  • translation, e.g., unused bandwidth vs expected
    throughput
  • data may be stale as frequent measurements are
    expensive

4
Performance Skeleton
  • Performance Skeleton is a synthetic short running
    program whose execution characteristics mirror
    the application it represents
  • An application and its skeleton have similar
  • communication pattern
  • synchronization pattern
  • CPU usage
  • memory usage
  • Goal Performance of a skeleton is directly
    related to the performance of the application
    under any condition
  • e.g., a skeleton executes in .1 of the time the
    application takes to execute on any part of a
    shared network

5
Node Selection with Performance Skeletons
Model
Data
GUI
Sim 1
Pre
Stream
Select candidate node sets based on network status
Execute the skeleton on them
Select the node set with best skeleton
performance to schedule actual application
6
Node Selection Procedure
  • Construct a performance skeleton
  • mostly by hand in this paper, subject of ongoing
    work
  • Select candidate node sets
  • identify the communication graph of the
    application
  • typically a chain, ring or all-all structure
  • obtain available bandwidth between nodes with NWS
    (Network Weather Service) and build a graph
  • select nodes to maximize the minimum available
    bandwidth between pairs of communicating nodes
  • best possible node sets based on application
    structure and network status
  • Execute the skeleton on each candidate node set
  • Select the node set with best skeleton
    performance, map one process to each node

7
Communication Structure of NAS Benchmarks
1
1
0
0
2
3
3
2
BT
IS
1
1
1
0
0
0
2
2
2
3
3
3
LU
MG
SP
1
0
2
3
EP
8
(No Transcript)
9
Validation Experiments
  • Best nodes to execute benchmarks selected by
    each of the following methods
  • skeleton based full framework discussed
  • all to all based on maximizing the minimum
    available bandwidth between on the network graph
  • random
  • compare performance of the application on nodes
    selected by each of these procedures on a busy
    network
  • Experiments repeated a large number of times to
    get statistically meaningful results

10
Experimental Framework
  • Linux cluster of 10 dual CPU 1.7GHz Pentium nodes
    connected by 100 MHz links and crossbar switch
  • experiments with Class B NAS MPI benchmark suite
  • Class W NAS benchmarks (avrg runtime 1.5 seconds
    on our cluster) used as skeletons for class B
    benchmarks
  • available bandwidth between nodes is varied with
    Linux iproute2 for the duration of experiments as
    follows
  • path between a pair of nodes is shared by S
    streams
  • i.e., available bandwidth is set to 1/S of peak
  • one stream is randomly added to or removed from
    the cluster every 30 seconds

11
Performance Results slowdown due to network
traffic
  • skeleton based has average slowdown of 20,
    versus 40 for random and 27 for all to all
  • significant variation across benchmarks, most
    benefit for CG it is communication heavy and
    uses only 3 links

12
Conclusions type slide
  • Performance skeletons have a role in resource
    management for grids
  • removes limitations of using NWS type systems
    (what you measure versus what you get problem)
  • A lot more experimentation is needed to establish
    and validate the concepts
  • Automatic construction of performance skeletons
    is a major open challenge
  • Skeletons may have other uses a fast way of
    estimating the performance of an application
  • e.g. on a slow simulated future system
Write a Comment
User Comments (0)
About PowerShow.com