Sun COE Hardware - PowerPoint PPT Presentation

1 / 21
About This Presentation
Title:

Sun COE Hardware

Description:

Information Server and Register ... EZ-Grid Register. No existing mechanism for automated resource discovery. Register keeps track of availability information ... – PowerPoint PPT presentation

Number of Views:51
Avg rating:3.0/5.0
Slides: 22
Provided by: barbara179
Category:
Tags: coe | hardware | register | sun

less

Transcript and Presenter's Notes

Title: Sun COE Hardware


1
Sun COE Hardware
  • 3 Sun Fire 6800s (1 x 24 CPUs, 2 x 12 CPUs)
  • Ultra Sparc III 750 Mhz, Compute servers
  • 2 Sun Fire V880s (4 x 8 CPUs, 7 x 4 CPUs)
  • Ultra Sparc III 750 Mhz, Compute servers
  • 2 Sun Blade 1000s (2 x 12 CPUs, 4 x 2 CPUs)
  • Ultra Sparc III 750 Mhz, Compute servers
  • SunBlade 1000 login node, SunBlade 100 Admin node
  • StorEdge Disk array of 4 TB storage
  • Myrinet and Gigabit networking

2
Major Software on Galaxy Cluster
  • Solaris 8 standard UNIX tools
  • Sun's MPI / ClusterTools
  • Open-source seismic / geoscience libraries
  • Some proprietary Geoscience programs
  • Forte 6u2 compilers
  • Desired
  • Grid Related
  • Globus 1.1.4, EZ-Grid system, Sun Grid Engine
    Enterprise Edition
  • KAI compilers, Totalview debugger
  • vampir ( vampirtrace) profiler

3
Sun Cluster at HPCC
4
EZ-Grid A Gateway to Grids
Dr. Barbara Chapman Babu Sundaram Kiran
Thyagaraja Noah Yan Satheesh Kolathur Hari
Krishna Donepudi
5
Major Objectives of the Project
  • To realize easier setting up / managing and
    effective usage of Grid environments through the
    following services
  • Generic brokerage services
  • Usage policy management frameworks that allow
    precise control over grid resources
  • User interfaces for easier and intuitive
    interaction with grids
  • Implement services that address the deficiencies
    in existing systems (dynamic information mgmt)

EZ-Grid Broker Kernel (Matchmaker)
EZ-Grid Policy Engine
EZ-Grid Interface for job submission / Resource
info viewing
EZ-Grid Information Server Register
6
Overview of the system
  • Existing work
  • User interfaces for job submission/monitoring,
    info viewing
  • Policy Engine framework, Basic brokerage services
  • Information services Register, EZ-Grid Server
  • GSI-FTP and GASS based transparent file transfer
  • Work in Progress
  • Tighter integration with Sun Grid Engine software
  • Enable direct job submission (circumventing GRAM)
  • Information gathering, Load/queue length
    prediction
  • Brokerage algorithm examination and
    experimentation
  • Campus Grid activities, Sun Center Of Excellence
    work

7
System details
  • Implemented in Java for platform independence
  • XML for all information representation
  • Globus for middleware services (GSI, GRAM?)
  • Interfaced with Globus thro CoG kit
  • Services offered
  • Resource management
  • Matchmaking
  • Information
  • Security
  • Job submission and monitoring

8
Interaction with other tools
Applications and EZ-Grid Services
CoG Kit libraries interface
Globus toolkit services
Sun Grid Engine
Underlying Grid resources
9
System Architecture
Remote resources
EZ-Grid Server Resource A
EZ-Grid Server Resource B
EZ-Grid Server Resource C
EZ-Grid Register
Client machine
Interface Broker Policy Engine
10
Brokering
  • Brokering phases examines
  • Resource sufficiency (use resource information
    gathered)
  • Resource usage policy evaluation (Policy Engine)
  • Run-time prediction based on History Info
    (algorithms exist!)
  • Network latency and bandwidth (NWS)
  • Queue load prediction (interfaces with schedulers
    like SGE)
  • These tasks should ultimately result in ranking
    of resources (indicating suitability for this
    job submission)

11
Current work in Brokering
  • Algorithms for history predictions, refining
  • Services for queue load prediction, study related
    algorithms such as those used in NWS
  • Ranking of resources and normalizing ranks for
    evaluating resources comparatively
  • Iterating and evaluating the various algorithms
    based on our experiences with Campus grid and the
    Center of excellence

12
Policy Engine
  • This model consists of two aspects
  • Services and utilities for admins to specify
    usage policies, validate job requests from users
  • Tools for users (broker) to examine and evaluate
    remote usage policies while making resource
    choices
  • XML used a policy language (attribute-value
    pairs), JAXP for parsing

13
Uses of Policy Engine
  • Resource Provider Perspective
  • Utilities (GUI) to express usage
    permissions/restrictions in XML
  • Can enforce global / user-level policies
  • Policy files are self-contained they include
    associated rules too
  • Authorization services to validate job requests
    against these policies before handing them over
    to GRAM
  • User Perspective
  • Transparent service to fetch, examine and
    evaluate usage policies
  • GSI-FTP or GASS used for policy file transfer
  • Local caching for efficiency (modifications
    checked for though)
  • Cannot circumvent local PE additional level of
    authorization done remotely

14
EZ-Grid User Interface
15
EZ-Grid User Interface
16
Information Server and Register
  • Existing information services do not address
    dynamic information gathering and mgmt (critical
    to brokerage)
  • EZ-Grid Info Server supports
  • Application Profile information, User Profiles
  • Information on history of job submissions
  • Queue status information (Queue length, loads)
  • Interfaces with local schedulers like SGEEE
  • Network bandwidth information (Planned)
  • EZ-Grid Register
  • No existing mechanism for automated resource
    discovery
  • Register keeps track of availability information
    about Grid resources
  • EZ-Grid servers are configured to send periodic
    hello messages
  • On-demand, EZ-Grid users and broker can query for
    a list of available grid resources

17
Interaction with Sun Grid Engine
  • We want to tightly integrate the EZ-Grid system
    with the Grid Engine software
  • Major benefits include the following
  • Dynamic Queue information gathering
  • Queue length and Load prediction
  • Direct Job submission interfaces to circumvent
    GRAM where possible
  • Precise job control and monitoring
  • Checkpointing and migration across clusters??

18
Campus Grid Activities
EZGrid interface
HPCTools
Geosciences
Globus
SGE
Mechanical Engg
HPCC
EZGrid Broker
19
Future work
  • User Interface to be updated after getting
    feedback from pilot users
  • Direct Interfacing with Grid Engine
  • Brokering for better history, network and queue
    analysis
  • Web access to EZ-Grid and web services

20
Conclusions
  • Setting up, managing and accessing Grids must be
    made easier for the user
  • Generic resource brokers will be of immense help
    for efficient use of Grid resources
  • Usage policies must be strictly enforced
  • Sun Grid Engine software provides a powerful tool
    for building campus grids such as the one at UH
  • Bottom-up approach towards Grids is good,
    campus grids, state-wide grids, national grids..

21
For more information
  • www.cs.uh.edu/ezgrid
  • www.cs.uh.edu/hpctools
  • Dr. Chapman chapman_at_cs.uh.edu
Write a Comment
User Comments (0)
About PowerShow.com