Resource Management and Accounting Working Group - PowerPoint PPT Presentation

About This Presentation
Title:

Resource Management and Accounting Working Group

Description:

Initial study on PBS to determine viability of dissection possibilities and ... revision control, documentation, problem tracking, online project schedule... – PowerPoint PPT presentation

Number of Views:30
Avg rating:3.0/5.0
Slides: 18
Provided by: scottmj
Learn more at: https://www.csm.ornl.gov
Category:

less

Transcript and Presenter's Notes

Title: Resource Management and Accounting Working Group


1
Resource Management and Accounting Working Group
  • Working Group Scope and Components
  • Progress made
  • Current issues being worked
  • Next steps
  • Discussions involving larger group

2
Working Group Scope
  • The Resource Management Working Group encompasses
    the areas of resource management, scheduling and
    accounting.
  • This working group will focus on the following
    software components
  • Job Manager(/Queue Manager)
  • Scheduler
  • Allocation Manager (and accounting)
  • Meta Scheduler

3
Proposed Component Architecture
Meta Scheduler
Security System
Information Service
Allocation Manager
Scheduler
Discovery Service
Job/Queue Manager
Collector
Color Key Working Group
Resource Management and Accounting
Execution Management and Monitoring
Node Config and Infrastructure
Node Manager
Process Manager
4
Proposed Component Architecture
Scheduler
Job Management
Node Management
1
4
2
3
PBS server
Queue Manager
Collector
Job Manager
a
b
PBS Mom
Process Manager
Node Monitor
5
Component Interaction DiagramJob submitted to
Queue Manager
User Interface
Node Manager
Meta Scheduler
Job Manager
Allocation Manager
Scheduler
Process Manager
1
2
3
4
5
6
7
8
11
9
10
6
Component Interaction TraceJob submitted to
Queue Manager
  1. A user submits a job to the Queue Manager
  2. The Queue Manager does a sanity balance check
    with the Bank
  3. The Queue Manager notifies the Scheduler that a
    new job has arrived
  4. The Scheduler queries node and job status until
    job can run
  5. A bank reservation is made with the Allocation
    Manager
  6. The Scheduler requests the Queue Manager to run
    the job
  7. The Queue Manager passes job control to the
    Process Manager
  8. The Process Manager notifies Queue Manager of job
    completion
  9. The Queue Manager notifies Scheduler of job
    completion
  10. A bank withdrawal is made with the Allocation
    Manager
  11. The user is notified of job completion

7
General Progress
  • Creation of XML marshaller/unmarshaller
  • Establishment of CVS repository
  • Prototype demonstration Scheduler makes a
    deposit to allocation manager using XML interface

8
Scheduler Progress
  • Creation of SSS Resource Manager interface
    (RMType SSS half-open sockets)
  • Creation of SSS Allocation Manager interface
  • Creation of allocation manager and resource
    manager objects for management of arbitrary
    attributes
  • Integration of XML marshaller/unmarshaller
  • Maui enhancements to link with C libs (Xerxes)
  • Additional regression tests

9
Meta Scheduler Progress
  • Added support for data-staging interface
  • Added support for network proximity optimization
  • Initial support for checkpoint/restart
  • Checkpoint aware statistics
  • Checkpoint aware preemption optimizations
  • Sqsub client created allowing PBS-style jobs to
    be submitted and metascheduled
  • Initial work on translation library (PBS-gtsilver
    silver-gtRS2)
  • Stability enhancements

10
Job Manager Progress
  • Initial job manager specification defined
  • Interacted with process manager working group and
    drafted specification proposals for task manager
    and node manager and how they will interact with
    RMWG components
  • Initial study on PBS to determine viability of
    dissection possibilities and functionality
    enhancements

11
Allocation Manager Progress
  • Draft requirements document underway
  • XML schema version 0.3 reworked to have explicit
    request response elements
  • From scratch allocation manager being used as
    prototype to test XML interface
  • Implemented create, query, modify and delete for
    user, account and membership objects (interacting
    with database over JDBC)

12
Allocation Manager Progress (contd)
  • Stubbed in dummy withdrawal and successfully
    demod XML interface with scheduler (validating
    against schema)
  • Logging, config files, error handling
  • General purpose dcecp-like client allows output
    formatting by utilizing metadata from queries

13
Current Issues
  • Job Manager/Queue Manager as separate or unified
    components
  • How to split up PBS (if at all) and at what
    levels (if any) to refit with XML interface
  • Working with Software Engineering Working Group
    to decide on test framework

14
Next Work
  • All components under CVS
  • Establish initial resource management interface
    specifications for release
  • Scheduler demos by next face-to-face
  • Scheduler to process manager (over XML)
  • Scheduler to node manager (over XML)
  • Scheduler to job manager (over XML)
  • Drive an end-to-end checkpoint request
  • Scheduler talks to registry and discovery service

15
Next Work
  • Job manager/queue manager milestones
  • Submission client submits job to queue manager
    and queue manager reports status to user client
  • Scheduler implements query to obtain job info
    from queue manager
  • Scheduler starts a job (requires implementation
    of task manager interface) also cancel job
  • No prolog, epilog initially. Batch only. Simple
    single-step jobs. Supports polling mode only. No
    data-staging.

16
Next Work
  • Allocation manager
  • Completion of XML schema for remaining
    objects/services
  • Review of requirements (SDSC, NCSA )
  • Complete (1st draft of) initial requirements
  • Implement machine class, allocations,
    reservations, withdrawals, transaction register,
    simple charging algorithm

17
Issues requiring inter-group coordination
  • Need to solidify SSS-wide standards for
    packaging, revision control, documentation,
    problem tracking, online project schedule and
    establish mechanisms and places to home them.
Write a Comment
User Comments (0)
About PowerShow.com