Updates on JSS Job Submission Service - PowerPoint PPT Presentation

About This Presentation
Title:

Updates on JSS Job Submission Service

Description:

Updates on JSS Job Submission Service Massimo Sgaravatto Marco Verlato INFN Padova JSS Started implementing the process for parsing the job log file Cancelling the ... – PowerPoint PPT presentation

Number of Views:45
Avg rating:3.0/5.0
Slides: 6
Provided by: MassimoSg4
Category:

less

Transcript and Presenter's Notes

Title: Updates on JSS Job Submission Service


1
Updates on JSS Job Submission Service
  • Massimo Sgaravatto
  • Marco Verlato
  • INFN Padova

2
JSS
  • Started implementing the process for parsing the
    job log file
  • Cancelling the job if failures gt RetryCount
  • Mail to UserContact when job starts running
  • Notifying the RB (JOB_DONE, JOB_CANCELLED,
    JOB_ABORTED)
  • Notifying the LB service (JSSTransferEvent,
    JSSJobDone, JSSJobAborted)

3
JSS
  • Missing Condor-G functionalities hopefully
    delivered by the end of next week
  • Info about failure of a submission to a Globus
    resource missing in the job log file (only
    present in the gridmanager log file)
  • Necessary if we want to exploit the
    libcondorapi.a to parse the log file
  • Info about when a job has been successfully sent
    to a Globus resource missing
  • Only the events submitted, running and
    completed are recorded
  • Necessary to notify the LB service
  • New gridmanager able to exploit the persistent
    jobmanager
  • New reliable (two phase commit) submission
    protocol

4
Refresh of user proxy
  • Agreed with Condor team on
  • New command (e.g.
  • condor_refresh_proxy ltcondor-job-idgt
    ltnew-proxy-filegt
  • The problem of forwarding the new fresh proxy
    to the Globus jobmanager is addressed by killing
    the jobmanager and restarting a new one
  • Not the ideal solution, but it should be
    easier/faster to implement than adding proxy
    refresh functionality to Globus
  • Should be ready by mid of August

5
What is needed
  • Assumption
  • RB doesnt need user proxy at PM9
  • CESNET variant 4 ?
  • MyProxy server
  • Running in the RB/JSS machine ?
  • Long-term limited proxy moved from UI to RB/JSS
    machine
  • Do we know how to generate long term limited
    proxy ?
  • What should be changed in the UI ?
  • JSS before submitting a job
  • Retrieve full short time proxy from MyProxy
    server, using the long-term limited proxy
    (without using password)
  • What is necessary to modify for not using
    password ? Who ?
  • Polling by JSS, and for proxies that are being
    expired
  • Retrieve full short time proxy from MyProxy
    server, using the long term limited proxy
    (without using password)
  • Issue condor_refresh_proxy
Write a Comment
User Comments (0)
About PowerShow.com