SMBL and Blast - PowerPoint PPT Presentation

About This Presentation
Title:

SMBL and Blast

Description:

SMBL and Blast – PowerPoint PPT presentation

Number of Views:22
Avg rating:3.0/5.0
Slides: 14
Provided by: Csw5
Category:
Tags: smbl | blast | hued

less

Transcript and Presenter's Notes

Title: SMBL and Blast


1
SMBL and Blast
  • Joe Rinkovsky
  • Unix Systems Support Group
  • Indiana University

2
Introduction
  • IU has around 2000 Windows PCs in public Student
    Technology Centers
  • Condor is used to harvest unused cycles
  • Simple Message Brokering Library(SMBL) used for
    parallelizing applications on Windows
  • Web portal for user interaction

3
Project History
  • SETI_at_home Was used as initial test of Condor
  • SMBL was created to address the lack of a
    general purpose parallel library on Windows that
    could tolerate sporadically available systems
  • FastDNAml was ported to SMBL
  • Web portal created
  • Other apps ported to SMBL(MEME,BLAST)

4
System Architecture
  • Condor server running on Linux
  • BLAST databases served via Samba on a second
    Linux machine
  • Apache/MySQL/PHP web portal
  • Windows clients

5
What is SMBL?
  • Simple Message Brokering Library
  • Open Source(http//smbl.sf.net)
  • Uses master / worker model
  • Process and Port Manager(PPM) manages SMBL
    servers and master processes
  • Number of master /foreman processes is different
    for each application
  • SMBL workers contact the SMBL master to get work
  • SMBL server terminates workers when they are no
    longer needed

6
Condor and SMBL
  • Condor is used as the scheduling and delivery
    system for SMBL workers
  • SMBL workers contact the SMBL server when they
    start running to begin receiving work.
  • SMBL server seperates the work to be into smaller
    pieces depending on the number of workers
  • Work is redistributed if a worker is lost
  • SMBL server terminates workers when there is no
    work left

7
Applications using SMBL
  • FastDNAml Generates phylogenic trees from
    molecular data
  • MEME Detects patterns in nucleotide and protein
    sequences
  • NCBI BLAST(blastall) Query molecular sequences
    against sequence databases

8
The Challenges of porting BLAST to SMBL
  • BLAST relies on the availability of large
    database files
  • Files too large for efficient delivery via Condor
  • Local copies of databases on pool machines would
    be difficult to manage
  • Sharing DB files via Samba is the best solution
  • Samba was moved to a seperate server to increase
    perfomance

9
The Challenges of porting BLAST to SMBL(cont.)
  • BLAST jobs take more time to complete than
    FastDNAml and MEME
  • Dissapearing worker problem
  • Pool machines would end up in CLAIMED/IDLE state
  • Size of our Condor pool made the problem hard to
    track
  • Only jobs taking more than 30 minutes were
    affected
  • Problem was determined to be state table
    sessions timing out on the machine room
    firewall.
  • Machines were removed from firewall and switched
    to host-based iptables firewall.

10
Web portal
  • Apache/MySQL/PHP based
  • Jobs are submitted via portal ONLY
  • Condor submit files are dynamically generated
    based on user input
  • Status of jobs can be checked using the portal
  • Results retrieved from the portal

11
(No Transcript)
12
(No Transcript)
13
Questions?
Write a Comment
User Comments (0)
About PowerShow.com