Globus Virtual Workspaces An Update - PowerPoint PPT Presentation

1 / 28
About This Presentation
Title:

Globus Virtual Workspaces An Update

Description:

Separation of job scheduling and resource management ... Fermi. VWS/EC2. BNL. Running jobs : 230. Running jobs : 150. Running jobs : 50. Running jobs : 150 ... – PowerPoint PPT presentation

Number of Views:27
Avg rating:3.0/5.0
Slides: 29
Provided by: Office2004237
Category:

less

Transcript and Presenter's Notes

Title: Globus Virtual Workspaces An Update


1
Globus Virtual WorkspacesAn Update
  • SC 2007, Reno, NV
  • Kate Keahey
  • Argonne National Laboratory
  • University of Chicago
  • keahey_at_mcs.anl.gov

2
Motivation and Background
3
Why Virtual Workspaces?
  • Quality of Service
  • We get batch-style provisioning
  • One size fits all
  • Side-effect of job scheduling
  • We need advance reservations, urgent computing,
    periodic, best-effort, and others
  • Separation of job scheduling and resource
    management
  • E.g. workflow-based apps and batch apps have
    different needs
  • Quality of Life
  • We have a 100 nodes we cannot use
  • Complex applications
  • Hard to install
  • Require validation
  • Separation of environment preparation and
    resources leasing

4
What are Virtual Workspaces?
  • A dynamically provisioned environment
  • Environment definition we get exactly the
    (software) environment we need on demand.
  • Resource allocation Provision the resources the
    workspace needs (CPUs, memory, disk, bandwidth,
    availability), allowing for dynamic renegotiation
    to reflect changing requirements and conditions.
  • Implementation
  • Traditional means publishing, automated
    configuration, coarse-grained enforcement
  • Virtual Machines encapsulated configuration and
    fine-grained enforcement

Paper Virtual Workspaces Achieving Quality of
Service and Quality of Life in the Grid
5
Virtual Machines
Parallels
App
App
App
App
App
Xen
Guest OS (Linux)
Guest OS (NetBSD)
Guest OS (Windows)
VMWare
UML
Virtual Machine Monitor (VMM) / Hypervisor
KVM
Hardware
etc.
  • Bring your environment with you
  • Fast to deploy, enables short-term leasing
  • Excellent enforcement, performance isolation
  • Very good isolation

6
Globus Virtual WorkspacesHow Do They Work?
7
Virtual Workspaces Vital Stats
  • The GT4 Virtual Workspace Service (VWS) allows an
    authorized client to deploy and manage workspaces
    on-demand.
  • GT4 WSRF front-end (one per site)
  • Leverages GT core and services, notifications,
    security, etc.
  • Follows WS-Agreement provisioning model
  • Currently implements workspaces as Xen VMs
  • Other implementations could also be used
  • Implements multiple deployment modes
  • Best-effort, leasing, etc.
  • Current release 1.3 (November 07)
  • Globus incubator project
  • More information at http//workspace.globus.org

8
Deploying WorkspacesRemotely
Pool node
Pool node
Pool node
VWS Service
Pool node
Pool node
Pool node
  • Workspace
  • Workspace metadata
  • Pointer to the image
  • Logistics information
  • Deployment request
  • CPU, memory, node count, etc.

Pool node
Pool node
Pool node
Pool node
Pool node
Pool node
9
Interacting with Workspaces
The workspace service publishes information on
each workspace as standard WSRF
Resource Properties.
Pool node
Pool node
Pool node
VWS Service
Pool node
Pool node
Pool node
Users can query those properties to find
out information about their workspace (e.g. what
IP the workspace was bound to)
Pool node
Pool node
Pool node
Pool node
Pool node
Pool node
Users can interact directly with their workspaces
the same way the would with a physical machine.
Trusted Computing Base (TCB)
10
Workspace Service Components
Workspace WSRF front-end that allows clients to
deploy and manage virtual workspaces
VWS Service
Pool node
Pool node
Pool node
Workspace back-end
Pool node
Pool node
Pool node
Resource manager for a pool of physical
nodes Deploys and manages Workspaces on the
nodes
Pool node
Pool node
Pool node
Each node must have a VMM (Xen)? installed, as
well as the workspace control program that
manages individual nodes
Pool node
Pool node
Pool node
Contextualization creates a common context for a
virtual cluster
Trusted Computing Base (TCB)
11
Workspace Service Components
  • GT4 WSRF front-end
  • Leverages GT core and services, notifications,
    security, etc.
  • Follows the OGF WS-Agreement provisioning model
  • Publishes available lease terms
  • Provides lease descriptions
  • Workspace Service back-end
  • Currently focused on Xen
  • Works with multiple Resource Managers
  • Workspace Control
  • Contextualization
  • Put the virtual appliance in its deployment
    context

12
Managing Resources with Virtual Workspaces
13
Workspace Back-Ends
  • Default resource manager (basic slot fitting)
  • Commercial datacenter technology would also fit
  • Challenge finding Xen-enabled resources
  • Amazon Elastic Compute Cloud (EC2)
  • Selling cycles as Xen VMs
  • Software similar to Workspace Service
  • No virtual clusters, contextualization,
    fine-grain allocations, etc.
  • Solution develop a back-end to EC2
  • Grid credential admission -gt EC2 charging model

14
Virtual Workspaces for STAR
  • STAR image configuration
  • A virtual cluster composed of an OSG headnode and
    STAR worker nodes
  • Using the workspace service over EC2 to provision
    resources
  • Allocations of up to 100 nodes
  • Dynamically contextualized for out-of-the-box
    cluster

15
with thanks to Jerome Lauret and Doug Olson of
the STAR project
Running jobs 230
Running jobs 150
Running jobs 150
Running jobs 142
Running jobs 124
Running jobs 109
Running jobs 94
Running jobs 73
Running jobs 42
Running jobs 0
VWS/EC2
BNL
Running jobs 300
Running jobs 300
Running jobs 300
Running jobs 282
Running jobs 243
Running jobs 221
Running jobs 195
Running jobs 140
Running jobs 76
Running jobs 0
WSU
Fermi
Running jobs 150
Running jobs 200
Running jobs 195
Running jobs 183
Running jobs 152
Running jobs 136
Running jobs 96
Running jobs 54
Running jobs 37
Running jobs 0
Running jobs 50
Running jobs 50
Running jobs 42
Running jobs 39
Running jobs 34
Running jobs 27
Running jobs 21
Running jobs 15
Running jobs 9
Running jobs 0
PDSF
Job Completion
File Recovery
16
with thanks to Jerome Lauret and Doug Olson of
the STAR project
with thanks to Jerome Lauret and Doug Olson of
the STAR project
Nersc PDSF
EC2 (via Workspace Service)
WSU
Accelerated display of a workflow job state Y
job number, X job state
17
Workspace Back-Ends
  • Default resource manager (basic slot fitting)
  • Commercial datacenter technology would also fit
  • Challenge finding Xen-enabled resources
  • Amazon Elastic Compute Cloud (EC2)
  • Selling cycles as Xen VMs
  • Software similar to Workspace Service
  • No virtual clusters, contextualization,
    fine-grain allocations, etc.
  • Grid credential admission -gt EC2 charging model
  • Solution develop a back-end to EC2
  • Challenge integrating VMs into current
    provisioning models
  • Solution gliding in VMs with the Workspace Pilot

18
Providing Resources The Workspace Pilot
  • Challenge find the simplest way to integrate VMs
    into current provisioning models
  • Glide-ins (Condor) poor mans resource leasing
  • Best-effort semantics submit a job pilot that
    claims resources but does not run a job
  • The Workspace Pilot
  • Resources booted to dom0
  • Pilot adjusts memory
  • VWS leases slots to VMs
  • Functional closure kill-all facility, etc.

19
Workspace Control
  • VM control
  • Starting, stopping etc.
  • To be replaced by Xen API
  • Integrating into the network
  • Assigning MAC addresses and IP addresses
  • DHCP Delivery tool
  • Building up a trusted networking layer
  • VM image propagation
  • Image management and reconstruction
  • creating blank partitions
  • Talks to the workspace service via ssh
  • To be replaced

20
Workspace Back-Ends
  • Default resource manager (basic slot fitting)
  • Commercial datacenter technology would also fit
  • Challenge finding Xen-enabled resources
  • Amazon Elastic Compute Cloud (EC2)
  • Selling cycles as Xen VMs
  • Software similar to Workspace Service
  • No virtual clusters, contextualization,
    fine-grain allocations, etc.
  • Grid credential admission -gt EC2 charging model
  • Solution develop a back-end to EC2
  • Challenge integrating VMs into current
    provisioning models
  • Solution gliding in VMs with the Workspace Pilot
  • Long-term solutions
  • Interleaving soft and hard leases
  • Providing better articulated leasing models
  • Developed in the context of existing schedulers

21
So -- youve deployed some VMs Now What?
  • Do they have public IP addresses? Do they
    actually represent something useful?(BTW, I need
    an OSG cluster) Can the VMs find out about each
    other? Can they share storage? How do they
    integrate into the site storage/account system?
    Do they have host certificates? And gridmapfile?
    And all the other things that will integrate them
    into my VO?

22
Virtual Clusters
  • Challenge what is a virtual cluster?
  • A more complex virtual machine
  • Networking, shared storage, etc. that will be
    portable across sites and implementations
  • Available at the same time and sharing a common
    context
  • Example
  • A set of worker nodes with some edge services in
    front and NFS-based shared storage
  • Solution management of ensembles and sharing
  • Configurable cluster deployment
  • A set of worker nodes
  • A few Edge Services enabling access to those
    nodes
  • Exporting and sharing a common context
  • Configuring and joining context
  • Networking
  • Edge Services have public IPs
  • Worker nodes are on a private network shared with
    the Edge Services

Paper Virtual Clusters for Grid Communities,
CCGrid 2006
23
Contextualization
  • Challenge Putting a VM in the deployment context
    of the Grid, site, and other VMs
  • Assigning and sharing IP addresses, name
    resolution, application-level configuration, etc.
  • Solution Management of Common Context
  • Configuration-dependent
  • providesrequires
  • Common understanding between the image vendor
    and deployer
  • Mechanisms for securely delivering the required
    information to images across different
    implementations

contextualization agent
Common Context
IP hostname pk
Paper A Scalable Approach To Deploying And
Managing Appliances, TeraGrid conference 2007
24
Where do VM images come from?
25
Appliance Management
  • Short term solution Marketplaces
  • The Workspace Marketplace
  • http//workspace.globus.org/vm/marketplace.html
  • Providing described images for scientific
    community Appliance providers and marketplaces
  • Long-term solution Appliance Providers
  • Automated image production, attestation and
    signing
  • Automated management
  • Collaboration with configuration management
    communities and projects
  • rPath company the rBuilder project (DOE SBiR)
  • Bcfg2, adopted on many ANL resources
  • Osfarm _at_ CERN OpenLab, serving the scientific
    community Appliance providers

26
Workspace Ecosystem
27
Parting Thoughts
  • VMs are the raw materials from which a working
    system can be built
  • But we still have to build it!
  • Technical challenges taking one step at a time
  • Social/procedural challenges
  • Division of labor
  • Resource providers
  • Appliance providers
  • Can we build trust between these two groups?
  • If you think we can help you out, give us a call
  • http//workspace.globus.org

28
Acknowledgements
  • Workspace team
  • Kate Keahey
  • Tim Freeman
  • Borja Sotomayor
  • Funding
  • NSF SDCI Missing Links
  • NSF CSR Virtual Playgrounds
  • DOE CEDPS Project
  • With thanks to many collaborators
  • Jerome Lauret (STAR, BNL), Doug Olson (STAR,
    LBNL), Marty Wesley (rPath), Stu Gott (rPath),
    Ken Van Dine (rPath), Predrag Buncic (Alice,
    CERN), Haavard Bjerke (CERN), Rick Bradshaw
    (Bcfg2, ANL), Narayan Desai (Bcfg2, ANL), Duncan
    Penfold-Brown (Atlas,uvic), Ian Gable (Atlas,
    uvic), David Grundy (Atlas, uvic), Ti Leggit
    (University of Chicago), Greg Cross (University
    of Chicago), Mike Papka (University of
    Chicago/ANL)
Write a Comment
User Comments (0)
About PowerShow.com