1
Nomad: Migrating OS-bypass Networks in Virtual Machines
  • Wei Huang, M. Koop, D. K. Panda
  • The Ohio State University
  • J. Liu, B. Abali
  • IBM T.J. Watson Research Center
  • In Proc. VEE '07

2
Outline
  • Intro
  • Background
  • Challenges
  • Design
  • Implementation
  • Evaluation
  • Related Work
  • Q&A

3
1. Intro
  • Intro
  • Importance of VMs: performance, scalability,
    management
  • Low-latency, high-bandwidth networks (InfiniBand,
    Myrinet, Quadrics) -> OS bypass, RDMA -> suitable
    for cluster environments
  • Bad for migration
  • Intelligent NICs manage location-dependent
    resources
  • Apps in cluster environments expect reliable
    service, so migration must be transparent to
    apps.

4
1. Intro (cont.)
  • Our goal
  • Address the migration problem of modern OS-bypass
    interconnects.
  • Target: cluster environments, i.e. very tightly
    coupled systems with stringent communication
    performance requirements
  • Three parts
  • User-level communication library: suspend and
    resume communication, namespace virtualization.
  • Modified guest OS driver: free and reallocate
    communication resources, namespace
    virtualization.
  • Framework: a coordinator in the privileged
    domain and a central server.

Xiang Xiaojia
Department of Computer Science Slide 4
5
2. Background
  • OS-bypass I/O
  • I/O handled inside the OS imposes overhead
  • Context switches between user and kernel
  • Extra data copies
  • User-level communication
  • User processes execute frequent, time-critical
    operations, such as I/O communication

6
2. Background (cont.)
  • OS-bypass I/O
  • InfiniBand Architecture
  • Queue pairs (QP), completion queues (CQ)
  • Buffer keys
  • Initiating data transmission
  • Insert a work request into the QP
  • Ringing a doorbell
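
The two transmission steps above can be sketched as a toy model. This is only an illustration: `ToyQueuePair`, `WorkRequest`, `post_send`, `ring_doorbell` and `poll_cq` are hypothetical names for this sketch, not the InfiniBand verbs API, and the "HCA" is simulated by a plain loop.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class WorkRequest:
    wr_id: int
    payload: bytes


@dataclass
class ToyQueuePair:
    """Toy stand-in for a QP plus its CQ; not a real verbs object."""
    send_queue: deque = field(default_factory=deque)
    completion_queue: deque = field(default_factory=deque)

    def post_send(self, wr: WorkRequest) -> None:
        # Step 1: the user process inserts a work request into the QP.
        self.send_queue.append(wr)

    def ring_doorbell(self) -> int:
        # Step 2: ringing the doorbell tells the (simulated) HCA to drain
        # the send queue; each processed request yields one completion.
        processed = 0
        while self.send_queue:
            wr = self.send_queue.popleft()
            self.completion_queue.append(wr.wr_id)
            processed += 1
        return processed

    def poll_cq(self):
        # Return the next completed wr_id, or None if the CQ is empty.
        if self.completion_queue:
            return self.completion_queue.popleft()
        return None


qp = ToyQueuePair()
qp.post_send(WorkRequest(wr_id=1, payload=b"hello"))
qp.post_send(WorkRequest(wr_id=2, payload=b"world"))
processed = qp.ring_doorbell()
completions = [qp.poll_cq(), qp.poll_cq(), qp.poll_cq()]
```

Note that nothing in this path goes through the kernel, which is the point of OS bypass and also why the kernel cannot easily pause it.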

7
2. Background (cont.)
  • Xen
  • Hypervisor: lowest level
  • Split device driver model
  • Frontend: guest OS
  • Backend: IDD (isolated device domain)
  • Migration
  • Frontend: suspend / resume callbacks
  • Backend: does the actual work
  • IP and MAC addresses move with the OS
  • Lost or out-of-order IP packets are handled by
    the TCP layer

8
2. Background (cont.)
  • Direct I/O
  • Backend module: proxy
  • Creates virtual access points
  • Coordinates access among VMs

9
3. Challenges
  • Environment

10
3. Challenges (cont.)
  • Location-dependent resources
  • Opaque handles -> HCA resources
  • Migration -> invalid handles
  • IB port address: LID
  • A LID can be shared by VMs
  • Can't be changed during migration
  • QPNs, CQNs

11
3. Challenges (cont.)
  • User-level communication
  • Opaque handles are cached
  • Memory keys and QPNs can be cached anywhere
  • RDMA needs some handles to be cached at remote peers
  • Hard to suspend communication from the kernel
  • User-level direct communication

12
3. Challenges (cont.)
  • Hardware-managed connection state
  • Hardware stores connection state information
  • No OS stack processing, good performance
  • No easy way to migrate hardware connection state
  • The hardware can't recover packets dropped
    during migration.
  • Dropped or out-of-order packets may cause fatal
    errors

13
4. Design
14
4. Design (cont.)
  • Location-dependent resources
  • Opaque handles
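
One way to picture namespace virtualization for opaque handles is an indirection table: the application keeps a stable virtual handle, and only the table entry changes when resources are reallocated on the new host. The sketch below is an assumption-laden illustration, not the authors' implementation; `HandleTable`, `register`, `resolve`, `rebind` and the handle values are all hypothetical.

```python
import itertools


class HandleTable:
    """Maps stable virtual handles to location-dependent physical ones."""

    def __init__(self):
        self._next_virtual = itertools.count(1)
        self._physical = {}

    def register(self, physical_handle: int) -> int:
        # Hand the application a virtual handle instead of the real one.
        virtual = next(self._next_virtual)
        self._physical[virtual] = physical_handle
        return virtual

    def resolve(self, virtual_handle: int) -> int:
        # Translate on every use, so cached virtual handles stay valid.
        return self._physical[virtual_handle]

    def rebind(self, virtual_handle: int, new_physical: int) -> None:
        # After migration, point the same virtual handle at the resource
        # that was reallocated on the destination HCA.
        if virtual_handle not in self._physical:
            raise KeyError(virtual_handle)
        self._physical[virtual_handle] = new_physical


table = HandleTable()
v_qp = table.register(0x1A2B)      # made-up handle on the source host
before = table.resolve(v_qp)
table.rebind(v_qp, 0x9F00)         # made-up handle on the destination host
after = table.resolve(v_qp)
```

The application-visible value `v_qp` never changes, which is what keeps migration transparent to code that has cached it.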

15
4. Design (cont.)
  • Location-dependent resources

16
4. Design (cont.)
  • Location-dependent resources
  • Memory keys

17
4. Design (cont.)
  • User-level communication
  • Communication is intercepted in the library
  • Example
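
The slide's own example is not preserved in this transcript. As a stand-in, here is a minimal sketch of what interception in a user-level library could look like, assuming the library defers sends issued while communication is suspended and replays them on resume. `InterceptingLibrary` and its methods are hypothetical names, and the transport is just a Python callable.

```python
class InterceptingLibrary:
    """Wraps a send function so communication can be paused and resumed."""

    def __init__(self, transport_send):
        self._transport_send = transport_send
        self._suspended = False
        self._deferred = []

    def send(self, message) -> bool:
        # Every send goes through the library, so it can be held back
        # without any help from the kernel.
        if self._suspended:
            self._deferred.append(message)
            return False
        self._transport_send(message)
        return True

    def suspend(self) -> None:
        self._suspended = True

    def resume(self) -> int:
        # Replay deferred sends in their original order.
        self._suspended = False
        pending, self._deferred = self._deferred, []
        for message in pending:
            self._transport_send(message)
        return len(pending)


wire = []
lib = InterceptingLibrary(wire.append)
lib.send("a")
lib.suspend()
lib.send("b")
lib.send("c")
wire_during_suspend = list(wire)
replayed = lib.resume()
```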

18
4. Design (cont.)
  • Connection state
  • Method: bring the connection (QP) state to a
    deterministic state
  • 1. Mark all QPs suspended -> no in-flight packets
    originating from this VM
  • 2. Send a suspend request to all connected VMs
  • 3.
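
Steps 1 and 2 can be sketched as follows; step 3 is not preserved in this transcript, so the sketch stops after the requests are sent. `ToyVM`, `begin_suspend` and the `"SUSPEND_REQ"` message name are hypothetical, and the peer's reaction (marking its own QP to the migrating VM as suspended) is an assumption made for illustration.

```python
class ToyVM:
    """Toy VM that tracks one QP per connected peer."""

    def __init__(self, name):
        self.name = name
        self.qp_suspended = {}   # peer name -> is the QP suspended?
        self.inbox = []

    def connect(self, peer) -> None:
        self.qp_suspended[peer.name] = False
        peer.qp_suspended[self.name] = False

    def receive(self, kind, sender) -> None:
        self.inbox.append((kind, sender))
        if kind == "SUSPEND_REQ":
            # Assumption: the peer stops sending to the migrating VM.
            self.qp_suspended[sender] = True


def begin_suspend(migrating, peers_by_name):
    # Step 1: mark all local QPs suspended, so no new packets
    # originate from the migrating VM.
    for peer_name in list(migrating.qp_suspended):
        migrating.qp_suspended[peer_name] = True
    # Step 2: send a suspend request to every connected VM.
    notified = []
    for peer_name in sorted(migrating.qp_suspended):
        peers_by_name[peer_name].receive("SUSPEND_REQ", migrating.name)
        notified.append(peer_name)
    return notified


a, b, c = ToyVM("a"), ToyVM("b"), ToyVM("c")
a.connect(b)
a.connect(c)
notified = begin_suspend(a, {"b": b, "c": c})
```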

19
4. Design (cont.)
  • Unreliable Datagram (UD) service
  • Only the address data structure needs to be handled

20
5. Implementation
  • Architecture: Xen IB

21
5. Implementation (cont.)
  • Migrating
  • Optimization
  • QP: active vs. inactive

22
6. Evaluation
  • Setup
  • Benchmarks
  • IB verbs layer micro-benchmarks
  • HPC benchmarks: NAS Parallel Benchmarks (NPB) on
    MVAPICH (MPI implementation over IB)

23
6. Evaluation (cont.)
  • Micro-benchmark: Perftest

24
6. Evaluation (cont.)
  • HPC Bench

25
6. Evaluation (cont.)
  • HPC Bench

26
6. Evaluation (cont.)
  • HPC Bench
  • Multiple VMs

27
7. Related Work
  • OS bypass
  • Active Messages, U-Net, FM, VMMC, Arsenic
  • VMM I/O
  • VMware Workstation, ESX, Xen
  • Ethernet migration
  • Xen: carries the IP address, unsolicited ARP
  • Process-level migration
  • Zap, VNAT
  • Mobile IP

28
Q&A
  • ??!