Skippy: Enabling Long-Lived Snapshots of the Long-Lived Past
Ross Shaull (rshaull_at_cs.brandeis.edu), Liuba Shrira (liuba_at_cs.brandeis.edu), Hao Xu (hxu_at_cs.brandeis.edu)
Lab for Experimental Software Systems, Brandeis University
Analysis
- Expected cost to build SPT
- Calculate the expected cost of building an SPT by factoring in:
  - acceleration
  - cost to read sequentially at each level
  - cost of disk seeks between each level
- Plot shows time to build an SPT versus the number of Skippy levels for various skews
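The cost model above can be sketched as follows. A Skippy scan reads one node at each lower level, seeks up, and then scans the (shorter) top level sequentially. All constants here (transfer rate, seek time, mapping size, node size) are illustrative assumptions, not the paper's measured values:

```python
def spt_build_cost(maplog_mappings, acceleration, levels,
                   node_mappings=2560, mapping_bytes=20,
                   read_mb_per_s=50.0, seek_s=0.005):
    """Estimate seconds to build an SPT with a Skippy scan.

    Read one node at each of levels 0..levels-1 (plus a seek to climb
    to the next level), then scan the remaining mappings sequentially
    at the top level, which shrinks by the acceleration factor per level.
    """
    bytes_per_s = read_mb_per_s * 1024 * 1024
    cost = 0.0
    for _ in range(levels):
        cost += node_mappings * mapping_bytes / bytes_per_s + seek_s
    top_mappings = maplog_mappings / (acceleration ** levels)
    cost += top_mappings * mapping_bytes / bytes_per_s
    return cost
```

With any acceleration above 1, adding levels trades a few node reads and seeks for a much shorter sequential scan, which is why the measured build times drop as levels are added.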
Motivation
- An old problem: time travel in databases
  - Retaining past state at the logical record level (ImmortalDB, Postgres) changes the arrangement of current state
  - File system-level approaches block transactions to get consistency (VSS)
- A new solution: split snapshots
  - Integrates with the page cache and transaction manager to provide disk page-level snapshots
  - Application declares transactionally-consistent snapshots at any time, with any frequency
  - Snapshots are retained incrementally using copy-on-write (COW), without reorganizing the database
  - All applications and access methods run unmodified on persistent, on-line snapshots
  - Achieves good performance in the same manner as the database: leverages db recovery to defer snapshot writes
- A new problem: how to index copy-on-write split snapshots?
Cost of Scan
- Let overwrite cycle length L be the number of page updates required to overwrite the entire database
- The overwrite cycle length determines the number of mappings that must be scanned to construct an SPT
- Let N be the number of pages in the database
- For a uniformly random workload, L = N ln N (by the coupon collector's waiting time problem)
- Skew in the update workload lengthens the overwrite cycle by introducing many more repeated mappings
- For example, 80/20 skew (80% of updates to 20% of pages) increases L by a factor of 4
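The L = N ln N claim and the effect of skew are easy to check by simulating the overwrite cycle. This is a sketch; the `hot_frac`/`hot_prob` parameters are illustrative, not from the paper:

```python
import math
import random

def overwrite_cycle_uniform(n_pages):
    """Expected overwrite cycle length L = N ln N for uniform random
    updates (the coupon collector's waiting time)."""
    return n_pages * math.log(n_pages)

def simulate_cycle(n_pages, hot_frac=0.2, hot_prob=0.8, seed=1):
    """Count updates until every page has been overwritten at least
    once. Skew: hot_prob of updates go to the first hot_frac of pages
    (hot_prob == hot_frac gives a uniform workload)."""
    rng = random.Random(seed)
    hot = max(1, int(n_pages * hot_frac))
    seen, updates = set(), 0
    while len(seen) < n_pages:
        if rng.random() < hot_prob:
            page = rng.randrange(hot)           # hot section
        else:
            page = rng.randrange(hot, n_pages)  # cold section
        seen.add(page)
        updates += 1
    return updates
```

The cold pages dominate: with 80/20 skew, only 20% of the updates are spread over 80% of the pages, so the cycle stretches out by roughly the factor the poster reports.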
Implementation
- Skippy-based snapshot system implemented in Berkeley DB (BDB)
- Page-level snapshots created using COW
  - Page cache is augmented to intercept requests for pages
  - Pages that are write-locked get COWed on the first request after declaration of a snapshot
  - Pages are flushed periodically from a background thread to a second disk
- Snapshot metadata is created in-memory at transaction commit
  - Metadata (and any unflushed pages COWed before the checkpoint) are written to disk during checkpoint
  - Recovery of snapshot pages and metadata can be done in one log pass
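The COW discipline above can be sketched with a toy page cache. All names and structures here are hypothetical simplifications, not BDB's API:

```python
class SnapshotPageCache:
    """Toy sketch of the COW discipline: a page's pre-state is copied
    on the first write after the latest snapshot declaration."""

    def __init__(self):
        self.pages = {}           # page_id -> bytes (current state)
        self.snapshots = 0        # id of the most recent declared snapshot
        self.copied_for = {}      # page_id -> last snapshot it was COWed for
        self.snapshot_store = []  # (snap_id, page_id, old_bytes), append-only

    def declare_snapshot(self):
        # A transactionally consistent point (consistency machinery elided).
        self.snapshots += 1
        return self.snapshots

    def write_page(self, page_id, data):
        # Intercept the write: COW the pre-state on the first update
        # after the latest snapshot declaration, and only once per snapshot.
        if self.copied_for.get(page_id, 0) < self.snapshots and page_id in self.pages:
            self.snapshot_store.append((self.snapshots, page_id, self.pages[page_id]))
            self.copied_for[page_id] = self.snapshots
        self.pages[page_id] = data
```

Note that `snapshot_store` grows strictly in snapshot order, which is the ordering invariant the Maplog relies on.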
Experimental Evaluation: Cost of Scanning to Construct SPT
- Setup
  - 100M database
  - 50K node (holds 2560 mappings, which is 1/10th the number of database pages)
  - 10,000rpm disk
- Conclusions
  - Skippy could counteract 80/20 skew in 3 levels
  - 99/1 skew has a hot section much smaller than the node size, so one level is enough
Skew    Skippy Levels   Time to Build SPT (s)
50/50   0               13.8
80/20   0               19.0
80/20   1               15.8
80/20   2               14.7
80/20   3               13.9
99/1    0               33.3
99/1    1               6.69
Indexing Split Snapshots
- Each snapshot needs its own page table (an SPT), which points to current-state and COWed pages
Key Idea
For a faster scan, create higher-level logs of FEMs with fewer repeated mappings.
- Order of Events
  - Snapshot 1 declared
  - Page 1 modified
  - Snapshot 2 declared
  - Page 1 modified again
  - Page 2 modified
[Figure: database pages P1-P3 and the COWed snapshot pages after the events above]
Combat Skew with Skippy
- Divide Maplog into equal-sized chunks called nodes
- Copy each FEM in a Maplog node into Skippy Level 1
- At the end of each node, record an up-link that points to the next position in Skippy Level 1 where a mapping will be stored
- To construct Skippy Level N, recursively apply the same procedure to the previous Skippy level
- When scanning, follow up-links into the Skippy levels (a Skippy scan)
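The per-node FEM copying step can be sketched as below. The node size and representation are illustrative (mappings as `(page_id, location)` pairs), and up-links are left implicit:

```python
NODE_SIZE = 4  # mappings per node; tiny for illustration (2560 in the experiments)

def build_skippy_level(lower_level):
    """Build the next Skippy level from a list of (page_id, location)
    mappings: within each node, copy up only the first-encountered
    mapping (FEM) per page, preserving order."""
    upper = []
    for start in range(0, len(lower_level), NODE_SIZE):
        node = lower_level[start:start + NODE_SIZE]
        seen = set()
        for page_id, loc in node:
            if page_id not in seen:  # drop repeated mappings within the node
                seen.add(page_id)
                upper.append((page_id, loc))
    return upper
```

Applying `build_skippy_level` repeatedly yields the higher Skippy levels; each application removes the repeats that happen to fall within one node, which is exactly where skewed workloads concentrate them.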
[Figure: Maplog divided into nodes, with up-links into Skippy Level 1]
- P2 is shared by SPT1 and SPT2
- P3 has not been modified, so SPT1 and SPT2 both point to P3 in the database
Impact of Taking Snapshots
[Figure: SPT1 and SPT2 constructed from Maplog mappings]
- Can we create split snapshots with a Skippy index efficiently?
- Plot shows time to complete a single-threaded updating workload of 100,000 transactions in a 66M database with each of 50/50, 80/20, and 99/1 skews
- We can retain a snapshot after every transaction for a 68% penalty
[Figure: Maplog with Start points for Snapshots 1-6; solid arrows denote pointers, dotted arrows indicate copying]
Updating SPTs on disk would be costly, since one COW may change the pointers in multiple SPTs.
Accessing Snapshots with Maplog
- Instead of maintaining many SPTs, append mappings to snapshot pages into a log, the Maplog
- Ordering invariant: all mappings retained for snapshot X are written into Maplog before mappings retained for snapshot X+1
- Construct the SPT for snapshot X by scanning for first-encountered mappings (FEMs)
- Any page for which a mapping is not found in Maplog is still in the database (i.e., has not been COWed yet)
A Skippy scan that begins at Start(X) constructs the same SPT for snapshot X as a full Maplog scan.
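The FEM scan over Maplog can be sketched as follows, assuming a hypothetical representation of Maplog as a list of `(page_id, location)` pairs and Start(X) as an index into it:

```python
def construct_spt(maplog, start_index):
    """Scan Maplog forward from Start(X), keeping only the
    first-encountered mapping (FEM) for each page."""
    spt = {}
    for page_id, snapshot_location in maplog[start_index:]:
        if page_id not in spt:  # later mappings belong to later overwrites
            spt[page_id] = snapshot_location
    return spt  # pages absent from spt are still in the database

# Events: Snap 1 declared; P1 modified; Snap 2 declared; P1 modified; P2 modified
maplog = [(1, 'copy0'), (1, 'copy1'), (2, 'copy2')]
spt1 = construct_spt(maplog, start_index=0)  # {1: 'copy0', 2: 'copy2'}
spt2 = construct_spt(maplog, start_index=1)  # {1: 'copy1', 2: 'copy2'}
```

Note how P2's single copy serves both SPTs, matching the sharing shown in the figures; P3 never appears, so both SPTs read it from the database.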
More Stuff
- Across-Time Execution (ATE)
  - Skippy can be used to cheaply construct SPTs for a window of snapshots for across-time analysis
- Garbage collection
  - Collecting old records from a log is simple: just chop off the tail
  - Skippy enables cheap generational garbage collection of snapshots according to priorities (Thresher; Shrira and Xu, USENIX 2006)
Acceleration of Scan
- Skippy acceleration is the ratio of the number of mappings in the top Skippy level to the number in Maplog
- Acceleration depends on:
  - Node size (repeated mappings are only eliminated if they appear in the same node)
  - Workload (likelihood of repetitions)
References
- Shaull, R., Shrira, L., and Xu, H. Skippy: Enabling Long-Lived Snapshots of the Long-Lived Past. ICDE 2008.
- Shrira, L., van Ingen, C., and Shaull, R. Time Travel in the Virtualized Past. SYSTOR 2007.
- Shrira, L., and Xu, H. Snap: Efficient Snapshots for Back-In-Time Execution. ICDE 2005.