Replication Degree Customization for High Availability - PowerPoint PPT Presentation

About This Presentation

Title:

Replication Degree Customization for High Availability

Description:

Number of Views:50

Avg rating:3.0/5.0

Slides: 14

Provided by: kais4

Learn more at: https://www.cs.rochester.edu

Category:

more less

Transcript and Presenter's Notes

Title: Replication Degree Customization for High Availability

1
Replication Degree Customization for High
Availability

2
Problem Context

Replication degree tradeoff between availability
and space
Skewed data popularity distributions
intuitive to highly replicate popular objects
goal improve availability under certain space
constraint
Should we worry about space constraint today?
decentralized wide-area systems high machine
failure/inaccessibility rate demand high-degree
replication
0.265 failure rate from a Planetlab machine
accessibility trace
centrally-managed local-area clusters often
requiring data in-memory and relatively high
memory cost

3
Related Work

4
Basic Analytical Result

Problem formulation
p is machine failure probability ? unavailability
of an object with k replicas is pow(p,k)
a system with n objects object i popularity is
ri, size is si
find object replication degrees k1, k2, , kn to
minimize expected unavailability ?1in ri
pow(p,ki)
subject to space constraint ?1in si ki K
Result
Lagrangian function ?1in ri pow(p,ki) ?
(? 1in si ki K)
optimization is reached when the functions
partial derivatives on kis and ? are all zero
therefore ki C log1/p(ri/si), C is a
constant

5
Challenges in Systems Context

6
Complex System Models

Basic analytical result can be adapted to handle
complex system models.
Multi-object operations
multi-object op unavailability is approximately
the sum of the unavailabilities of individual
objects
when xis are small, 1-?1im (1-xi) ?1im xi
Nonuniform machine failure rates
redefine p as the average per-machine failure
rate

7
Object Popularity/Size Skewness

8
Object Popularity Stability

Stable object popularity is important for
learning the object popularity and for low
adjustment overhead
Illustration of stability across month-long trace
segments
Not designed to handle flash crowds

9
Dynamic Maintenance Overhead

10
Trace-driven Evaluation Availability Improvement

Availability improvement on four real application
traces and two machine failure patterns
Compared to uniform replication under same space
constraint

11
Trace-driven Evaluation Changing Space
Constraint

12
Trace-driven Evaluation Dynamic Maintenance
Overhead

13
Conclusion

Results
analytical result optimal replication when
object replication degree is linear to
log(popularity/size)
address systems issues in complex system models,
realistic workload behaviors, and maintenance
overhead
Big picture skewed stable data distributions
motivate per-object adaptation in distributed
system management
adapt replication degree for high availability
this paper
adapt Bloom filter hash number for low
false-positive rate other result
adapt co-placement of correlated data objects for
fast multi-object operations other result