1. Handling Churn in a DHT
- USENIX Annual Technical Conference
- June 29, 2004
- Sean Rhea, Dennis Geels,
- Timothy Roscoe, and John Kubiatowicz
- UC Berkeley and Intel Research Berkeley
2. What's a DHT?
- Distributed Hash Table
- Peer-to-peer algorithm offering a put/get interface
- Associative map for peer-to-peer applications
- More generally, provide lookup functionality
- Map application-provided hash values to nodes
- (Just as local hash tables map hashes to memory locations)
- Put/get then constructed above lookup
- Many proposed applications
- File sharing, end-system multicast, aggregation trees
3. How Does Lookup Work?
- Assign IDs to nodes
- Map hash values to node with closest ID
- Leaf set is successors and predecessors
- All that's needed for correctness
- Routing table matches successively longer prefixes
- Allows efficient lookups
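A minimal sketch of this lookup structure, assuming a hypothetical layout (not the Bamboo/Pastry code): the routing table is keyed by (shared-prefix length, next digit), and the leaf set finishes the lookup among numerically close IDs.

```python
# Sketch of prefix-based DHT lookup; IDs are 32-bit ints shown as 8 hex
# digits. The (prefix length, digit) routing-table layout is illustrative.
ID_HEX_DIGITS = 8

def hex_id(node_id: int) -> str:
    return f"{node_id:0{ID_HEX_DIGITS}x}"

def shared_prefix_len(a: int, b: int) -> int:
    """Number of leading hex digits two IDs share."""
    sa, sb = hex_id(a), hex_id(b)
    n = 0
    while n < ID_HEX_DIGITS and sa[n] == sb[n]:
        n += 1
    return n

def next_hop(my_id, key, routing_table, leaf_set):
    """Prefer a routing-table entry matching one more digit of the key;
    otherwise fall back to the leaf set, which alone ensures correctness.
    (Simplified: ignores wraparound on the ID ring.)"""
    p = shared_prefix_len(my_id, key)
    if p == ID_HEX_DIGITS:
        return my_id                       # we own this key
    entry = routing_table.get((p, hex_id(key)[p]))
    if entry is not None:
        return entry                       # efficient: fixes the next digit
    return min(list(leaf_set) + [my_id], key=lambda n: abs(n - key))
```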
4. Why Focus on Churn?
"Chord is a scalable protocol for lookup in a dynamic peer-to-peer system with frequent node arrivals and departures." -- Stoica et al., 2001
Authors   Systems Observed    Session Time
SGG02     Gnutella, Napster   50% < 60 minutes
CLL02     Gnutella, Napster   31% < 10 minutes
SW02      FastTrack           50% < 1 minute
BSV03     Overnet             50% < 60 minutes
GDS03     Kazaa               50% < 2.4 minutes
5. A Simple Lookup Test
- Start up 1,000 DHT nodes on ModelNet network
- Emulates a 10,000-node, AS-level topology
- Unlike simulations, models cross traffic and packet loss
- Unlike PlanetLab, gives reproducible results
- Churn nodes at some rate
- Poisson arrival of new nodes
- Random node departs on every new arrival
- Exponentially distributed session times (churn model sketched below)
- Each node does 1 lookup every 10 seconds
- Log results, process them after test
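A toy version of this churn model; parameter values and names are mine, not the paper's test harness:

```python
import random

def churn_events(arrival_rate_hz, duration_s, nodes):
    """Yield (time, event, node_id) tuples: Poisson arrivals, and a
    random node departs on each arrival, so session times come out
    exponentially distributed in steady state."""
    t, next_id = 0.0, max(nodes) + 1
    while t < duration_s:
        t += random.expovariate(arrival_rate_hz)   # Poisson arrivals
        departing = random.choice(tuple(nodes))    # random node leaves
        nodes.remove(departing)
        yield (t, "leave", departing)
        nodes.add(next_id)
        yield (t, "join", next_id)
        next_id += 1

# Example: 1,000 nodes churned for an hour at ~6 arrivals per minute.
nodes = set(range(1000))
for t, event, node in churn_events(0.1, 3600.0, nodes):
    pass  # the real test starts or kills a DHT process per event
```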
6. Early Test Results
- Tapestry (the OceanStore DHT) falls over completely
- Worked great in simulations, but not on a more realistic network
- Despite sharing almost all code between the two
- And the problem isn't limited to Tapestry
7. Handling Churn in a DHT
- Forget about comparing different implementations
- Too many differing factors
- Hard to isolate effects of any one feature
- Implement all relevant features in one DHT
- Using Bamboo (similar to Pastry)
- Isolate important issues in handling churn
- Recovering from failures
- Routing around suspected failures
- Proximity neighbor selection
8. Recovering From Failures
- For correctness, maintain leaf set during churn
- Also routing table, but not needed for correctness
- The Basics
- Ping new nodes before adding them
- Periodically ping neighbors
- Remove nodes that don't respond
- Simple algorithm
- After every change in leaf set, send it to all neighbors
- Called reactive recovery
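A rough sketch of reactive recovery as described above; the transport interface is an assumption, not Bamboo's actual API:

```python
class ReactiveRecovery:
    """Ping before adding, ping neighbors periodically, drop
    non-responders, and eagerly push every leaf-set change."""

    def __init__(self, transport, leaf_set):
        self.transport = transport     # assumed: ping(node), send(node, msg)
        self.leaf_set = set(leaf_set)

    def consider(self, node):
        """Ping a newly discovered node before adding it."""
        if self.transport.ping(node):
            self.leaf_set.add(node)
            self.broadcast_change()

    def periodic_ping(self):
        """Remove neighbors that fail to respond to a ping."""
        dead = {n for n in self.leaf_set if not self.transport.ping(n)}
        if dead:
            self.leaf_set -= dead
            self.broadcast_change()

    def broadcast_change(self):
        # The reactive part: every change triggers a burst of messages,
        # which is exactly what misbehaves under churn (next slide).
        for n in self.leaf_set:
            self.transport.send(n, ("leaf_set", sorted(self.leaf_set)))
```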
9. The Problem With Reactive Recovery
- Under churn, many pings and change messages
- If bandwidth limited, interfere with each other
- Lots of dropped pings look like a failure
- Respond to failure by sending more messages
- Probability of drop goes up
- We have a positive feedback cycle (squelch)
- Can break cycle two ways
- Limit probability of false suspicions of failure
- Recover periodically
10. Periodic Recovery
- Periodically send the whole leaf set to a random member (sketched below)
- Breaks feedback loop
- Converges in O(log N)
- Back off period on message loss
- Makes a negative feedback cycle (damping)
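A minimal sketch of this idea; the period constants are my guesses, not measured values from the paper:

```python
import random

class PeriodicRecovery:
    """Gossip the whole leaf set to one random member per period,
    doubling the period on message loss (negative feedback)."""

    def __init__(self, transport, leaf_set,
                 base_period_s=5.0, max_period_s=80.0):
        self.transport = transport     # assumed: send() reports delivery
        self.leaf_set = set(leaf_set)
        self.base_period_s = base_period_s
        self.max_period_s = max_period_s
        self.period_s = base_period_s

    def tick(self):
        """Called once per period."""
        target = random.choice(tuple(self.leaf_set))
        delivered = self.transport.send(
            target, ("leaf_set", sorted(self.leaf_set)))
        if delivered:
            self.period_s = self.base_period_s       # healthy: full rate
        else:
            # Loss slows us down instead of speeding us up: damping.
            self.period_s = min(self.period_s * 2, self.max_period_s)
```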
11. Routing Around Failures
- Being conservative increases latency
- Original next hop may have left network forever
- Don't want to stall lookups
- DHT has many possible routes
- But retrying too soon leads to packet explosion
- Goal
- Know for sure that packet is lost
- Then resend along different path
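One way to express that goal in code; all names here are hypothetical:

```python
def forward_with_retry(key, candidates, send_and_wait, timeout_for):
    """Try candidate next hops in order: wait until the per-node timeout
    says the packet is lost, then resend along a different path."""
    for node in candidates:            # the DHT has many possible routes
        if send_and_wait(node, key, timeout_for(node)):
            return node                # acked within the timeout
    raise RuntimeError("all candidate next hops timed out")
```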
12. Calculating Good Timeouts
- Use TCP-style timers (estimator sketched below)
- Keep past history of latencies
- Use this to compute timeouts for new requests
- Works fine for recursive lookups
- Only talk to neighbors, so history is small and current
- In iterative lookups, the source directs the entire lookup
- Must potentially have a good timeout for any node
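A sketch of the TCP-style estimator; the constants follow the classic Jacobson/Karels values as standardized in RFC 6298, but the class itself is mine:

```python
class RttEstimator:
    """Per-neighbor TCP-style timeout: smoothed RTT plus a multiple
    of the RTT variance."""
    ALPHA, BETA, K = 0.125, 0.25, 4.0

    def __init__(self):
        self.srtt = None               # smoothed round-trip time
        self.rttvar = None             # round-trip time variation

    def observe(self, rtt_s):
        """Fold one measured RTT sample into the history."""
        if self.srtt is None:
            self.srtt, self.rttvar = rtt_s, rtt_s / 2
        else:
            self.rttvar = (1 - self.BETA) * self.rttvar \
                        + self.BETA * abs(self.srtt - rtt_s)
            self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * rtt_s

    def timeout_s(self):
        """Timeout for the next request to this neighbor."""
        if self.srtt is None:
            return 1.0                 # conservative default, no samples yet
        return self.srtt + self.K * self.rttvar
```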
13. Virtual Coordinates
- Machine learning algorithm to estimate latencies
- Distance between coordinates proportional to latency
- Called Vivaldi; used by the MIT Chord implementation (update step sketched below)
- Compare with TCP-style under recursive routing
- Insight into cost of iterative routing due to timeouts
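A single update step of the published Vivaldi algorithm, simplified to 2-D without the height vector; this is my sketch, not MIT's code:

```python
import math

def vivaldi_step(my_xy, peer_xy, measured_rtt_s, delta=0.25):
    """Nudge my coordinates so Euclidean distance tracks measured RTT."""
    dx, dy = my_xy[0] - peer_xy[0], my_xy[1] - peer_xy[1]
    dist = math.hypot(dx, dy) or 1e-9      # avoid division by zero
    error = measured_rtt_s - dist          # positive: move away from peer
    return (my_xy[0] + delta * error * dx / dist,
            my_xy[1] + delta * error * dy / dist)

# After enough samples, a timeout for a node we have never contacted
# can be estimated from coordinate distance alone.
```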
14. Proximity Neighbor Selection (PNS)
- For each neighbor, may be many candidates
- Choosing the closest with the right prefix is called PNS
- One of the most researched areas in DHTs
- Can we achieve good PNS under churn?
- Remember
- leaf set for correctness
- routing table for efficiency?
- Insight: extend this philosophy
- Any routing table gives O(log N) lookup hops
- Treat PNS as an optimization only
- Find close neighbors by simple random sampling
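Random sampling is simple enough to fit in a few lines; this is a sketch, and the function names are mine:

```python
import random

def sample_closest(candidates, measure_rtt_s, k=3):
    """PNS as pure optimization: ping up to k random candidates with
    the right prefix and keep whichever responds fastest."""
    sample = random.sample(list(candidates), min(k, len(candidates)))
    return min(sample, key=measure_rtt_s)
```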
15. PNS Results (very abbreviated; see paper for more)
- Random sampling almost as good as everything else
- 24% latency improvement for free
- 42% improvement for 40% more bandwidth
- Compare to 68-84% improvement from using good timeouts
- Other algorithms more complicated, not much better
16. Related Work
- Liben-Nowell et al.
- Analytical lower bound on maintenance costs
- Mahajan et al.
- Simulation-based study of Pastry under churn
- Automatic tuning of maintenance rate
- Suggest increasing rate on failures!
- Other simulations
- Li et al.
- Lam and Liu
- Zhuang
- Cooperative failure detection in DHTs
- Dabek et al.
- Throughput and latency improvements w/o churn
17. Future Work
- Continue study of iterative routing
- Have shown virtual coordinates good for timeouts
- How does congestion control work under churn?
- Broaden methodology
- Better network and churn models
- Move beyond lookup layer
- Study put/get and multicast algorithms under churn
18. Conclusions/Recommendations
- Avoid positive feedback cycles in recovery
- Beware of false suspicions of failure
- Recover periodically rather than reactively
- Route around potential failures early
- Don't wait to conclude definite failure
- TCP-style timeouts quickest for recursive routing
- Virtual-coordinate-based timeouts not prohibitive
- PNS can be cheap and effective
- Only need simple random sampling
19. For Code and More Information: bamboo-dht.org