PASTRY - PowerPoint PPT Presentation

1 / 23

About This Presentation

Title:

PASTRY

Description:

Node Departures/Failures. Lazy repair ... is not updated for a failed or departed node until a failed attempt to contact that node ... – PowerPoint PPT presentation

Number of Views:346

Avg rating:3.0/5.0

Slides: 24

Provided by: owenke

Category:

more less

Transcript and Presenter's Notes

Title: PASTRY

1
PASTRY

Scalable, decentralized object location and
routing for large-scale peer-to-peer systems

2
Background P2P Systems

Early P2P Models
Original Napster
Centralized content management system
Peers queried the central server which responded
with nodes that contain the requested content
Does not truly follow peer-to-peer model
Gnutella
Content is found via query-flooding
Subject to scalability issues and severely
over-utilizes bandwidth
Kazaa, Grokster
Hybrid systems with concept of superpeers
Reverts back to a type of client/server model and
still suffers from scalability problems

3
Distributed Hash Tables

Distributed Hash Tables solve the problem of
efficiently locating and routing objects in a P2P
system
Chord, CAN, Pastry, Tapestry all in some way
implement this technique
Given a key of the desired object, the node that
contains the object can be found quickly and
efficiently
No need for centralized servers
No need for query flooding or high bandwidth
search techniques

4
PASTRY

Pastry is a self-organizing P2P system based on
this concept of distributed hashing
Basic structure
Each node has a 128 bit unique identifier
Thus nodes are organized in a ring structure
When given a message and a key, a node routes the
message to the node whos nodeId is numerically
closest to the key

5
PASTRY Routing Algorithm

Pastrys routing algorithm is designed to not
only route a message to a node with the closest
ID, but also to minimize the physical distance
traveled (according to some definable metric e.g.
IP hops)
Each node maintains a routing table, and the
criteria for choosing which node to forward a
message to is dependent both on nodeIds and the
proximity metric described

6
PASTRY Routing Algorithm

The routing mechanism in Pastry is dependent on
two configuration parameters
L Each node maintains a list of the L nodes in
the network that have the numerically closest
Ids to the particular nodes Id. (Typical value
16 or 32)
b When routing messages, nodeIds and keys are
converted to base 2b. Each hop during the
routing phase brings the message to a node whose
nodeId shares one more digit in common with the
key than in the previous hop. (Typical value 4)
This allows the algorithm to route messages to
the node with the numerically closest key in
O(log2bN) where N is the number of nodes in the
Pastry network

7
PASTRY Routing Algorithm
L nodes which are numerically closest to node
(L/2 smaller and L/2 larger)
Routing table contains one row for each digit in
the nodeID (log2bN rows). Each row contains an
entry for each possible value for that digit.
All of the preceding digits for the entry must
match the current nodeIds while the following
digits may be any combination.
State of pastry node where b2 and L8. Thus
numbers are in base 4
8
PASTRY Routing Algorithm

A message with key D arrives at a node with
nodeId A
The node first checks the leaf set
If D is within the range of the nodeIds in the
leaf set, the message is forwarded to the node in
the set whose nodeId is closest to D
If not, the routing table is used.
Let k be the number of digits that D and A have
in common
Forward the message to the node in the kth row
that has the k1th digit in common with D
If this entry does not exist, forward the message
to the node that has the next closest numerical
value out of all nodes in the leaf set and
routing table

9
PASTRY Routing Algorithm

Example route

Message key d46a1c Source node 65alfc 0
matching digits
10
PASTRY Routing Algorithm

Example route

Message key d46a1c First hop d13da3 1
matching digit
11
PASTRY Routing Algorithm

Example route

Message key d46a1c Second hop d4213f 2
matching digits
12
PASTRY Routing Algorithm

Example route

Message key d46a1c Third hop d462ba 3
matching digits
13
PASTRY Routing Algorithm

Example route

Message key d46a1c Fourth hop d467c4 3
matching digits
No live nodes with 4 matching digits exists.
Forwarded to node with next closest numerical id
with 3 matching digits. Destination reached.
14
Node Additions
X

A new node with nodeId X joins the Pastry ring by
contacting a known node with nodeId A
Node A sends a special join message with key X
out to the Pastry network
The message is routed like any other message,
eventually reaching the Pastry node Z with the
closest nodeId to X
Like any other message, a number of intermediate
hops are passed through

join
A
B
C
D
Z
15
Node Additions

Each node along the path of the join message
sends its routing state information to node X
Node A is used for the basis of the neighborhood
set since it is assumed to be in close proximity
to X
Node Z is used for the basis of the leaf set
since it has the closest numerical nodeId in the
network
The routing table for X is built using the state
information from each of the intermediate nodes

A
B
C
X
D
Z
16
Node Additions

Routing table construction for node X
Consider the properties of the Pastry routing
algorithm. The nodes along the path to the join
method obey the following general properties
Node A shares 0 digits in common with X
Node B shares 1 digit in common with X
Node C shares 2 digits in common with X
Each row in the routing table can be built using
the corresponding row of the corresponding hop
Row 0 can be built using Row 0 of Node A
Row 1 can be built using Row 1 of Node B

17
Node Additions

Once node X has finished building its state
information, it sends a notification message to
each of the nodes in its routing table, leaf set,
and neighborhood set
Each of these nodes update their state
information accordingly

18
Node Additions

Maintaining locality
One last phase before node addition is complete
Node X requests state information from all nodes
in its new routing table
When it receives these states, it updates its
routing table according to the proximity metric,
replacing nodes in its table with ones that are
closer to it
Total messages required for node addition
O(log2bN)

19
Node Departures/Failures

Lazy repair
State information of a Pastry node is not updated
for a failed or departed node until a failed
attempt to contact that node
If the node is in the leaf set, contact the
highest id node in the leaf set and repair the
leaf set using that nodes leaf set
If the node is in the routing table
Contact a different node in the same row and ask
for the appropriate entry
If no node is found from the entries in that row,
ask the nodes in the next row, etc.

20
PASTRY Applications

Pastry is conducive for use in a number of
applications
PAST
Distributed file system
Each file name is hashed to a unique fileId
The fileId is used as the Pastry key for the file
To retrieve a file, simply send a message on the
Pastry network with the fileId as the key
Store each file in the k nodes with nodeIds
closest to the fileId for redundancy and to
account for node failure

21
PASTRY Applications

SCRIBE
Publish/subscribe system
Topics are hashed to topicIds which are used as
the Pastry id for a particular subscription group
Subscribers can then subscribe to a topic using
the topicId and the node closest to the topicId
maintains a list of subscribers forming a
multicast system for topics

22
PASTRY Applications

Many other applications that mesh well with the
PASTRY architecture and require minimal effort to
implement in terms of node organization and
communication
Problem applications
Napster, Gnutella, and other search services that
require keyword based searching are more
difficult to implement in PASTRY
Some research in this area has been done by
leveraging Pastrys distributed hash table
architecture to maintain a keyword hash table
that maps keywords to nodes
I am doing research in this area for my project

23
References

1 A. Rowstron and P. Druschel, "Pastry
Scalable, distributed object location and routing
for large-scale peer-to-peer systems". IFIP/ACM
International Conference on Distributed Systems
Platforms (Middleware), Heidelberg, Germany,
pages 329-350, November, 2001
2 P. Druschel and A. Rowstron, "PAST A
large-scale, persistent peer-to-peer storage
utility", HotOS VIII, Schoss Elmau, Germany, May
2001
3 M. Castro, P. Druschel, A-M. Kermarrec and
A. Rowstron, "SCRIBE A large-scale and
decentralised application-level multicast
infrastructure", IEEE Journal on Selected Areas
in Communications (JSAC) (Special issue on
Network Support for Multicast Communications).
2002, to appear
4 Patrick Reynolds and Amin Vahdat, "Efficient
Peer-to-Peer Keyword Searching," to appear in
Middleware 2003