PASTRY - PowerPoint PPT Presentation

1 / 23
About This Presentation
Title:

PASTRY

Description:

Node Departures/Failures. Lazy repair ... is not updated for a failed or departed node until a failed attempt to contact that node ... – PowerPoint PPT presentation

Number of Views:346
Avg rating:3.0/5.0
Slides: 24
Provided by: owenke
Category:

less

Transcript and Presenter's Notes

Title: PASTRY


1
PASTRY
  • Scalable, decentralized object location and
    routing for large-scale peer-to-peer systems

2
Background P2P Systems
  • Early P2P Models
  • Original Napster
  • Centralized content management system
  • Peers queried the central server which responded
    with nodes that contain the requested content
  • Does not truly follow peer-to-peer model
  • Gnutella
  • Content is found via query-flooding
  • Subject to scalability issues and severely
    over-utilizes bandwidth
  • Kazaa, Grokster
  • Hybrid systems with concept of superpeers
  • Reverts back to a type of client/server model and
    still suffers from scalability problems

3
Distributed Hash Tables
  • Distributed Hash Tables solve the problem of
    efficiently locating and routing objects in a P2P
    system
  • Chord, CAN, Pastry, Tapestry all in some way
    implement this technique
  • Given a key of the desired object, the node that
    contains the object can be found quickly and
    efficiently
  • No need for centralized servers
  • No need for query flooding or high bandwidth
    search techniques

4
PASTRY
  • Pastry is a self-organizing P2P system based on
    this concept of distributed hashing
  • Basic structure
  • Each node has a 128 bit unique identifier
  • Thus nodes are organized in a ring structure
  • When given a message and a key, a node routes the
    message to the node whos nodeId is numerically
    closest to the key

5
PASTRY Routing Algorithm
  • Pastrys routing algorithm is designed to not
    only route a message to a node with the closest
    ID, but also to minimize the physical distance
    traveled (according to some definable metric e.g.
    IP hops)
  • Each node maintains a routing table, and the
    criteria for choosing which node to forward a
    message to is dependent both on nodeIds and the
    proximity metric described

6
PASTRY Routing Algorithm
  • The routing mechanism in Pastry is dependent on
    two configuration parameters
  • L Each node maintains a list of the L nodes in
    the network that have the numerically closest
    Ids to the particular nodes Id. (Typical value
    16 or 32)
  • b When routing messages, nodeIds and keys are
    converted to base 2b. Each hop during the
    routing phase brings the message to a node whose
    nodeId shares one more digit in common with the
    key than in the previous hop. (Typical value 4)
  • This allows the algorithm to route messages to
    the node with the numerically closest key in
    O(log2bN) where N is the number of nodes in the
    Pastry network

7
PASTRY Routing Algorithm
L nodes which are numerically closest to node
(L/2 smaller and L/2 larger)
Routing table contains one row for each digit in
the nodeID (log2bN rows). Each row contains an
entry for each possible value for that digit.
All of the preceding digits for the entry must
match the current nodeIds while the following
digits may be any combination.
State of pastry node where b2 and L8. Thus
numbers are in base 4
8
PASTRY Routing Algorithm
  • A message with key D arrives at a node with
    nodeId A
  • The node first checks the leaf set
  • If D is within the range of the nodeIds in the
    leaf set, the message is forwarded to the node in
    the set whose nodeId is closest to D
  • If not, the routing table is used.
  • Let k be the number of digits that D and A have
    in common
  • Forward the message to the node in the kth row
    that has the k1th digit in common with D
  • If this entry does not exist, forward the message
    to the node that has the next closest numerical
    value out of all nodes in the leaf set and
    routing table

9
PASTRY Routing Algorithm
  • Example route

Message key d46a1c Source node 65alfc 0
matching digits
10
PASTRY Routing Algorithm
  • Example route

Message key d46a1c First hop d13da3 1
matching digit
11
PASTRY Routing Algorithm
  • Example route

Message key d46a1c Second hop d4213f 2
matching digits
12
PASTRY Routing Algorithm
  • Example route

Message key d46a1c Third hop d462ba 3
matching digits
13
PASTRY Routing Algorithm
  • Example route

Message key d46a1c Fourth hop d467c4 3
matching digits
No live nodes with 4 matching digits exists.
Forwarded to node with next closest numerical id
with 3 matching digits. Destination reached.
14
Node Additions
X
  • A new node with nodeId X joins the Pastry ring by
    contacting a known node with nodeId A
  • Node A sends a special join message with key X
    out to the Pastry network
  • The message is routed like any other message,
    eventually reaching the Pastry node Z with the
    closest nodeId to X
  • Like any other message, a number of intermediate
    hops are passed through

join
A
B
C
D
Z
15
Node Additions
  • Each node along the path of the join message
    sends its routing state information to node X
  • Node A is used for the basis of the neighborhood
    set since it is assumed to be in close proximity
    to X
  • Node Z is used for the basis of the leaf set
    since it has the closest numerical nodeId in the
    network
  • The routing table for X is built using the state
    information from each of the intermediate nodes

A
B
C
X
D
Z
16
Node Additions
  • Routing table construction for node X
  • Consider the properties of the Pastry routing
    algorithm. The nodes along the path to the join
    method obey the following general properties
  • Node A shares 0 digits in common with X
  • Node B shares 1 digit in common with X
  • Node C shares 2 digits in common with X
  • Each row in the routing table can be built using
    the corresponding row of the corresponding hop
  • Row 0 can be built using Row 0 of Node A
  • Row 1 can be built using Row 1 of Node B

17
Node Additions
  • Once node X has finished building its state
    information, it sends a notification message to
    each of the nodes in its routing table, leaf set,
    and neighborhood set
  • Each of these nodes update their state
    information accordingly

18
Node Additions
  • Maintaining locality
  • One last phase before node addition is complete
  • Node X requests state information from all nodes
    in its new routing table
  • When it receives these states, it updates its
    routing table according to the proximity metric,
    replacing nodes in its table with ones that are
    closer to it
  • Total messages required for node addition
    O(log2bN)

19
Node Departures/Failures
  • Lazy repair
  • State information of a Pastry node is not updated
    for a failed or departed node until a failed
    attempt to contact that node
  • If the node is in the leaf set, contact the
    highest id node in the leaf set and repair the
    leaf set using that nodes leaf set
  • If the node is in the routing table
  • Contact a different node in the same row and ask
    for the appropriate entry
  • If no node is found from the entries in that row,
    ask the nodes in the next row, etc.

20
PASTRY Applications
  • Pastry is conducive for use in a number of
    applications
  • PAST
  • Distributed file system
  • Each file name is hashed to a unique fileId
  • The fileId is used as the Pastry key for the file
  • To retrieve a file, simply send a message on the
    Pastry network with the fileId as the key
  • Store each file in the k nodes with nodeIds
    closest to the fileId for redundancy and to
    account for node failure

21
PASTRY Applications
  • SCRIBE
  • Publish/subscribe system
  • Topics are hashed to topicIds which are used as
    the Pastry id for a particular subscription group
  • Subscribers can then subscribe to a topic using
    the topicId and the node closest to the topicId
    maintains a list of subscribers forming a
    multicast system for topics

22
PASTRY Applications
  • Many other applications that mesh well with the
    PASTRY architecture and require minimal effort to
    implement in terms of node organization and
    communication
  • Problem applications
  • Napster, Gnutella, and other search services that
    require keyword based searching are more
    difficult to implement in PASTRY
  • Some research in this area has been done by
    leveraging Pastrys distributed hash table
    architecture to maintain a keyword hash table
    that maps keywords to nodes
  • I am doing research in this area for my project

23
References
  • 1 A. Rowstron and P. Druschel, "Pastry
    Scalable, distributed object location and routing
    for large-scale peer-to-peer systems".  IFIP/ACM
    International Conference on Distributed Systems
    Platforms (Middleware), Heidelberg, Germany,
    pages 329-350, November, 2001
  • 2 P. Druschel and A. Rowstron, "PAST A
    large-scale, persistent peer-to-peer storage
    utility", HotOS VIII, Schoss Elmau, Germany,  May
    2001
  • 3 M. Castro, P. Druschel, A-M. Kermarrec  and
    A. Rowstron, "SCRIBE A large-scale and
    decentralised application-level multicast
    infrastructure", IEEE Journal on Selected Areas
    in Communications (JSAC) (Special issue on
    Network Support for Multicast Communications).
    2002, to appear
  • 4 Patrick Reynolds and Amin Vahdat, "Efficient
    Peer-to-Peer Keyword Searching," to appear in
    Middleware 2003
Write a Comment
User Comments (0)
About PowerShow.com