Title: What is a P2P system?
1. What is a P2P system?
[Diagram: several nodes connected to one another through the Internet.]
- A distributed system architecture
- No centralized control
- Nodes are symmetric in function
- Large number of unreliable nodes
- Enabled by technology improvements
2. How to build critical services?
- Many critical services use the Internet
- Hospitals, government agencies, etc.
- These services need to be robust
- Node and communication failures
- Load fluctuations (e.g., flash crowds)
- Attacks (including DDoS)
3. The promise of P2P computing
- Reliability: no central point of failure
- Many replicas
- Geographic distribution
- High capacity through parallelism
- Many disks
- Many network connections
- Many CPUs
- Automatic configuration
- Useful in public and proprietary settings
4. Traditional distributed computing: client/server
[Diagram: several clients connected to a central server through the Internet.]
- Successful architecture, and will continue to be so
- Tremendous engineering necessary to make server farms scalable and robust
5. The abstraction: distributed hash table (DHT)
[Diagram: a three-layer stack. A distributed application (file sharing) calls put(key, data) and get(key) → data on the distributed hash table (DHash), which calls lookup(key) → node IP address on the lookup service (Chord).]
- Application may be distributed over many nodes
- DHT distributes data storage over many nodes
6. A DHT has a good interface
- put(key, value) and get(key) → value
- Simple interface!
- API supports a wide range of applications
- DHT imposes no structure/meaning on keys
- Key/value pairs are persistent and global
- Can store keys in other DHT values
- And thus build complex data structures (sketched below)
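A minimal in-memory sketch of the interface above. ToyDHT, content_key, and the JSON encoding are illustrative assumptions, not part of DHash; a real DHT would route each call to remote nodes.

```python
import hashlib
import json

class ToyDHT:
    """Hypothetical stand-in for a DHT client; stores everything locally."""
    def __init__(self):
        self._store = {}              # key (hex digest) -> value (bytes)

    def put(self, key: str, value: bytes) -> None:
        self._store[key] = value      # a real DHT stores this on remote nodes

    def get(self, key: str) -> bytes:
        return self._store[key]

def content_key(value: bytes) -> str:
    """Keys are opaque to the DHT; a content hash is one common choice."""
    return hashlib.sha1(value).hexdigest()

# Build a linked structure by storing one block's key inside another value:
dht = ToyDHT()
leaf = b"payload"
leaf_key = content_key(leaf)
dht.put(leaf_key, leaf)
root = json.dumps({"child": leaf_key}).encode()   # value embeds another key
root_key = content_key(root)
dht.put(root_key, root)
# Traverse: fetch the root, then follow the embedded key to the leaf.
assert dht.get(json.loads(dht.get(root_key))["child"]) == leaf
```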
7. A DHT makes a good shared infrastructure
- Many applications can share one DHT service
- Much as applications share the Internet
- Eases deployment of new applications
- Pools resources from many participants
- Efficient due to statistical multiplexing
- Fault-tolerant due to geographic distribution
8. Recent DHT-based projects
- File sharing: CFS, OceanStore, PAST, Ivy, …
- Web cache: Squirrel, …
- Archival/backup store: HiveNet, Mojo, Pastiche
- Censor-resistant stores: Eternity, FreeNet, …
- DB query and indexing: PIER, …
- Event notification: Scribe
- Naming systems: ChordDNS, Twine, …
- Communication primitives: I3, …
Common thread: data is location-independent
9. Roadmap
- One application: CFS/DHash
- One structured overlay: Chord
- Alternatives
- Other solutions
- Geometry and performance
- The interface
- Applications
10. CFS: cooperative file sharing
[Diagram: a file system layered over a distributed hash table spanning many nodes; the file system calls put(key, block) and get(key) → block.]
- DHT used as a robust block store
- Client of DHT implements file system
- Read-only: CFS, PAST
- Read-write: OceanStore, Ivy
11. CFS design
12. File representation: self-authenticating data
[Diagram: a file system rooted at key 995. The signed root block (ID 995) lists key 901 and key 732; a directory block's entry maps a.txt to inode block ID 144; the inode block lists data-block keys 431 and 795. Each arrow is a SHA-1 content hash.]
- Signed blocks (root blocks): Chord ID = H(publisher's public key)
- Unsigned blocks (directory, inode, and data blocks): Chord ID = H(block contents)
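A minimal sketch of the naming scheme above: unsigned blocks are named by the SHA-1 of their contents, so a fetched block can be checked by re-hashing it. Root blocks are named by the SHA-1 of the publisher's public key; verifying their signature is elided here. The dht parameter is any object with the get interface from slide 6.

```python
import hashlib

def block_id(contents: bytes) -> str:
    """Chord ID of an unsigned block: H(block contents)."""
    return hashlib.sha1(contents).hexdigest()

def fetch_and_verify(dht, key: str) -> bytes:
    """Fetch an unsigned block and confirm it matches its content-hash name."""
    block = dht.get(key)
    if block_id(block) != key:
        raise ValueError("block failed self-authentication")
    return block
```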
13. DHT distributes blocks by hashing IDs
[Diagram: blocks 732, 705, 407, 901, and 992 spread over nodes A-D on the Internet according to their hashed IDs.]
- DHT replicates blocks for fault tolerance
- DHT caches popular blocks for load balance
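A sketch of the placement rule the diagram illustrates, under the usual Chord convention (assumed here): node addresses and block IDs are hashed into one circular ID space, and a block lives at its successor, the first node at or after it on the ring. The tiny 8-bit ring is for illustration; Chord uses 160 bits.

```python
import hashlib

M = 8                                   # toy ring of 2**M positions

def ring_id(name: bytes) -> int:
    return int.from_bytes(hashlib.sha1(name).digest(), "big") % (2 ** M)

def successor(node_ids: list[int], key_id: int) -> int:
    """First node at or after key_id, wrapping around the ring."""
    for n in sorted(node_ids):
        if n >= key_id:
            return n
    return min(node_ids)                # wrap around to the lowest ID

nodes = [ring_id(a) for a in (b"nodeA", b"nodeB", b"nodeC", b"nodeD")]
print(successor(nodes, ring_id(b"block-732")))   # node that stores the block
```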
17. DHT implementation challenges
- Scalable lookup
- Balance load (flash crowds)
- Handling failures
- Coping with systems in flux
- Network-awareness for performance
- Robustness with untrusted participants
- Programming abstraction
- Heterogeneity
- Anonymity
- Indexing
- Goal: simple, provably good algorithms
18. Challenge 1: the lookup problem
[Diagram: the publisher calls put(key=SHA-1(data), value=data) and a client calls get(key=SHA-1(data)); which of nodes N1-N6 should hold the key?]
- Get() is a lookup followed by a check
- Put() is a lookup followed by a store
19. Centralized lookup (Napster)
[Diagram: the publisher at N4 registers SetLoc(title, N4) with a central DB; a client asks the DB Lookup(title), then fetches key=title, value=file data from N4.]
Simple, but O(N) state and a single point of failure
20. Flooded queries (Gnutella)
[Diagram: the client's Lookup(title) floods across nodes N1-N9 until it reaches the publisher at N4, which holds key=title, value=MP3 data.]
Robust, but worst-case O(N) messages per lookup
21. Algorithms based on routing
- Map keys to nodes in a load-balanced way
- Hash keys and nodes into a string of digits
- Assign key to closest node
- Forward a lookup for a key to a closer node
Examples: CAN, Chord, Kademlia, Pastry, Tapestry, Viceroy, …
22. Chord's routing table: fingers
[Diagram: node N80's fingers point 1/2, 1/4, 1/8, 1/16, 1/32, 1/64, and 1/128 of the way around the ring.]
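A sketch of the finger table the diagram shows: node n keeps a pointer to the successor of (n + 2^i) mod 2^M for each i, so its fingers cover 1/2, 1/4, 1/8, ... of the ring. This reuses M and successor from the sketch on slide 13.

```python
def finger_table(n: int, node_ids: list[int]) -> list[int]:
    """Finger i of node n points at successor((n + 2**i) mod 2**M)."""
    return [successor(node_ids, (n + 2 ** i) % (2 ** M)) for i in range(M)]
```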
23. Lookups take O(log N) hops
[Diagram: Lookup(K19) issued at N80 hops around the ring of nodes N5-N110, roughly halving the remaining distance each hop, until it reaches K19's successor N20.]
- Lookup: route to the closest predecessor (sketched below)
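A sketch of the lookup loop, as hedged pseudocode rather than Chord's exact RPC protocol: each hop forwards to the finger closest to the key without passing it, which roughly halves the remaining distance and gives the O(log N) bound. fingers_of is a hypothetical map from node ID to finger table; M is reused from slide 13.

```python
def distance(a: int, b: int) -> int:
    """Clockwise distance from a to b on the ring."""
    return (b - a) % (2 ** M)

def lookup(start: int, key_id: int, fingers_of: dict[int, list[int]]) -> int:
    """Return the closest predecessor of key_id; the key lives at its successor."""
    node = start
    while True:
        # Fingers between the current node and the key (never passing it).
        hops = [f for f in fingers_of[node]
                if f != node and distance(node, f) <= distance(node, key_id)]
        if not hops:
            return node
        node = max(hops, key=lambda f: distance(node, f))   # closest to the key
```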
24. Can we do better?
- Caching
- Exploit flexibility at the geometry level
- Iterative vs. recursive lookups
25. Challenge 2: balance load
[Diagram: Lookup(K19) from N80; copies of K19 are cached at nodes along the lookup path to its home node N20.]
- Hash function balances keys over nodes
- For popular keys, cache along the path
26. Why caching works well
- Only O(log N) nodes have fingers pointing to N20
- This limits the single-block load on N20
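A sketch of the path caching described on slide 25: nodes visited by a lookup keep a copy of a popular block, so later lookups for the same key are served before they reach N20. CacheNode and fetch_from_home are illustrative assumptions, not CFS's actual types.

```python
from dataclasses import dataclass, field

@dataclass
class CacheNode:
    cache: dict = field(default_factory=dict)   # per-node block cache

def get_with_path_cache(path: list, key: str, fetch_from_home) -> bytes:
    """`path` lists the nodes a lookup visits, in order."""
    for node in path:
        if key in node.cache:
            return node.cache[key]      # hit before reaching the home node
    value = fetch_from_home(key)        # only cache misses reach N20
    for node in path:
        node.cache[key] = value         # populate caches along the path
    return value
```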
27. Challenge 3: handling failures with redundancy
[Diagram: a ring of nodes N5-N110; each key is stored on a run of consecutive successor nodes.]
- Each node knows IP addresses of next r nodes
- Each key is replicated at next r nodes
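A sketch of the replica placement just described, reusing successor from slide 13: a key is stored at its successor and at the next r - 1 nodes after it, the same nodes that appear in the successor list. The default r = 3 is an arbitrary choice for illustration.

```python
def replica_set(node_ids: list[int], key_id: int, r: int = 3) -> list[int]:
    """The key's successor plus the next r - 1 nodes on the ring."""
    ordered = sorted(node_ids)
    start = ordered.index(successor(node_ids, key_id))
    return [ordered[(start + i) % len(ordered)] for i in range(r)]
```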
28. Lookups find replicas
[Diagram: Lookup(BlockID=17) from N68. RPCs: 1. lookup step; 2. get successor list; 3. failed block fetch; 4. block fetch from the next replica.]
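A sketch of the failover the four RPC steps describe: after the lookup and successor-list steps identify the replicas, the client tries each in order, falling back when a fetch fails. fetch_block stands in for the block-fetch RPC and is assumed to raise ConnectionError on failure; replica_set is reused from slide 27.

```python
def get_block(key_id: int, node_ids: list[int], fetch_block) -> bytes:
    for node in replica_set(node_ids, key_id):  # successor first, then replicas
        try:
            return fetch_block(node, key_id)    # step 4: block fetch
        except ConnectionError:                 # step 3: failed fetch; try next
            continue
    raise LookupError("all replicas unavailable")
```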
29. First live successor manages replicas
[Diagram: block 17 is held by its first live successor, with a copy of 17 at the following node on the ring.]
- A node can locally determine that it is the first live successor
30. Challenge 4: systems in flux
- Lookup takes O(log N) hops
- If the system is stable
- But the system is never stable!
- What we desire are theorems of the type:
- In the almost-ideal state, lookups take O(log N) hops
- The system maintains the almost-ideal state as nodes join and fail
31. Half-life [Liben-Nowell 2002]
[Diagram: a system of N nodes; N new nodes join while N/2 old nodes leave.]
- Doubling time: time for N nodes to join
- Halving time: time for N/2 old nodes to fail
- Half-life: min(doubling time, halving time)
32. Applying the half-life
- For any node u in any P2P network:
- If u wishes to stay connected with high probability,
- then, on average, u must be notified about Ω(log N) new nodes per half-life
- And so on, …
33. Challenge 5: optimize routing to reduce latency
[Diagram: nodes N20, N40, N41, and N80, adjacent on the ring but far apart in the Internet.]
- Nodes close on the ring may be far apart in the Internet
- Goal: put nodes in the routing table that give few hops and low latency
34. The "close" metric impacts the choice of nearby nodes
[Diagram: nodes N06, N60, and N105 in the USA, N103 in Europe, and key K104 in the Far East; N32 also on the ring.]
- Chord's numerical closeness and (original) routing table restrict choice
- Should new nodes be able to choose their own IDs?
- Other geometries allow more choice (e.g., prefix-based, XOR)
35. Challenge 6: malicious participants
- Attacker denies service
- Floods the DHT with data
- Attacker returns incorrect data (detectable)
- Self-authenticating data
- Attacker denies that data exists (a liveness attack)
- Bad node is responsible for the data, but says no
- Bad node supplies incorrect routing info
- Bad nodes make a bad ring, and a good node joins it
Basic approach: use redundancy
36. Sybil attack [Douceur 02]
[Diagram: attacker-created identities interleaved among honest nodes N5-N110 on the ring.]
- Attacker creates multiple identities
- Attacker controls enough nodes to foil the redundancy
- Need a way to control creation of node IDs
37. One solution: secure node IDs
- Every node has a public key
- A certificate authority signs the public keys of good nodes
- Every node signs and verifies messages
- Quotas per publisher
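A minimal sketch of the per-message signing step, assuming the third-party pyca/cryptography package and Ed25519 keys (the slide does not specify a scheme); the certificate-authority step that vouches for each public key is elided.

```python
from cryptography.hazmat.primitives.asymmetric import ed25519

private_key = ed25519.Ed25519PrivateKey.generate()
public_key = private_key.public_key()

message = b"example DHT message"           # illustrative payload only
signature = private_key.sign(message)      # sender signs every message
public_key.verify(signature, message)      # receiver verifies; raises
                                           # InvalidSignature if tampered
```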
38. Another solution: exploit practical Byzantine protocols
[Diagram: a pre-configured core set of servers placed among the ring nodes N06, N32, N60, N103, and N105.]
- A core set of servers is pre-configured with keys and performs admission control [OceanStore]
- The servers achieve consensus with a practical Byzantine recovery protocol [Castro and Liskov 99 and 00]
- The servers serialize updates [OceanStore] or assign secure node IDs [Configuration service]
39. A more decentralized solution: weak secure node IDs
- ID = SHA-1(IP address, node)
- Assumption: the attacker controls a limited number of IP addresses
- Before using a node, challenge it to verify its ID
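A sketch of the scheme above: a node's ID is derived from its IP address (plus a virtual-node index, an assumption here), so an attacker can only mint as many IDs as it has addresses, and a claimed ID can be re-derived and checked against the address actually being used.

```python
import hashlib

def weak_node_id(ip: str, vnode: int) -> str:
    """ID derived from the IP address; the exact encoding is assumed."""
    return hashlib.sha1(f"{ip}:{vnode}".encode()).hexdigest()

def verify_claimed_id(observed_ip: str, vnode: int, claimed_id: str) -> bool:
    """Challenge: does the claimed ID match the address we actually talk to?"""
    return weak_node_id(observed_ip, vnode) == claimed_id
```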
40. Using weak secure node IDs
- Detect malicious nodes
- Define verifiable system properties
- Each node has a successor
- Data is stored at its successor
- Allow querier to observe lookup progress
- Each hop should bring the query closer
- Cross check routing tables with random queries
- Recovery: assume a limited number of bad nodes
- Quota per node ID
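A sketch of checking lookup progress, one of the verifiable properties above: a querier that observes each hop can insist that every hop moves strictly closer to the key in ID space, and flag the hop that fails to. This reuses distance and M from the slide 23 sketch.

```python
def check_lookup_progress(hops: list[int], key_id: int) -> bool:
    """`hops` is the node-ID sequence a lookup visited, in order."""
    for prev, cur in zip(hops, hops[1:]):
        if distance(cur, key_id) >= distance(prev, key_id):
            return False                # this hop failed to make progress
    return True
```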
41. Summary