Self Healing Wide Area Network Services - PowerPoint PPT Presentation

About This Presentation

Title:

Self Healing Wide Area Network Services

Description:

In case a crash is detected - try and restart. No central monitoring station involved. ... Remote (re)start attempted after Hello timeout. ... – PowerPoint PPT presentation

Number of Views:23

Avg rating:3.0/5.0

Slides: 24

Provided by: Bhav5

Learn more at: https://cseweb.ucsd.edu

Category:

more less

Transcript and Presenter's Notes

Title: Self Healing Wide Area Network Services

1
Self Healing Wide Area Network Services

Bhavjit S Walha
Ganesh Venkatesh

2
Layout

Introduction
Previous Work
Issues
Solution
Preliminary results
Problems Future Extensions
Conclusion

3
Motivation

Companies may have servers distributed over a
wide area network
Akamai Content Distribution Network.
Distributed web-servers
Manual monitoring may not be feasible
Centralized control may lead to problems in
case of a network partition
Typical server applications
May crash due software bugs
Little state is retained
Simple restart is thus sufficient

4
Motivation

What if peers monitored each others health?
In case a crash is detected - try and restart.
No central monitoring station involved.
Loosely based on a worm
Resilient to sporadic failures
Spreads to uninfected nodes
But
No backdoor involved
May not always shift to new nodes

Introduction
Previous Work
Issues
Solution
Preliminary results
Problems Future Extensions
Conclusion

6
Medusa

All nodes a part of a Multicast Group
Each node is thus in touch with all other nodes
through Heatbeat messages.
Nodes send regular updates to the multicast tree
All communication through reliable multicast
In case a node goes down
Other nodes try to restart it
Request for service sent to multicast group

7
Medusa Problems

Scalability
Assumptions of reliable packet delivery
State information shared with all nodes.
Reliable Multicast
Assumes reliable delivery of packets to all nodes
No explicit ACKs
The kill operations fail in case of a temporary
break in Multicast tree.
Security
No way of authenticating packets

Introduction
Previous Work
Issues
Solution
Preliminary results
Problems Future Extensions
Conclusion

9
Proposed solution

Nodes form peering relationships with only a
subset of other nodes.
Exchange Hello packets
Scalable as the degree is fixed
No central control
No dependence on reliable multicast
Distributed communication protocol
Explicit ACKs for packets
Some super-nodes required to be up when booted
Power of Randomly-connected graphs graphs

10
Design

Each node continually sends Hello Packets to its
peer nodes.
Indicates everything is up and working
A timeout indicates something is wrong
Application crash
Network Partition
Aim at application crashes
Application should be stateless
No code transfer
Remotely restartable
SSH needed A login account and distributed keys.

11
Initialization

3-5 super-nodes form a fully-connected connected
graph.
Are expected to be up all the time
All nodes have information about their IPs
May be under manual supervision
May have information about the topology
Responsible for forwarding join requests to other
nodes

12
Remote start

SSH to a remote node to restart
Remote (re)start attempted after Hello timeout.
Current implementation requires keys to be
distributed beforehand
Starts a small watchdog program which immediately
returns
Checks if there is a another copy already running
Current implementation uses ps
In case the application start fails, do nothing
wait for retry to restart
Possible extension allow the service to spread

13
New node comes up

Waits for others to contact it
After timeout
Send JoinRequest to a super-node with the number
of peers needed.
Supernode forwards this request to other nodes
AddRequest
Some node may ask new node to become its peer
Add to neighbourList and send AddACK
Hello
Can add to neighbourList if unsolicited Hello
received
Beneficial in case of a short temporary failures
After Request-timeout
Contact another super-node with another
JoinRequest.
Timeout can be dynamically specified in
JoinRequestACK.

14
New node comes upRandom Walk.

Request forwarded by super-node to 3 random nodes
on behalf of new node
Each node forwards it to others
Decrease hop count by 1 each time
If hop count 0, check if it can support more
nodes
YES!
Send AddRequest to new node
Add to neighbourList on receiving AddACK.
NO!
Ignore the request
New node may already have found neighbours
Due to duplicate joinRequest or repair of Network
partition
New node thus replies to AddRequest with Die
packet.

15
Shutdown

Critical to ensure that all nodes go down
3-way protocol
Send kill to target node
Target node replies with die
Send dieACK to target node.
kill
used when multiple copies detected
Possibly to balance load
die
Reply to unsolicited Hello
No perfect solution in case of a network partition

16
Global Shutdown

Secret killAll packet
Sent by an external program for complete system
shutdown
Forwarded to all neighbours
Node does not die until it receives a killACK
from everyone
Stops sending hellos immediately
No further restart attempts
Reply only to die, kill and killAll
May send unnecessary traffic
Eventually time out on seeing zero neighbours.

17
Performance

Tested on 6 nodes in GradLab
Hello interval 5s
Hello timeout 22s
Wait before joinRequest 10s
joinRequest timeout 20s
Hop count 2
Initial degree request 3
Super-nodes 3
Preliminary tests on PlanetLab

18
Results

LAN
No timeouts or packet losses observed
No duplicate copies
killAll works perfectly
Re-start latency 22s
Decreases after a number of restarts
Join latency 15s
PlanetLab
Re-start latency 27s
Join latency 21s

Introduction
Previous Work
Issues
Solution
Preliminary results
Problems and Future Extensions
Conclusion

20
Limitations

Security
The packets are not authenticated
Stray copies
After a killAll there may be stray copies
Harmless as they do not try to spread
But prevents another copy from running
No new nodes
Node discovery
Why should they be idle in first place?
What to do when the original nodes come back up?
Solution
Send regular updates to super-nodes
Extra servers can be killed easily

21
Parameter tweaking

Hop count for Random Walk
Connectivity
Min-degree to ensure connectivity
Max-degree to spread the failure probability
Timeouts
Request timeout
Depends on hop-count
Hello timeout
Different for WAN LAN
Global timeout
In case of network partition
Loss of Kill ACK packets

22
Conclusion

Maintaining High Availability does not always
require central control
Achieving a global shutdown is problematic
Need to explore connectivity requirements to
ensure a connected graph at all times.

23
Thank You !

Write a Comment

User Comments (0)

About PowerShow.com

Recommended Relevance Latest Highest Rated Most Viewed

Sort by:

Related More from user

CrystalGraphics Presentations

Introducing-PowerShowcom PowerPoint PPT Presentation

Introducing-PowerShowcom - Introducing-PowerShowcom (Without Music)

CrystalGraphics 3D Character Slides for PowerPoint PowerPoint PPT Presentation

CrystalGraphics 3D Character Slides for PowerPoint - CrystalGraphics 3D Character Slides for PowerPoint

Chart and Diagram Slides for PowerPoint PowerPoint PPT Presentation

Chart and Diagram Slides for PowerPoint - Beautifully designed chart and diagram s for PowerPoint with visually stunning graphics and animation effects. Our new CrystalGraphics Chart and Diagram Slides for PowerPoint is a collection of over 1000 impressively designed data-driven chart and editable diagram s guaranteed to impress any audience. They are all artistically enhanced with visually stunning color, shadow and lighting effects. Many of them are also animated. And they’re ready for you to use in your PowerPoint presentations the moment you need them. – PowerPoint PPT presentation

Related Presentations

Jini Overview and Specification PowerPoint PPT Presentation

Jini Overview and Specification - ... data, but entire objects and code. Provides Simplicity ... hierarchy of lookup servers (like DNS), but no actual wide area' JINI network has been deployed. ... | PowerPoint PPT presentation | free to view

Kansas Healthy Start Home Visitor Training PowerPoint PPT Presentation

Kansas Healthy Start Home Visitor Training - Assessment of linkages between social network factors and child maltreatment risk. ... Call to schedule a time for your visit. ... | PowerPoint PPT presentation | free to view

LONG Tom Peters PowerPoint PPT Presentation

LONG Tom Peters - ... Guesswork: From heart surgery to prostate care, the health industry knows little ... Caring Through Massage. 7. Healing Arts: Nutrition for the Soul ... | PowerPoint PPT presentation | free to view

Business Continuity and Disaster Recovery PowerPoint PPT Presentation

Business Continuity and Disaster Recovery - Network Security and Disaster Recovery Planning. Sun ... Cover in Recovery Section. Prevention. Data. Architecture Developments. Storage Area Networks ... | PowerPoint PPT presentation | free to view

GLOBAL RING NETWORK FOR ADVANCED APPLICATIONS DEVELOPMENT RussiaChinaUSA Science PowerPoint PPT Presentation

GLOBAL RING NETWORK FOR ADVANCED APPLICATIONS DEVELOPMENT RussiaChinaUSA Science - Russia-China-USA Science & Education Network. April 14, 2003 ... Russia-China-USA Science & Education Network. Organizing Trip. with Chinese Academy of Sciences ... | PowerPoint PPT presentation | free to view

Establishment of E-Health Network for Disasters and Healthcare Improvement: Integrated Medical Information Technology System PowerPoint PPT Presentation

Establishment of E-Health Network for Disasters and Healthcare Improvement: Integrated Medical Information Technology System - Dayton, OH. Installation of Stentor iSite COTS ... Like its civilian counterparts, the Air Force is interested in implementing ... | PowerPoint PPT presentation | free to view

FUTURE PLANS PowerPoint PPT Presentation

FUTURE PLANS - Undergraduate admission. Registrarial services. Regulatory and academic community support ... Closes the pre-admission advising gap (welcome centre) ... | PowerPoint PPT presentation | free to view

The INFOSEC Research Council PowerPoint PPT Presentation

The INFOSEC Research Council - Malicious Code. Studies Proposed. Self Healing Networks. Technology Transfer. Network Study ... of foreign and mobile code. Controlled sharing of sensitive ... | PowerPoint PPT presentation | free to view

IT Infrastructure and Emerging Technologies PowerPoint PPT Presentation

IT Infrastructure and Emerging Technologies - ... systems (both hardware and software) have become so complex that the cost of ... maintained by a world-wide network of programmers and designers under the ... | PowerPoint PPT presentation | free to view

Unit 1: Introduction to LANs Network Design, Case Analysis PowerPoint PPT Presentation

Unit 1: Introduction to LANs Network Design, Case Analysis - Opportunity for students to gain some practical experience in the subject area ... Corporate Structure, Geographical Structure, Staffing, Policies and Politics ... | PowerPoint PPT presentation | free to view

Bridges and Barriers to Mental Health Services for Asylum Seekers and Refugees PowerPoint PPT Presentation

Bridges and Barriers to Mental Health Services for Asylum Seekers and Refugees - HARP conducted a DH funded study to identify the bridges and barriers into ... here are also rituals, exorcisms, and drumming to help.' Participant from DRC. ... | PowerPoint PPT presentation | free to view

Building Network-Centric Systems Liviu Iftode PowerPoint PPT Presentation

Building Network-Centric Systems Liviu Iftode - Occasional TCP/IP networking with low expectations and mostly non-interactive traffic ... A software offloading architecture using existing hardware ... | PowerPoint PPT presentation | free to view

Chapter 5 LOCAL AREA NETWORK CONCEPTS AND ARCHITECTURES PowerPoint PPT Presentation

Chapter 5 LOCAL AREA NETWORK CONCEPTS AND ARCHITECTURES - ... process, each successive layer of the OSI model removes headers &/or trailers ... Between house and local exchange (subscriber loop) Within buildings ... | PowerPoint PPT presentation | free to view

California Network of Mental Health Clients PowerPoint PPT Presentation

California Network of Mental Health Clients - Jay Mahler, Bay Area. Joyce Ott-Havenner, Far North. Linford Gayle, Bay Area ... A world where programs and services are always voluntary and without conditions, ... | PowerPoint PPT presentation | free to view

Mesh Networking Alf Young, Motorola Inc' PowerPoint PPT Presentation

Mesh Networking Alf Young, Motorola Inc' - Wide Area Coverage for Public and Private Networks. Fixed and mobile wireless access, using unlicensed and ... Kissimmee, Florida. X MotoMesh Duo - MEA device ... | PowerPoint PPT presentation | free to view

Presented by: Prof Mark Baker PowerPoint PPT Presentation

Presented by: Prof Mark Baker - The idea of an 'intergalactic computer network' was introduced in the sixties by ... Microsoft Azure. Users provision entire infrastructure: ... | PowerPoint PPT presentation | free to view

83180 Wireless LANs 9'3' 2005 LRWPAN LowRate Wireless Personal Area Networks PowerPoint PPT Presentation

83180 Wireless LANs 9'3' 2005 LRWPAN LowRate Wireless Personal Area Networks - Reliable transfer w/ low data rate (20-250 kb/s) Low power consumption (battery life 1 month) ... are home automation, toys, games, PC peripherals. 2) ... | PowerPoint PPT presentation | free to view

Research%20Challenges%20for%20Military%20Networking PowerPoint PPT Presentation

Research%20Challenges%20for%20Military%20Networking - Background on military networking challenges. ARL CTA program. DARPA AJCN program ... Configure/reconfigure the network into more homogeneous routing domains ... | PowerPoint PPT presentation | free to view

Data Mining using Fractals and Power laws PowerPoint PPT Presentation

Data Mining using Fractals and Power laws - self-managing storage. infrastructure. a storage brick (0.5 5 TB) ~1 PB ' ... Self-* Storage (Ganger ) C. Faloutsos. 11. School of Computer Science. Carnegie Mellon ... | PowerPoint PPT presentation | free to view

Habib Youssef, Ph'D PowerPoint PPT Presentation

Habib Youssef, Ph'D - We live in the 'Information Era', where pervasive access to information is ... Such services will be provided by satellite onboard processing (OBP) systems ... | PowerPoint PPT presentation | free to view

Bob Herbst - (Smart Meter, Thermostat, LM and CPP) Home Area Network. Wide Area Network (WAN) ... Developing load shapes for customer classes. X. X. Voltage monitoring. X. X ... | PowerPoint PPT presentation | free to view

Modern Services of Data Network Part I Communication PowerPoint PPT Presentation

Modern Services of Data Network Part I Communication - Formatted: Dionne Miller, Silver fox ... Part I Communication Presented by: Dr. Mohsen Kahani Ferdowsi University of Mashhad | PowerPoint PPT presentation | free to view

Mobile Communications Chapter 8: Network Protocols/Mobile IP PowerPoint PPT Presentation

Mobile Communications Chapter 8: Network Protocols/Mobile IP - Mobile Communications Chapter 8: Network Protocols/Mobile IP Motivation Data transfer , Encapsulation Security, IPv6, Problems Micro mobility support | PowerPoint PPT presentation | free to view

Optical Fiber Communications PowerPoint PPT Presentation

Optical Fiber Communications - Optical Fiber Communications Optical Networks 15 16 17 18 19 20 22 WDM Multi-hop Architecture Four node broadcast and select multihop network Each node transmits at ... | PowerPoint PPT presentation | free to view

Janusz Dobrowolski SG11 WP1/Q1/Q2 PowerPoint PPT Presentation

Janusz Dobrowolski SG11 WP1/Q1/Q2 - ... networks Self-healing Continuous real time monitoring of every session and connection Application agnostic Billing and charging ... Self-healing Continuous real ... | PowerPoint PPT presentation | free to view

Choose from vast variety of plans Internet, tv and phone services plans. PowerPoint PPT Presentation

Choose from vast variety of plans Internet, tv and phone services plans. - Logic is Grand Cayman's biggest Residential TV, Business TV and Internet service provider company with over 190+ TV channels & internet speeds up to 50MB & beyond. For More Information: www.logic.ky | PowerPoint PPT presentation | free to view

Logic Gives Premium Wholesale Solutions in Internet and Telecommunication in and Around Cayman PowerPoint PPT Presentation

Logic Gives Premium Wholesale Solutions in Internet and Telecommunication in and Around Cayman - Logic is Grand Cayman's biggest Residential TV, Business TV and Internet service Provider Company with over 190+ TV channels & internet speeds up to 50MB & beyond. Logic is the one-stop shop for all of your communications and information technology needs. From residential service to small local businesses, to global enterprises Logic can help. Logic’s Global Connect Wide Area Network (WAN) services deliver a dedicated, predictable, secure, and private connections to your network. The Logic portfolio of Global Connect WAN products provides high quality, high performance and cost effective options for your clients in Cayman and the Global Offshore Islands. Visit Site for detail: https://www.logic.ky | PowerPoint PPT presentation | free to view