Course Overview Principles of Operating Systems - PowerPoint PPT Presentation

About This Presentation

Title:

Course Overview Principles of Operating Systems

Description:

the next step after connecting computers via networks are distributed systems ... dynamic approach (rendezvous arranged via matchmaker port) Distributed Systems 18 ... – PowerPoint PPT presentation

Number of Views:30

Avg rating:3.0/5.0

Slides: 50

Provided by: franzjk

Learn more at: http://users.csc.calpoly.edu

Category:

more less

Transcript and Presenter's Notes

Title: Course Overview Principles of Operating Systems

1
Course OverviewPrinciples of Operating Systems

Introduction
Computer System Structures
Operating System Structures
Processes
Process Synchronization
Deadlocks
CPU Scheduling

Memory Management
Virtual Memory
File Management
Security
Networking
Distributed Systems
Case Studies
Conclusions

2
Chapter Overview Distributed Systems

Motivation
Objectives
Distributed System Structures
Network Operating Systems
Distributed OS
Remote Services
Distributed Processing
Task Distribution
Process Migration
Distributed File Systems
Naming and Transparency
Remote File Access
Stateful/Stateless Service
File Replication

Distributed Communication
Message Passing
Remote Procedure Call
Shared Memory
Distributed Coordination
Event Ordering
Mutual Exclusion
Atomicity
Distributed Deadlock
Important Concepts and Terms
Chapter Summary

3
Motivation

the next step after connecting computers via
networks are distributed systems
resources are transparently available throughout
the system
users dont need to be aware of the system
structure
environment independent of the local machine
the location and execution of processes is
distributed
load balancing
process migration

4
Objectives

be aware of benefits and potential problems of
distributed systems
understand location independence and location
transparency
understand mechanisms for distributing data and
computation
know extensions of previous concepts to
distributed systems

5
Characteristics of Modern Operating Systems

(review OS Structures chapter)
microkernel architecture
multithreading
symmetric multiprocessing
distributed operating systems
object-oriented design

6
Distributed Operating Systems

all resources within the distributed system are
available to all processes if they have the right
permissions
users and processes dont need to be aware of the
exact location of resources
the execution of tasks can be distributed over
several nodes
transparent to the task, user and programmer
there is one single file system encompassing all
files on all nodes
transparent to the user

7
Distributed Systems Diagram
Distributed OS
Users and User Programs
Applications
Server Processes
File System
Operating System
Hardware
David Jones
8
Distributed System Structures

Network Operating Systems
Distributed Operating Systems
Remote Services

9
Network Operating Systems

users are aware of the individual machines in the
network
resources are accessible via login or explicit
transfer of data
remote login, ftp, etc.

10
Distributed Operating Systems

users are unaware of the underlying machines and
networks
in practice, users still have knowledge about
particular machines
remote resources are accessible in the same way
as local resources
practical limitations
physical access (printer, removable media)
movement of data and processes is under the
control of the distributed OS

11
Distributed Processing

Task Distribution
Data Migration
Computation Migration
Process Migration

12
Task Distribution

allocation of tasks to nodes
static at compile or load time
dynamical at run time
separation of a task into subtasks
usually by or with the help of the programmer

13
Data Migration

movement of data within a distributed system
transfer of whole files
whenever access to a file is requested, the whole
file is transferred to the node
all future accesses in that session are local
transfer of necessary parts
only those parts of a file that are actually
needed are transferred
similar to demand paging
used in many modern systems
Sun NFS, Andrews, MS SMB protocol

14
Computation Migration

the computation (task, process, thread) is moved
to the location of the data
large quantities of data
time to transfer data vs. time to execute remote
commands
remote procedure call
a predefined procedure is invoked on a remote
system
message passing
a message with a request for an action is sent to
the remote system
the remote system creates a process to execute
the requested task, and returns a message with
the result

15
Process Migration

extension to computation migration
a process may be executed at a site different
from the one where it was initiated
reasons for process migration
load balancing
computation speedup
specialized hardware
specialized software
access to data
OS decides the allocation of processes
practical limitations (specialized hardware and
software)

16
Remote Services

(see the chapter on processes)
remote procedure calls (RPC)
threads

17
Remote Procedure Calls

usually implemented on top of a message-based
communication scheme
far less reliable than local procedure calls
precautions must be taken for failures
binding problems
local systems can integrate the procedure call
into the executable
this is not possible for remote calls
fixed port number at compile time
dynamic approach (rendezvous arranged via
matchmaker port)

18
Threads

often used in combination with remote procedure
calls
threads execute RPCs on the receiving system
lower overhead than full processes
a server spawns a new thread for incoming
requests
all threads can continue concurrently
threads dont block each other
example Distributed Computing Environment (DCE)

19
Distributed Computing Environment (DCE)

threads package for standardizing network
functionality and protocols
system calls for various purposes
thread management
synchronization
condition variables
scheduling
interoperability
available for most Unix systems, Windows NT

20
Distributed File Systems

Naming and Transparency
Remote File Access
Stateful/Stateless Service
File Replication
Example Sun NFS

21
Distributed File System

multi-user file system where files may reside on
various nodes in a distributed system
transfer time over the network as additional
delay
at 10 MBit/s, the transfer of a 1 MByte file will
take about 1 second (under good conditions)

22
Naming and Transparency

naming
mapping between logical file names as seen by the
user, and the physical location of the blocks
that constitute a file
location transparency
the actual location of the file does not have to
be known by the user
a file may reside on a local system or on a
central file server
files may be cached or replicated for performance
reasons
location independence
the file may be moved, but its name doesnt have
to be changed

23
Naming Schemes

specification of host and path
hostlocal-path/file-name
not location transparent nor location independent
mounting of remote directories
remote directories can be attached to local
directories
can become cumbersome to maintain
location transparent, but not location
independent
example Sun NFS
global name space
total integration of the individual file systems
location transparent, location independent
example Andrews, Sprite, Locus

24
Remote File Access

remote service
on top of a remote procedure call mechanism
extension of system calls
frequently caching is used to improve performance

25
Stateful/Stateless Service

stateful file service
connection between client and server is
maintained for the duration of a session
the server has information on the status of the
client
frequently better performance
stateless file service
each request is self-contained
the server keeps no information on the status or
previous activities of the client
less complex

26
File Replication

several copies of files are kept on different
machines
performance
better access times
redundancy
loss or corruption of a file is not a big problem
consistency
different instances must be kept identical

27
Sun NFS

widely used distributed file system
interconnected workstations are viewed as
independent nodes with independent file systems
files can be shared between any pair of nodes
not restricted to servers
implemented by mounting directories into local
file systems
the mounted directory looks like a part of the
local file system
remote procedure calls enable remote file
operations

28
NFS Diagram
Client
Server
file system calls
VFS Interface
Unix file system
other file systems
NFS client
Request
Communication
RPC Mech.
Response
Operating System
Hardware Platform
29
Communication in Distributed Systems

Message Passing
Remote Procedure Call
Shared Memory
impractical for distributed systems

30
Message Passing

a request for a service is sent from the local
system to the remote system
the request is sent in the form of a message
the receiving process accepts the message and
performs the desired service
the result is also returned as a message
reliability
acknowledgments may be used to indicate the
receipt f a message
synchronous (blocking) or asynchronous
(non-blocking)

31
Remote Procedure Call

often built on top of message passing
frequently implemented as synchronous calls
parameter passing
call by value is much easier to implement than
call by reference
parameter representation
translation between programming languages or
operating systems may be required

32
RPC Diagram
Client Application
Server Application
local calls
local calls
Application Logic (Client Side)
Application Logic (Client Side)
Local Stub
Local Stub
Request
Communication
Communication
RPC Mech.
RPC Mech.
Response
Operating System
Operating System
Hardware Platform
Hardware Platform
33
Distributed Coordination

Event Ordering
Mutual Exclusion
Atomicity
Distributed Deadlock

34
Distributed Coordination

synchronization of processes across distributed
systems
no common memory
no common clock
extension of methods discussed in the chapters on
process synchronization and deadlocks
not all methods can be extended easily

35
Event Ordering

straightforward in a single system
it is always possible to determine if on event
happens before, at the same time, or after
another event
often expressed by the happened-before relation
defines a total ordering of the events
timestamps can be used in distributed systems to
determine a global ordering of events

36
Event Ordering Diagram
A
A
A
Time
Message
Event
Process
37
Mutual Exclusion

critical sections which may be used by at most
one process at a time
processes are distributed over several nodes
approaches
centralized
fully distributed
token passing

38
Centralized Approach

one process coordinates the entry to the critical
section
processes wishing to enter the critical section
send a request message to the coordinator
one process gets permission through a reply
message from the coordinator, enters the critical
section, and sends a release message to the
coordinator
coordinator is critical
if it fails, a new coordinator must be determined

39
Fully Distributed Approach

far more complicated than the centralized
approach
based on event ordering with timestamps
a process that wants to enter its critical
section sends a request (including the timestamp)
to all other processes
it waits until it receives a reply message from
all processes before entering the critical
section
if a process is in its critical section, it wont
send a reply message until it has left the
critical section

40
Token Passing

processes are arranged in a logical ring
one single token is passed around the distributed
system
the holder of the token is allowed to enter the
critical section
precautions must be taken for lost tokens

41
Distributed Atomicity

atomic transaction
a set of operations that is either fully
executed, or not at all
in a distributed system, the operations grouped
into one atomic transaction may be executed on
different nodes/sites
transaction coordinator
local coordinator guarantees atomicity at one
site
two-phase commit (2PC) protocol

42
Two-Phase Commit Protocol

makes sure that all sites involved either commit
to a common transaction, or abort
Phase 1
after the execution of the transaction, the
transaction manager at the initiating site
queries all the others if they are willing to
commit their portions of the transaction
Phase 2
if all answer positively within a given time, the
transaction is committed otherwise it must be
aborted
the outcome is reported to all sites, and they
finalize the commit or abort

43
Failure Handling in 2PC

participating site
the affected site must either redo, undo, or
contact the coordinator about the fate of the
transaction
coordinator
if a participating site has a commit (abort) on
record, the transaction must be committed
(aborted)
it may be impossible to determine if and what
kind of decision has been made, and the sites
must wait for the coordinator to recover
network
for one link, it is similar to the failure of a
site
for several links, the network may be partitioned
if coordinator and all participating sites are in
the same partition, the protocol can continue
otherwise it is similar to site failure

44
Distributed Deadlock

extensions of the methods and algorithms
discussed in the chapter on deadlocks
deadlock prevention and avoidance
resource ordering
bankers algorithm
prioritized preemption
deadlock detection
(simple case one instance per resource type)
centralized
fully distributed

45
Deadlock Prevention

resource-ordering
can be enhanced by defining a global ordering on
the resources in the distributed system
distributed bankers algorithm
one process performs the role of the banker
all requests for resources must go through the
banker
the banker can become the bottleneck
prioritized preemption
each process has a unique priority number
cycles in the resource allocation graph are
prevented by preempting processes with lower
priorities

46
Centralized Deadlock Detection

each site maintains a local wait-for graph
it must be shown that the union of all the graphs
contains no cycle
one coordinator maintains the unified graph
time delays may lead to false cycles
can be avoided by using time stamps

47
Distributed Deadlock Detection

partial graphs are maintained at every site
if a deadlock exists, it will lead to a cycle in
at least one of the partial graphs
based on local wait-for graphs enhanced by a node
for external processes
an arc to that node exists if a process waits for
an external item
a cycle involving that external node indicates
the possibility of a deadlock
can be verified by a distributed deadlock
detection algorithm involving message exchanges
with affected sites

48
Important Concepts and Terms

asynchronous
atomic transactions
client/server model
communication
coordination
distributed deadlock
distributed file system
distributed operating system
event ordering
kernel
location independence
location transparency

message passing
microkernel
mutual exclusion
naming
network file system
network operating system
processes
remote procedure call
resources
server, services
synchronous
tasks

49
Chapter Summary

distributed systems extend the functionality of
computers connected through networks
location independence and location transparency
are important aspects of distributed systems
distribution of data and computation can achieve
better resource utilization and performance
many aspects of distributed systems are more
complex than for local systems

Write a Comment

User Comments (0)