Course Overview Principles of Operating Systems - PowerPoint PPT Presentation

About This Presentation
Title:

Course Overview Principles of Operating Systems

Description:

the next step after connecting computers via networks are distributed systems ... dynamic approach (rendezvous arranged via matchmaker port) Distributed Systems 18 ... – PowerPoint PPT presentation

Number of Views:30
Avg rating:3.0/5.0
Slides: 50
Provided by: franzjk
Category:

less

Transcript and Presenter's Notes

Title: Course Overview Principles of Operating Systems


1
Course OverviewPrinciples of Operating Systems
  • Introduction
  • Computer System Structures
  • Operating System Structures
  • Processes
  • Process Synchronization
  • Deadlocks
  • CPU Scheduling
  • Memory Management
  • Virtual Memory
  • File Management
  • Security
  • Networking
  • Distributed Systems
  • Case Studies
  • Conclusions

2
Chapter Overview Distributed Systems
  • Motivation
  • Objectives
  • Distributed System Structures
  • Network Operating Systems
  • Distributed OS
  • Remote Services
  • Distributed Processing
  • Task Distribution
  • Process Migration
  • Distributed File Systems
  • Naming and Transparency
  • Remote File Access
  • Stateful/Stateless Service
  • File Replication
  • Distributed Communication
  • Message Passing
  • Remote Procedure Call
  • Shared Memory
  • Distributed Coordination
  • Event Ordering
  • Mutual Exclusion
  • Atomicity
  • Distributed Deadlock
  • Important Concepts and Terms
  • Chapter Summary

3
Motivation
  • the next step after connecting computers via
    networks are distributed systems
  • resources are transparently available throughout
    the system
  • users dont need to be aware of the system
    structure
  • environment independent of the local machine
  • the location and execution of processes is
    distributed
  • load balancing
  • process migration

4
Objectives
  • be aware of benefits and potential problems of
    distributed systems
  • understand location independence and location
    transparency
  • understand mechanisms for distributing data and
    computation
  • know extensions of previous concepts to
    distributed systems

5
Characteristics of Modern Operating Systems
  • (review OS Structures chapter)
  • microkernel architecture
  • multithreading
  • symmetric multiprocessing
  • distributed operating systems
  • object-oriented design

6
Distributed Operating Systems
  • all resources within the distributed system are
    available to all processes if they have the right
    permissions
  • users and processes dont need to be aware of the
    exact location of resources
  • the execution of tasks can be distributed over
    several nodes
  • transparent to the task, user and programmer
  • there is one single file system encompassing all
    files on all nodes
  • transparent to the user

7
Distributed Systems Diagram
Distributed OS
Users and User Programs
Applications
Server Processes
File System
Operating System
Hardware
David Jones
8
Distributed System Structures
  • Network Operating Systems
  • Distributed Operating Systems
  • Remote Services

9
Network Operating Systems
  • users are aware of the individual machines in the
    network
  • resources are accessible via login or explicit
    transfer of data
  • remote login, ftp, etc.

10
Distributed Operating Systems
  • users are unaware of the underlying machines and
    networks
  • in practice, users still have knowledge about
    particular machines
  • remote resources are accessible in the same way
    as local resources
  • practical limitations
  • physical access (printer, removable media)
  • movement of data and processes is under the
    control of the distributed OS

11
Distributed Processing
  • Task Distribution
  • Data Migration
  • Computation Migration
  • Process Migration

12
Task Distribution
  • allocation of tasks to nodes
  • static at compile or load time
  • dynamical at run time
  • separation of a task into subtasks
  • usually by or with the help of the programmer

13
Data Migration
  • movement of data within a distributed system
  • transfer of whole files
  • whenever access to a file is requested, the whole
    file is transferred to the node
  • all future accesses in that session are local
  • transfer of necessary parts
  • only those parts of a file that are actually
    needed are transferred
  • similar to demand paging
  • used in many modern systems
  • Sun NFS, Andrews, MS SMB protocol

14
Computation Migration
  • the computation (task, process, thread) is moved
    to the location of the data
  • large quantities of data
  • time to transfer data vs. time to execute remote
    commands
  • remote procedure call
  • a predefined procedure is invoked on a remote
    system
  • message passing
  • a message with a request for an action is sent to
    the remote system
  • the remote system creates a process to execute
    the requested task, and returns a message with
    the result

15
Process Migration
  • extension to computation migration
  • a process may be executed at a site different
    from the one where it was initiated
  • reasons for process migration
  • load balancing
  • computation speedup
  • specialized hardware
  • specialized software
  • access to data
  • OS decides the allocation of processes
  • practical limitations (specialized hardware and
    software)

16
Remote Services
  • (see the chapter on processes)
  • remote procedure calls (RPC)
  • threads

17
Remote Procedure Calls
  • usually implemented on top of a message-based
    communication scheme
  • far less reliable than local procedure calls
  • precautions must be taken for failures
  • binding problems
  • local systems can integrate the procedure call
    into the executable
  • this is not possible for remote calls
  • fixed port number at compile time
  • dynamic approach (rendezvous arranged via
    matchmaker port)

18
Threads
  • often used in combination with remote procedure
    calls
  • threads execute RPCs on the receiving system
  • lower overhead than full processes
  • a server spawns a new thread for incoming
    requests
  • all threads can continue concurrently
  • threads dont block each other
  • example Distributed Computing Environment (DCE)

19
Distributed Computing Environment (DCE)
  • threads package for standardizing network
    functionality and protocols
  • system calls for various purposes
  • thread management
  • synchronization
  • condition variables
  • scheduling
  • interoperability
  • available for most Unix systems, Windows NT

20
Distributed File Systems
  • Naming and Transparency
  • Remote File Access
  • Stateful/Stateless Service
  • File Replication
  • Example Sun NFS

21
Distributed File System
  • multi-user file system where files may reside on
    various nodes in a distributed system
  • transfer time over the network as additional
    delay
  • at 10 MBit/s, the transfer of a 1 MByte file will
    take about 1 second (under good conditions)

22
Naming and Transparency
  • naming
  • mapping between logical file names as seen by the
    user, and the physical location of the blocks
    that constitute a file
  • location transparency
  • the actual location of the file does not have to
    be known by the user
  • a file may reside on a local system or on a
    central file server
  • files may be cached or replicated for performance
    reasons
  • location independence
  • the file may be moved, but its name doesnt have
    to be changed

23
Naming Schemes
  • specification of host and path
  • hostlocal-path/file-name
  • not location transparent nor location independent
  • mounting of remote directories
  • remote directories can be attached to local
    directories
  • can become cumbersome to maintain
  • location transparent, but not location
    independent
  • example Sun NFS
  • global name space
  • total integration of the individual file systems
  • location transparent, location independent
  • example Andrews, Sprite, Locus

24
Remote File Access
  • remote service
  • on top of a remote procedure call mechanism
  • extension of system calls
  • frequently caching is used to improve performance

25
Stateful/Stateless Service
  • stateful file service
  • connection between client and server is
    maintained for the duration of a session
  • the server has information on the status of the
    client
  • frequently better performance
  • stateless file service
  • each request is self-contained
  • the server keeps no information on the status or
    previous activities of the client
  • less complex

26
File Replication
  • several copies of files are kept on different
    machines
  • performance
  • better access times
  • redundancy
  • loss or corruption of a file is not a big problem
  • consistency
  • different instances must be kept identical

27
Sun NFS
  • widely used distributed file system
  • interconnected workstations are viewed as
    independent nodes with independent file systems
  • files can be shared between any pair of nodes
  • not restricted to servers
  • implemented by mounting directories into local
    file systems
  • the mounted directory looks like a part of the
    local file system
  • remote procedure calls enable remote file
    operations

28
NFS Diagram
Client
Server
file system calls
VFS Interface
Unix file system
other file systems
NFS client
Request
Communication
RPC Mech.
Response
Operating System
Hardware Platform
29
Communication in Distributed Systems
  • Message Passing
  • Remote Procedure Call
  • Shared Memory
  • impractical for distributed systems

30
Message Passing
  • a request for a service is sent from the local
    system to the remote system
  • the request is sent in the form of a message
  • the receiving process accepts the message and
    performs the desired service
  • the result is also returned as a message
  • reliability
  • acknowledgments may be used to indicate the
    receipt f a message
  • synchronous (blocking) or asynchronous
    (non-blocking)

31
Remote Procedure Call
  • often built on top of message passing
  • frequently implemented as synchronous calls
  • parameter passing
  • call by value is much easier to implement than
    call by reference
  • parameter representation
  • translation between programming languages or
    operating systems may be required

32
RPC Diagram
Client Application
Server Application
local calls
local calls
Application Logic (Client Side)
Application Logic (Client Side)
Local Stub
Local Stub
Request
Communication
Communication
RPC Mech.
RPC Mech.
Response
Operating System
Operating System
Hardware Platform
Hardware Platform
33
Distributed Coordination
  • Event Ordering
  • Mutual Exclusion
  • Atomicity
  • Distributed Deadlock

34
Distributed Coordination
  • synchronization of processes across distributed
    systems
  • no common memory
  • no common clock
  • extension of methods discussed in the chapters on
    process synchronization and deadlocks
  • not all methods can be extended easily

35
Event Ordering
  • straightforward in a single system
  • it is always possible to determine if on event
    happens before, at the same time, or after
    another event
  • often expressed by the happened-before relation
  • defines a total ordering of the events
  • timestamps can be used in distributed systems to
    determine a global ordering of events

36
Event Ordering Diagram
A
A
A
Time
Message
Event
Process
37
Mutual Exclusion
  • critical sections which may be used by at most
    one process at a time
  • processes are distributed over several nodes
  • approaches
  • centralized
  • fully distributed
  • token passing

38
Centralized Approach
  • one process coordinates the entry to the critical
    section
  • processes wishing to enter the critical section
    send a request message to the coordinator
  • one process gets permission through a reply
    message from the coordinator, enters the critical
    section, and sends a release message to the
    coordinator
  • coordinator is critical
  • if it fails, a new coordinator must be determined

39
Fully Distributed Approach
  • far more complicated than the centralized
    approach
  • based on event ordering with timestamps
  • a process that wants to enter its critical
    section sends a request (including the timestamp)
    to all other processes
  • it waits until it receives a reply message from
    all processes before entering the critical
    section
  • if a process is in its critical section, it wont
    send a reply message until it has left the
    critical section

40
Token Passing
  • processes are arranged in a logical ring
  • one single token is passed around the distributed
    system
  • the holder of the token is allowed to enter the
    critical section
  • precautions must be taken for lost tokens

41
Distributed Atomicity
  • atomic transaction
  • a set of operations that is either fully
    executed, or not at all
  • in a distributed system, the operations grouped
    into one atomic transaction may be executed on
    different nodes/sites
  • transaction coordinator
  • local coordinator guarantees atomicity at one
    site
  • two-phase commit (2PC) protocol

42
Two-Phase Commit Protocol
  • makes sure that all sites involved either commit
    to a common transaction, or abort
  • Phase 1
  • after the execution of the transaction, the
    transaction manager at the initiating site
    queries all the others if they are willing to
    commit their portions of the transaction
  • Phase 2
  • if all answer positively within a given time, the
    transaction is committed otherwise it must be
    aborted
  • the outcome is reported to all sites, and they
    finalize the commit or abort

43
Failure Handling in 2PC
  • participating site
  • the affected site must either redo, undo, or
    contact the coordinator about the fate of the
    transaction
  • coordinator
  • if a participating site has a commit (abort) on
    record, the transaction must be committed
    (aborted)
  • it may be impossible to determine if and what
    kind of decision has been made, and the sites
    must wait for the coordinator to recover
  • network
  • for one link, it is similar to the failure of a
    site
  • for several links, the network may be partitioned
  • if coordinator and all participating sites are in
    the same partition, the protocol can continue
  • otherwise it is similar to site failure

44
Distributed Deadlock
  • extensions of the methods and algorithms
    discussed in the chapter on deadlocks
  • deadlock prevention and avoidance
  • resource ordering
  • bankers algorithm
  • prioritized preemption
  • deadlock detection
  • (simple case one instance per resource type)
  • centralized
  • fully distributed

45
Deadlock Prevention
  • resource-ordering
  • can be enhanced by defining a global ordering on
    the resources in the distributed system
  • distributed bankers algorithm
  • one process performs the role of the banker
  • all requests for resources must go through the
    banker
  • the banker can become the bottleneck
  • prioritized preemption
  • each process has a unique priority number
  • cycles in the resource allocation graph are
    prevented by preempting processes with lower
    priorities

46
Centralized Deadlock Detection
  • each site maintains a local wait-for graph
  • it must be shown that the union of all the graphs
    contains no cycle
  • one coordinator maintains the unified graph
  • time delays may lead to false cycles
  • can be avoided by using time stamps

47
Distributed Deadlock Detection
  • partial graphs are maintained at every site
  • if a deadlock exists, it will lead to a cycle in
    at least one of the partial graphs
  • based on local wait-for graphs enhanced by a node
    for external processes
  • an arc to that node exists if a process waits for
    an external item
  • a cycle involving that external node indicates
    the possibility of a deadlock
  • can be verified by a distributed deadlock
    detection algorithm involving message exchanges
    with affected sites

48
Important Concepts and Terms
  • asynchronous
  • atomic transactions
  • client/server model
  • communication
  • coordination
  • distributed deadlock
  • distributed file system
  • distributed operating system
  • event ordering
  • kernel
  • location independence
  • location transparency
  • message passing
  • microkernel
  • mutual exclusion
  • naming
  • network file system
  • network operating system
  • processes
  • remote procedure call
  • resources
  • server, services
  • synchronous
  • tasks

49
Chapter Summary
  • distributed systems extend the functionality of
    computers connected through networks
  • location independence and location transparency
    are important aspects of distributed systems
  • distribution of data and computation can achieve
    better resource utilization and performance
  • many aspects of distributed systems are more
    complex than for local systems
Write a Comment
User Comments (0)
About PowerShow.com