Module 2.0: Processes and Threads - PowerPoint PPT Presentation

About This Presentation
Title:

Module 2.0: Processes and Threads

Description:

Assembly language modules are assembled into machine language. ... passing messages and data between threads. scheduling thread execution ... – PowerPoint PPT presentation

Number of Views:99
Avg rating:3.0/5.0
Slides: 44
Provided by: marily237
Category:

less

Transcript and Presenter's Notes

Title: Module 2.0: Processes and Threads


1
Module 2.0 Processes and Threads
  • Process Concept
  • Trace of Processes
  • Process Context
  • Context Switching
  • Threads
  • ULT
  • KLT

2
Process
  • Also called a task.
  • Useful and Important Concept Process program
    in execution
  • A process is not the same as a program. Program
    is a passive entity, whereas process is active.
    Process consists of an executable program,
    associated data, and execution context.
  • Modern (multiprogramming) operating systems are
    structured around the concept of a process.
  • Multiprogramming OS supports execution of many
    concurrent processes. OS issues tend to revolve
    around management of processes
  • How are processes created/destroyed?
  • How to manage resource requirements of a process
    during its execution cpu time, memory, I/O,
    communication, ... ?
  • How to avoid interference between processes?
  • How to achieve cooperation and communication
    between processes?

3
Program Creation
  • Program (say, C program) is edited
  • It is compiled into assembly language, which may
    consist of several modules.
  • Assembly language modules are assembled into
    machine language.
  • External references (i.e., to procedures and data
    in another module) are resolved. This is called
    linking, which creates a load module.
  • Load or image module is stored as a file in file
    system and may be executed at a later time by
    loading into memory to be executed.

4
Process creation and termination
  • Consider a simple disk operating system (like
    MS-DOS, typically supports only one process at a
    time)
  • User types command like run foo at Keyboard
    (I/O device driver for keyboard, screen)
  • Command is parsed by command shell
  • Executable program file (load module) foo is
    located on disk (file system, I/O device driver
    for disk)
  • Contents are loaded into memory and control
    transferred gt process comes alive! (device
    driver for disk, relocating loader, memory
    management)
  • During execution, process may call OS to perform
    I/O console, disk, printer, etc. (system call
    interface, I/O device drivers)
  • When process terminates, memory is reclaimed.
    (memory management)

5
Two Types
  • Processes can be described as either
  • I/O-bound process
  • spends more time doing I/O than computations
  • many short CPU bursts
  • Long I/O burst
  • Ex vi
  • CPU-bound process
  • spends more time doing computations
  • Heavy number crunching
  • few very long CPU bursts
  • Ex simulation

6
Trace of Processes
7
Trace of processes (cont.)
8
Trace of processes (cont.)
9
(No Transcript)
10
Example
If parent chooses to wait until the child
executes (but not always the case).
11
Multiprogramming/Timesharing Systems
  • They provide interleaved execution of several
    processes to give an illusion of many
    simultaneously executing processes.
  • Computers can be a single-processor or
    multi-processor machine.
  • The OS must keep track of the state for each
    active process and make sure that the correct
    information is properly installed when a process
    is given control of the CPU.

12
Multiprogramming (multiple processes)
  • For each process, the O.S. maintains a data
    structure, called the process control block
    (PCB). The PCB provides a way of accessing all
    information relevant to a process
  • This data is either contained directly in the
    PCB, or else the PCB contains pointers to other
    system tables.
  • Processes (PCBs) are manipulated by two main
    components of the process subsystem in order to
    achieve the effects of multiprogramming
  • Scheduler determines the order by which
    processes will gain access to the CPU. Efficiency
    and fair-play are issues here.
  • Dispatcher actually allocates CPU to process
    next in line as determined by the scheduler.

13
Process Context
  • The context (or image) of a process can be
    described by
  • contents of main memory
  • contents of CPU registers
  • other info (open files, I/O in progress, etc.)
  • Main memory -- three logically distinct regions
    of memory
  • code region contains executable code (typically
    read-only)
  • data region storage area for dynamically
    allocated data structure, e.g., lists, trees
    (typically heap data structure)
  • stack region run-time stack of activation
    records
  • CPU registers general registers, PC, SP, PSW,
    segmentation registers
  • Other info
  • open files table, status of ongoing I/O
  • process status (running, ready, blocked), user
    id, ...

14
The Process Control Block (PCB)
  • The PCB typically contains the following types of
    information
  • Process status (or state) new, ready to run,
    user running, kernel running, waiting, halted
  • Program counter where in program the process is
    executing
  • CPU registers contents of general-purpose
    register stack pointer, PSW, index registers
  • Memory Management info segment base and limit
    registers, page table, location of pages on disk,
    process size
  • User ID, Group ID, Process ID, Parent PID, ...
  • Event Descriptor when process is in the sleep
    or waiting state
  • Scheduling info process priority, size of CPU
    quantum, length of current CPU burst

15
PCB (cont.)
  • List of pending signals
  • Accounting info process execution time, resource
    utilization
  • Real and Effective User IDs determine various
    privileges allowed the process such as file
    access rights
  • More timers record time process has spent
    executing in user and Kernel mode
  • Array indicating how process wishes to react to
    signals
  • System call info arguments, return value, error
    field for current system call
  • Pending I/O operation info amount of data to
    transfer, addr in user memory, file offset, ...
  • Current directory and root file system
    environment of process
  • Open file table records files process has open

16
Process States Transitions
Zooming in
  • Running
  • User-running
  • Kernel-running
  • Ready
  • Ready, suspend
  • Ready
  • Waiting (or blocked)
  • Blocked
  • Blocked, suspend

Suspend may swap out all or part of the process.
Shared regions/segments are not suspended.
17
How queues are implemented?
18
When to context switch
  • Typically, hardware automatically saves the user
    PC and PSW when interrupt occurs, and takes new
    PC from interrupt vector.
  • Interrupt handler may simply perform its function
    and then return to the same process that was
    interrupted (restoring the PC and PSW from the
    stack).
  • Alternatively, may no longer be appropriate to
    resume execution of process that was running
    because
  • process has used up its current CPU quantum
  • process has requested I/O and must wait for
    results
  • process has asked to be suspended (sleep) for
    some amount of time
  • a signal or error requires process be destroyed
    (killed)
  • a higher priority process should be given the
    CPU
  • E.g., pressing ctrl-alt-delete
  • In such a situation, a context switch is
    performed to install appropriate info for running
    a new process.

19
Mechanics of a Context Switch
  • copy contents of CPU registers (general-purpose,
    SP, PC, PSW, etc.) into a save area in the PCB of
    running process
  • change status of running process from running
    to waiting (or ready)
  • change a system variable running-process to point
    to the PCB of new process to run
  • copy info from register save area in PCB of new
    process into CPU registers
  • Note
  • context switching is pure overhead and should be
    done as fast as possible
  • often hardware-assisted - special instructions
    for steps 1 and 4
  • determining new process to run accomplished by
    consulting scheduler queues
  • step 4 will start execution of new process -
    known as dispatching.

20
MULTIPROGRAMMING Through CONTEXT SWITCHING
21
Process Creation
  • Parent process creates children processes, which,
    in turn create other processes, forming a tree of
    processes
  • Resource sharing models/types
  • Parent and children share all resources
  • Children share subset of parents resources
  • Parent and child share no resources
  • Execution
  • Parent and children execute concurrently
  • Parent waits until children terminate
  • UNIX examples
  • fork system call creates new process
  • exec system call used after a fork to replace the
    process memory space with a new program

22
C Program Forking Separate Process
  • int main()
  • Pid_t pid
  • / fork another process /
  • pid fork()
  • if (pid lt 0) / error occurred /
  • fprintf(stderr, "Fork Failed")
  • exit(-1)
  • else if (pid 0) / child process /
  • execlp("/bin/ls", "ls", NULL)
  • else / parent process /
  • / parent will wait for the child to complete
    /
  • wait (NULL)
  • printf ("Child Complete")
  • exit(0)

23
A tree of processes on a typical Solaris
24
Introduction to Threads
  • Parallelism at different levels
  • Granularity of parallelism at processor level
  • Coarse-grain processes
  • Fine-grain -- threads
  • Multitasking OS can do more than one thing
    concurrently by running more than a single
    process
  • Processes can do several things concurrently be
    running more than a single thread.
  • Each thread is a different stream of control that
    can execute its instructions independently.
  • A program (e.g. Browser) may consist of the
    following threads
  • GUI thread
  • I/O thread
  • computation

25
Processes and Threads
  • A typical process consists of
  • a running program
  • a bundle of resources (file descriptor table,
    address space)
  • A thread, called a lightweight process, has its
    own
  • stack
  • CPU Registers
  • state
  • All the other resources are shared by all threads
    of that process. These include
  • open files
  • virtual address space (code and data segments).
  • child processes

26
Processes vs. Threads
27
Single and Multithreaded Processes
28
Single Threaded and Multithreaded Process Models
  • Thread Control Block contains a register image,
    thread priority and thread state information.

29
Benefits of Threads vs Processes
  • Takes less time to create a new thread than a
    process
  • Less time to terminate a thread than a process
  • Less time to switch between two threads within
    the same process
  • Since threads within the same process share
    memory and files, they can communicate with each
    other without invoking the kernel. However, it
    is necessary to synchronize the activities of
    various threads so that they do not obtain
    inconsistent views of the data.

30
Example I Web Browser
31
Example II Web Server
32
Threads States
  • Three key states running, ready, blocked
  • Generally, it does not make sense to have a
    suspend state for threads.
  • Because all threads within the same process share
    the same address space
  • Indeed suspending (ie swapping) a single thread
    involves suspending all threads of the same
    process
  • Termination of a process, terminates all threads
    within the process

33
User-Level Threads (ULT)
  • The kernel is not aware of the existence of
    threads
  • All thread management is done by the application
    by using a thread library
  • Thread switching does not require kernel mode
    privileges (no mode switch)
  • Scheduling is application specific

34
Threads library
  • Contains code for
  • creating and destroying threads
  • passing messages and data between threads
  • scheduling thread execution
  • saving and restoring thread contexts
  • Three primary thread libraries
  • POSIX Pthreads. The P stands for POSIX and run
    on unix, linux, and MS Windows.
  • Cthreads
  • Win32 threads
  • Java threads

35
Kernel activity for ULTs
  • The kernel is not aware of thread activity but it
    is still managing process activity
  • When a thread makes a system call, the whole
    process will be blocked
  • but for the thread library that thread is still
    in the running state
  • So thread states are independent of process states

36
Advantages and inconveniences of ULT
  • Advantages
  • Thread switching does not involve the kernel no
    mode switching
  • thread_yield()
  • Strong sharing of data with little blocking
  • No need for shared memory system calls
  • Excel sheets share a lot other than files
  • Scheduling can be application specific choose
    the best algorithm.
  • Run a garbage collection thread at convenient
    points
  • ULTs can run on any OS. Only needs a thread
    library
  • Portable
  • Inconveniences
  • Most system calls are blocking and the kernel
    blocks processes. So all threads within the
    process will be unable to run
  • The kernel can only assign processes to
    processors. Two threads within the same process
    cannot run simultaneously on two processors

For theads that run for too long (1 sec),
preemption is done using signals or alarms (e.g.,
ualarm). However this requires a lot more
overhead in switching. Signal delivery by kernel
to process is very complex. Kernel checks for
signal at termination of phase interrupts, if one
is pending save context of process, K-U to
handle signal, U-K to restore context of
process, K-U to resume process.
37
Improving blocking with ULT -- Advanced
  • Use nonblocking I/O system calls
  • Returns quickly without need to complete the full
    I/O operation
  • Use asynchronous I/O system calls
  • Setup a callback function and returns quick
  • When I/O is completed a function is called (part
    of signal handling)
  • Identify blocking system calls, and place a
    jacket or wrapper around them
  • Needs to modify API or system call library
  • If we know it will block, defer the thread and
    let other threads run first

38
Kernel-Level Threads (KLT)
  • All thread management is done by kernel
  • No thread library but an API (I.e. system calls)
    to the kernel thread facility
  • Kernel maintains context information for the
    process and the threads
  • Switching between threads requires the kernel
  • Scheduling on a thread basis
  • Examples
  • Windows XP/2000
  • Solaris
  • Linux
  • Tru64 UNIX
  • Mac OS X

39
Kernel Multithreading Models
  • Many-to-One
  • One-to-One
  • Many-to-Many

40
Advantages and inconveniences of KLT
  • Advantages
  • the kernel can simultaneously schedule many
    threads of the same process on many processors
  • blocking is done on a thread level
  • kernel routines can be multithreaded
  • Inconveniences
  • thread switching within the same process involves
    the kernel. We have 2 mode switches per thread
    switch user to kernel and kernel to user.
  • this results in a significant slow down due to
  • Interrupt overhead due to mode switch
  • Updates to TCB info
  • Cache pollution and flushing to Process tables
    and page tables

41
Combined ULT/KLT Approaches
  • Thread creation done in the user space
  • Bulk of scheduling and synchronization of threads
    done in the user space
  • The programmer may adjust the number of KLTs
  • May combine the best of both approaches
  • Examples
  • Solaris prior to version 9
  • Windows NT/2000 with the ThreadFiber package

42
Solaris versatility
  • We can use ULTs when logical parallelism does not
    need to be supported by hardware parallelism (we
    save mode switching)
  • Ex Multiple windows but only one is active at
    any one time
  • Excel sheet (sheet1, sheet2, etc)
  • Power point
  • Word processor
  • It is wise to have 2 KLTs under this situation.
    So if one window is blocked when making a system
    call, use the other KLT to run the other selected
    window). If the windows are doing a lot of
    blocking, use more KLTs.
  • Reason is efficiency
  • ULTs can be created, blocked, destroyed, without
    involving the kernel
  • Efficiency in terms of memory and data structure
    allocated in kernel space
  • Minimizing cache pollution and flushing
  • If threads may block then we can specify two or
    more LWPs (or KLTs) to avoid blocking the whole
    application

43
Further Readings
  • What is the difference between RPC and RMI?
  • What is meant by marshalling parameters?
  • What is the idea behind a thread pool?
  • What is hyperthreading?
  • Answer this
  • If you have CPU bound application, when does it
    make sense to use ULTs for them as opposed to
    KLTs?
  • Example is a parallel array computation where you
    divide the rows of its arrays among different
    threads
  • Answer
  • use ULTs to minimize switching with uniprocessor
  • Use KLTs for more concurrency with SMP
Write a Comment
User Comments (0)
About PowerShow.com