Title: Chapter 9: Virtual Memory
1Chapter 9 Virtual Memory
2Chapter 9 Virtual Memory
- Background
- Demand Paging
- Process Creation
- Page Replacement
- Allocation of Frames
- Thrashing
- Demand Segmentation
- Operating System Examples
3Background
- Virtual memory separation of user logical
memory from physical memory. - Only part of the program needs to be in memory
for execution. - Logical address space can therefore be much
larger than physical address space. - Allows address spaces to be shared by several
processes. - Allows for more efficient process creation.
- Virtual memory can be implemented via
- Demand paging
- Demand segmentation
4Virtual Memory That is Larger Than Physical Memory
?
5Virtual-address Space
6Shared Library Using Virtual Memory
7Demand Paging
- Bring a page into memory only when it is needed
- Less I/O needed
- Less memory needed
- Faster response
- More users
- Page is needed ? reference to it
- invalid reference ? abort
- not-in-memory ? bring to memory
8Transfer of a Paged Memory to Contiguous Disk
Space
9Valid-Invalid Bit
- With each page table entry a validinvalid bit is
associated(1 ? in-memory, 0 ? not-in-memory) - Initially validinvalid bit is set to 0 on all
entries - Example of a page table snapshot
- During address translation, if validinvalid bit
in page table entry is 0 ? page fault
Frame
valid-invalid bit
1
1
1
1
0
?
0
0
page table
10Page Table When Some Pages Are Not in Main Memory
11Page Fault
- If there is ever a reference to a page, first
reference will trap to OS ? page fault - OS looks at another table to decide
- Invalid reference ? abort.
- Just not in memory.
- Get empty frame.
- Swap page into frame.
- Reset tables, validation bit 1.
- Restart instruction Least Recently Used
- block move
- auto increment/decrement location
12Steps in Handling a Page Fault
13What happens if there is no free frame?
- Page replacement find some page in memory, but
not really in use, swap it out - algorithm
- performance want an algorithm which will result
in minimum number of page faults - Same page may be brought into memory several times
14Performance of Demand Paging
- Page Fault Rate 0 ? p ? 1.0
- if p 0, no page faults
- if p 1, every reference is a fault
- Effective Access Time (EAT)
- EAT (1 p) x memory access
- p (page fault overhead
- swap page out
- swap page in
- restart overhead)
15Demand Paging Example
- Memory access time 1 microsecond
- 50 of the time the page that is being replaced
has been modified and therefore needs to be
swapped out - Swap Page Time 10 msec 10,000 usec
- EAT (1 p) x 1 p (15000)
- 1 15000P (in usec)
16Process Creation
- Virtual memory allows other benefits during
process creation - - Copy-on-Write
- - Memory-Mapped Files (later)
17Copy-on-Write
- Copy-on-Write (COW) allows both parent and child
processes to initially share the same pages in
memoryIf either process modifies a shared page,
only then is the page copied - COW allows more efficient process creation as
only modified pages are copied - Free pages are allocated from a pool of
zeroed-out pages
18Page Replacement
- Prevent over-allocation of memory by modifying
page-fault service routine to include page
replacement - Use modify (dirty) bit to reduce overhead of page
transfers only modified pages are written to
disk - Page replacement completes separation between
logical memory and physical memory large
virtual memory can be provided on a smaller
physical memory
19Need For Page Replacement
20Basic Page Replacement
- Find the location of the desired page on disk
- Find a free frame - If there is a free frame,
use it - If there is no free frame, use a page
replacement algorithm to select a victim frame - Read the desired page into the (newly) free
frame. Update the page and frame tables. - Restart the process
21Page Replacement
22Page Replacement Algorithms
- Want lowest page-fault rate
- Evaluate algorithm by running it on a particular
string of memory references (reference string)
and computing the number of page faults on that
string - In all our examples, the reference string is
- 1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5
23Graph of Page Faults Versus The Number of Frames
24First-In-First-Out (FIFO) Algorithm
- Reference string 1, 2, 3, 4, 1, 2, 5, 1, 2, 3,
4, 5 - 3 frames (3 pages can be in memory at a time per
process) -
- 4 frames
-
- FIFO Replacement Beladys Anomaly
- more frames ? more page faults
1
1
4
5
2
2
1
3
9 page faults
3
3
2
4
1
1
5
4
2
2
1
10 page faults
5
3
3
2
4
4
3
25FIFO Page Replacement
26FIFO Illustrating Beladys Anomaly
Beladys anomaly
27Optimal Algorithm
- Replace page that will not be used for longest
period of time - 4 frames example
- 1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5
- How do you know this?
- Used for measuring how well your algorithm
performs
1
4
2
6 page faults
3
4
5
28Optimal Page Replacement
29Least Recently Used (LRU) Algorithm
- Reference string 1, 2, 3, 4, 1, 2, 5, 1, 2, 3,
4, 5 - Counter implementation
- Every page entry has a counter every time page
is referenced through this entry, copy the clock
into the counter - When a page needs to be changed, look at the
counters to determine which are to change
1
5
2
3
4
5
4
3
30LRU Page Replacement
31LRU Algorithm (Cont.)
- Stack implementation keep a stack of page
numbers in a double link form - Page referenced
- move it to the top
- requires 6 pointers to be changed
- When a page needs to be replaced, replace the
page at bottom of stack - No search for replacement
32Use Of A Stack to Record The Most Recent Page
References
33LRU Approximation Algorithms
- Reference bit
- With each page associate a bit, initially 0
- When page is referenced bit set to 1
- Replace the one which is 0 (if one exists). We
do not know the order, however. - Second chance
- Need reference bit
- Clock replacement
- If page to be replaced (in clock order) has
reference bit 1 then - set reference bit 0
- leave page in memory
- replace next page (in clock order), subject to
same rules
34Second-Chance (clock) Page-Replacement Algorithm
35Counting Algorithms
- Keep a counter of the number of references that
have been made to each page - LFU Algorithm replaces page with smallest
count - MFU Algorithm based on the argument that the
page with the smallest count was probably just
brought in and has yet to be used
36Allocation of Frames
- Each process needs minimum number of pages
- Example IBM 370 6 pages to handle SS MOVE
instruction - instruction is 6 bytes, might span 2 pages
- 2 pages to handle from
- 2 pages to handle to
- Problem complicated if computer architecture
allows multiple levels of indirection - Two major allocation schemes
- fixed allocation
- priority allocation
37Fixed Allocation
- Equal allocation For example, if there are 100
frames and 5 processes, give each process 20
frames. - Proportional allocation Allocate according to
the size of process
38Priority Allocation
- Use a proportional allocation scheme using
priorities rather than size - If process Pi generates a page fault,
- select for replacement one of its frames
- select for replacement a frame from a process
with lower priority number
39Global vs. Local Allocation
- Global replacement process selects a
replacement frame from the set of all frames one
process can take a frame from another - Local replacement each process selects from
only its own set of allocated frames
40Thrashing
- If a process does not have enough pages, the
page-fault rate is very high. This leads to - low CPU utilization
- operating system thinks that it needs to increase
the degree of multiprogramming - another process added to the system
- Thrashing ? a process is busy swapping pages in
and out
41Thrashing (Cont.)
42Demand Paging and Thrashing
- Why does demand paging work?Locality model
- Process migrates from one locality to another
- Localities may overlap
- Why does thrashing occur?? size of locality gt
total memory size
43Locality In A Memory-Reference Pattern
44Working-Set Model
- ? ? working-set window ? a fixed number of page
references Example 10,000 instructions - WSSi (working set size of Process Pi) total
number of pages referenced in the most recent ?
(varies in time) - if ? too small will not encompass entire locality
- if ? too large will encompass several localities
- if ? ? ? will encompass entire program
- D ? WSSi ? total demand frames
- if D gt m ? Thrashing
- Policy if D gt m, then suspend one of the
processes
45Working-set model
46Keeping Track of the Working Set
- Approximate with interval timer a reference bit
- Example ? 10,000
- Timer interrupts after every 5000 time units
- Keep in memory 2 bits for each page
- Whenever a timer interrupts copy and set the
values of all reference bits to 0 - If one of the bits in memory 1 ? page in
working set - Why is this not completely accurate?
- Improvement 10 bits and interrupt every 1000
time units
47Page-Fault Frequency Scheme
- Establish acceptable page-fault rate
- If actual rate too low, process loses frame
- If actual rate too high, process gains frame
48Memory-Mapped Files
- Memory-mapped file I/O allows file I/O to be
treated as routine memory access by mapping a
disk block to a page in memory - A file is initially read using demand paging. A
page-sized portion of the file is read from the
file system into a physical page. Subsequent
reads/writes to/from the file are treated as
ordinary memory accesses. - Simplifies file access by treating file I/O
through memory rather than read() write() system
calls - Also allows several processes to map the same
file allowing the pages in memory to be shared
49Memory Mapped Files
50Memory-Mapped Files in Java
- import java.io.
- import java.nio.
- import java.nio.channels.
- public class MemoryMapReadOnly
-
- // Assume the page size is 4 KB
- public static final int PAGE SIZE 4096
- public static void main(String args) throws
IOException - RandomAccessFile inFile new
RandomAccessFile(args0,"r") - FileChannel in inFile.getChannel()
- MappedByteBuffer mappedBuffer
- in.map(FileChannel.MapMode.READ ONLY, 0,
in.size()) - long numPages in.size() / (long)PAGE SIZE
- if (in.size() PAGE SIZE gt 0)
- numPages
51Memory-Mapped Files in Java (cont)
- // we will "touch" the first byte of every page
- int position 0
- for (long i 0 i lt numPages i)
- byte item mappedBuffer.get(position)
- position PAGE SIZE
-
- in.close()
- inFile.close()
-
-
- The API for the map() method is as follows
- map(mode, position, size)
52Memory-Mapped Shared Memory in Windows
53Allocating Kernel Memory
- Treated differently from user memory
- Often allocated from a free-memory pool
- Kernel requests memory for structures of varying
sizes - Some kernel memory needs to be contiguous
54Buddy System
- Allocates memory from fixed-size segment
consisting of physically-contiguous pages - Memory allocated using power-of-2 allocator
- Satisfies requests in units sized as power of 2
- Request rounded up to next highest power of 2
- When smaller allocation needed than is available,
current chunk split into two buddies of
next-lower power of 2 - Continue until appropriate sized chunk available
55Buddy System Allocator
56Slab Allocator
- Alternate strategy
- Slab is one or more physically contiguous pages
- Cache consists of one or more slabs
- Single cache for each unique kernel data
structure - Each cache filled with objects instantiations
of the data structure - When cache created, filled with objects marked as
free - When structures stored, objects marked as used
- If slab is full of used objects, next object
allocated from empty slab - If no empty slabs, new slab allocated
- Benefits include no fragmentation, fast memory
request satisfaction
57Slab Allocation
58Other Issues Prepaging
- Prepaging
- To reduce the large number of page faults that
occurs at process startup - Prepage all or some of the pages a process will
need, before they are referenced - But if prepaged pages are unused, I/O and memory
was wasted - Assume s pages are prepaged and a of the pages is
used - Is cost of s a save pages faults gt or lt than
the cost of prepaging s (1- a) unnecessary
pages? - a near zero ? prepaging loses
59Other Issues Page Size
- Page size selection must take into consideration
- fragmentation
- table size
- I/O overhead
- locality
60Other Issues TLB Reach
- TLB Reach - The amount of memory accessible from
the TLB - TLB Reach (TLB Size) X (Page Size)
- Ideally, the working set of each process is
stored in the TLB. Otherwise there is a high
degree of page faults. - Increase the Page Size. This may lead to an
increase in fragmentation as not all applications
require a large page size - Provide Multiple Page Sizes. This allows
applications that require larger page sizes the
opportunity to use them without an increase in
fragmentation.
61Other Issues Program Structure
- Program structure
- Int128,128 data
- Each row is stored in one page
- Program 1
- for (j 0 j lt128 j)
for (i 0 i lt 128 i)
datai,j 0 - 128 x 128 16,384 page faults
- Program 2
- for (i 0 i lt 128
i) for (j 0 j lt
128 j)
datai,j 0 - 128 page faults
62Other Issues I/O interlock
- I/O Interlock Pages must sometimes be locked
into memory - Consider I/O. Pages that are used for copying a
file from a device must be locked from being
selected for eviction by a page replacement
algorithm.
63Reason Why Frames Used For I/O Must Be In Memory
64Operating System Examples
65Windows XP
- Uses demand paging with clustering. Clustering
brings in pages surrounding the faulting page. - Processes are assigned working set minimum and
working set maximum - Working set minimum is the minimum number of
pages the process is guaranteed to have in memory - A process may be assigned as many pages up to its
working set maximum - When the amount of free memory in the system
falls below a threshold, automatic working set
trimming is performed to restore the amount of
free memory - Working set trimming removes pages from processes
that have pages in excess of their working set
minimum
66Solaris
- Maintains a list of free pages to assign faulting
processes - Lotsfree threshold parameter (amount of free
memory) to begin paging - Desfree threshold parameter to increasing
paging - Minfree threshold parameter to being swapping
- Paging is performed by pageout process
- Pageout scans pages using modified clock
algorithm - Scanrate is the rate at which pages are scanned.
This ranges from slowscan to fastscan - Pageout is called more frequently depending upon
the amount of free memory available
67Solaris 2 Page Scanner
68End of Chapter 9