#### MEMORY: SMALLER PAGE TABLES AND SWAPPING

Shivaram Venkataraman

CS 537, Fall 2024

### **ADMINISTRIVIA**

Project 3 is due next week

Code review

#### Midterm I:

Practice exams on Canvas

Review session in class

## AGENDA / LEARNING OUTCOMES

Memory virtualization

What are the challenges with paging?

How we go about addressing them?

How we support virtual mem larger than physical mem? What are mechanisms and policies for this?

# **RECAP**

## PROS/CONS OF PAGING

#### Pros

#### No external fragmentation

Any page can be placed in any frame in physical memory

#### Fast to allocate and free

- Alloc: No searching for suitable free space
- Free: Doesn't have to coalesce with adjacent free space

#### Cons

#### Additional memory reference

- MMU stores only base address of page table

Storage for page tables may be substantial

- Simple page table: Requires PTE for all pages in address space
- Entry needed even if page not allocated ?

### TLB SUMMARY

Pages are great, but accessing page tables for every memory access is slow Cache recent page translations -> TLB

MMU performs TLB lookup on every memory access

TLB performance depends strongly on workload

TLBs increase cost of context switches

- Flush TLB on every context switch
- Add ASID to every TLB entry

In different systems, hardware or OS handles TLB misses

### WHY ARE PAGE TABLES LARGE?

how to avoid storing these?

|   | PFN     | valid            | prot       |
|---|---------|------------------|------------|
|   | 10      | <u>l</u>         | r-x        |
|   | -<br>23 | O<br>I           | -<br>rw-   |
|   | -       | Ó                | -          |
|   | -       | 0<br>0<br>0      | -          |
|   | -       | Ŏ                | -          |
|   | m       | any more invalid |            |
|   | -       | Q                | -          |
|   | -       | Q                | -          |
|   | -       | 0<br>0<br>0      | -          |
| _ | -       | 0                | -          |
|   | 28<br>4 |                  | rw-        |
|   | 4       | 1                | rw-<br>rw- |

# MULTILEVEL PAGE TABLES

Linear Page Table PTBR[ 201 valid prot **PFN** rx PFN 201 13 rx 100 rw 202 PFN PFN 203 204

PFN

86

15

rw

Creates multiple levels of page tables

Only allocate page tables for pages in use

Allow page table to be allocated non-contiguously



## MULTILEVEL TRANSLATION EXAMPLE

| page | directory |
|------|-----------|
|------|-----------|

| p 4.9 cm | <b>J J J J</b> |
|----------|----------------|
| PPN      | valid          |
| 0x3      | T              |
| -        | 0              |
| -        | 0              |
| -        | 0              |
| -        | 0              |
| -        | 0              |
| -        | 0              |
| _        | 0              |
| -        | 0              |
| _        | 0              |
| _        | 0              |
| -        | 0              |
| _        | 0              |
| _        | 0              |
| 0×92     | 1              |

page of PT (@PPN:0x3)

| PPN  | valid |
|------|-------|
| 0×10 | I     |
| 0×23 | 1     |
| -    | 0     |
| -    | 0     |
| 0×80 | I     |
| 0×59 | I     |
| -    | 0     |
| -    | 0     |
| -    | 0     |
| -    | 0     |
| -    | 0     |
| -    | 0     |
| -    | 0     |
| -    | 0     |
| -    | 0     |

page of PT (@PPN:0x92)

| `     | •                                    |
|-------|--------------------------------------|
| valid |                                      |
| 0     | translate 0x01ABC                    |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 0     |                                      |
| 1     |                                      |
| 1     |                                      |
|       | 0<br>0<br>0<br>0<br>0<br>0<br>0<br>0 |

#### 20-bit address:

outer page(4 bits) inner page(4 bits) page offset (12 bits)

#### MORE THAN 2 LEVELS?

Problem: page directories (outer level) may not fit in a page

#### 64-bit address:

|          | OT-DIC addicss. |                      |                       |  |  |
|----------|-----------------|----------------------|-----------------------|--|--|
| outer pa | ge? ( bits )    | inner page (10 bits) | page offset (12 bits) |  |  |

#### Solution:

- Split page directories into pieces
- Use another page dir to refer to the page dir pieces.



How large is virtual address space with 4 KB pages, 4 byte PTEs, (each page table fits in page)

- I level:
- 2 levels:

## EXAMPLE: X86-64

Virtual address: 48 bits

Upper 16 bits are sign extended (all 0, all 1)

Physical address: 48 bits

PTE size = 48 bits + metadata = 8 bytes PTEs/4kb page = 512 = 9bits

| 47 - 39        | 38-30             | 29-21          | 21-12      | 11-0             |
|----------------|-------------------|----------------|------------|------------------|
| Page Map Lvl 4 | Page Pointer Dir. | Page directory | Page table | offset (12 bits) |
| (9 bits)       | (9 bits)          | (9 bits)       | (9 bits)   |                  |

### FULL SYSTEM WITH TLBS

On TLB miss: lookups with more levels more expensive

Assume 3-level page table
Assume 256-byte pages
Assume 16-bit addresses
Assume ASID of current process is 211

| ASID | VPN  | PFN  | Valid |
|------|------|------|-------|
| 211  | 0xbb | 0x91 | 1     |
| 211  | 0xff | 0x23 | Ι     |
| 122  | 0×05 | 0x91 | 1     |
| 211  | 0×05 | 0x12 | 0     |

How many physical accesses for each instruction? (Ignore ops changing TLB)

(a) 0xAA10: addl \$0x5, %edh

(b) 0xBB13: addl \$0x3, %edi

#### INVERTED PAGE TABLE

Only store entries for virtual pages w/ valid physical mappings

#### Naïve approach:

Search through data structure <ppn, vpn+asid> to find match

Too much time to search entire table

### **INVERTED PAGE TABLE**

#### Better:

Find possible matches entries by hashing vpn+asid

Use chaining to handle collisions

Smaller number of entries to search for exact match

Managing inverted page table requires software-controlled TLB

Used in IBM POWER and Intel Itanium, but complicated





#### https://tinyurl.com/cs537-fa24-q7

What problem(s) can be solved by using ASIDs?



For a hardware-managed TLB miss, which of the following statements are true?

For a software-managed TLB miss, which of the following statements are true?

16-bit address space and 4kB page size.

Assume the page table is at 0x2000

Fach PTF is of 4 bytes. No TI B

| ach PTE is of 4 bytes. No TLB |     |
|-------------------------------|-----|
| VPN:0                         | 0x0 |
|                               | 0x0 |
| PageTable                     | 0x1 |
|                               | 0x9 |
|                               | 0x7 |
|                               | 0x8 |
|                               | 0   |
|                               | :   |
| VPN:15                        | 0   |
|                               |     |

Virtual Addresses
0x3000: load 0x5320, %eax
0x3004: load 0x4004, %ebx
0x3008: mul %ecx, %eax, %ebx
0x300C: store %ebx, 0x5324
0x3010: load 0x5328, %ebx

16-bit address space and 4kB page size. Assume the page table is at 0x2000

Each DTE is of 4 bytes With E entry TIP

| Each PIE is of 4 bytes. With 5 entry ILB |     |                                                         |
|------------------------------------------|-----|---------------------------------------------------------|
| VPN:0                                    | 0x0 |                                                         |
|                                          | 0x0 | Virtual Addresses 0x3000: load 0x5320, %eax             |
| PageTable                                | 0x1 | 0x3004: load 0x4004, %ebx                               |
|                                          | 0x9 | 0x3008: mul %ecx, %eax, %ebx 0x300C: store %ebx, 0x5324 |
|                                          | 0x7 | 0x3010: load 0x5328, %ebx                               |
|                                          | 0x8 |                                                         |
|                                          | 0   |                                                         |
|                                          | :   |                                                         |
| VPN:15                                   | 0   |                                                         |
|                                          |     |                                                         |

### EFFECT OF PAGE SIZE

```
Assume 4KB pages 32 TLB entries
```

```
Assume 2MB pages 32 TLB entries
```

```
Miss rate of TLB: #TLB misses / #TLB lookups

#TLB lookups? number of accesses to a =

Chance of a TLB miss?

=
```

**#TLB** lookups? number of accesses to a =

Chance of a TLB miss?

# LARGE PAGES (HUGE PAGES)

TLB reach: how much memory can be accessed without a TLB miss?

1000 entries 4KB pages → 4MB

Large pages 1000 entries, 2MB pages → 2GB!

#### How to use?

- Programmer requested: mmap(MAP\_HUGE) returns huge pages
- Transparent Huge Pages (THP, in Linux)
  - OS uses huge pages when available for > 2MB allocations

## TRANSLATING LARGE PAGES

HugePages saves TLB entries. But how does it affect page translation?

4KB pages: 4 levels → 4 memory accesses

| 47 - 39        | 38-30             | 29-21          | 21-12      | 11-0             |
|----------------|-------------------|----------------|------------|------------------|
| Page Map Lvl 4 | Page Pointer Dir. | Page directory | Page table | offset (12 bits) |
| (9 bits)       | (9 bits)          | (9 bits)       | (9 bits)   | Oliset (12 bits) |

#### 2MB pages:

| Page Map Lvl 4 | Page Pointer Dir. | Page Directory | page offset ( bits) |
|----------------|-------------------|----------------|---------------------|
| ( bits)        | ( bits)           | ( bits)        | page onset ( bits)  |

## SHARING WITH PAGE TABLES



# Where is page sharing used?

- Kernel data mapped into each process
- Different processes running the same binary
- User-level system libraries
- Shared pages as IPC

### **SUMMARY: BETTER PAGE TABLES**

Problem: Simple linear page tables require too much contiguous memory

Many options for efficiently organizing page tables

If OS traps on TLB miss, OS can use any data structure

Inverted page tables (hashing)

If Hardware handles TLB miss, page tables must follow specific format

- Multi-level page tables used in x86 architecture
- Each page table fits within a page

Large pages can reduce TLB use and number of accesses for translation

# **SWAPPING**

## **MOTIVATION**

OS goal: Support processes when not enough physical memory

- Single process with very large address space
- Multiple processes with combined address spaces

User code should be independent of amount of physical memory

Correctness, if not performance

Virtual memory: OS provides illusion of more physical memory Why does this work?

Relies on key properties of user processes (workload)
 and machine architecture (hardware)





### LOCALITY OF REFERENCE

#### Leverage locality of reference within processes

- Spatial: reference memory addresses near previously referenced addresses
- Temporal: reference memory addresses that have referenced in the past
- Processes spend majority of time in small portion of code
  - Estimate: 90% of time in 10% of code

#### Implication:

- Process only uses small amount of address space at any moment
- Only small amount of address space must be resident in physical memory

### MEMORY HIERARCHY

Leverage memory hierarchy of machine architecture Each layer acts as "backing store" for layer above



### **SWAPPING INTUITION**

Idea: OS keeps unreferenced pages on disk

Slower, cheaper backing store than memory

Process can run when not all pages are loaded into main memory OS and hardware cooperate to make large disk seem like memory

Same behavior as if all of address space in main memory

#### Requirements:

- OS must have mechanism to identify location of each page in address space → in memory or on disk
- OS must have **policy** to determine which pages live in memory and which on disk

# VIRTUAL ADDRESS SPACE MECHANISMS

Each page in virtual address space maps to one of three locations:

- Physical main memory: Small, fast, expensive
- Disk (backing store): Large, slow, cheap
- Nothing (error): Free

Extend page tables with an extra bit: present

- permissions (r/w), valid, present
- Page in memory: present bit set in PTE
- Page on disk: present bit cleared
  - PTE points to block on disk
  - Causes trap into OS when page is referenced
  - Trap: page fault



| Phys I | Mem | ory |
|--------|-----|-----|
|--------|-----|-----|



| PFN valid          | prot<br>r-x | present<br>I |
|--------------------|-------------|--------------|
| - 0                | -           | -            |
| - 0<br>23 [        | rw-         | 0            |
| - 0                | -           | -            |
| - 0<br>- 0         | -           | -            |
| - 0                | -           | -            |
| - 0                | -           | -            |
| - 0                | -           | -            |
| - 0                | -           | -            |
| - 0                | -           | -            |
| - 0                | -           | -            |
| - 0<br>28 I<br>4 I | rw-         | 0            |
| 4 I                | rw-<br>rw-  | I            |

What if access vpn 0xb?

### VIRTUAL MEMORY MECHANISMS

First, hardware checks TLB for virtual address

if TLB hit, address translation is done; page in physical memory

#### Else ...

- Hardware or OS walk page tables
- If PTE designates page is present, then page in physical memory (i.e., present bit is cleared)

#### Else

- Trap into OS (not handled by hardware)
- OS selects victim page in memory to replace
  - Write victim page out to disk if modified (use dirty bit in PTE)
- OS reads referenced page from disk into memory
- Page table is updated, present bit is set
- Process continues execution

# **SWAPPING POLICIES**

#### Goal: Minimize number of page faults

- Page faults require milliseconds to handle (reading from disk)
- Implication: Plenty of time for OS to make good decision

#### OS has two decisions

Page selection

When should a page (or pages) on disk be brought into memory?

Page replacement

Which resident page (or pages) in memory should be thrown out to disk?

# **NEXT STEPS**

Project 3: Due soon!