Hello!

## MEMORY VIRTUALIZATION

Shivaram Venkataraman CS 537, Spring 2020

## **ADMINISTRIVIA**

- Project Ib is due Wednesday
- Project Ia grades this week

- Midterm makeup requests (email or Piazza)

## AGENDA / LEARNING OUTCOMES

Memory virtualization

What are main techniques to virtualize memory?

What are their benefits and shortcomings?

## **RECAP**

MEMORY VIRTUALIZATION

Abstraction

Mechanism

Ess is unaware of sharing

Transparency: Process is unaware of sharing

Protection: Cannot corrupt OS or other process memory

Efficiency: Do not waste memory or slow down processes

Sharing: Enable sharing between cooperating processes

## **ABSTRACTION: ADDRESS SPACE**



## MEMORY ACCESS

List of memory location accessed

Initial %rip =  $0 \times 10$ %rbp = 0x200

0x10: movl 0x8(%rbp), %edi
0x13: addl \$0x3, %edi

0x19: movl %edi, 0x8(%rbp)

%rbp is the base pointer: points to base of current stack frame

**%rip** is instruction pointer (or program counter)

Fetch instruction at addr 0x10 Exec:

load from addr 0x208

Fetch instruction at addr 0x13 Exec:

no memory access

Fetch instruction at addr 0x19 Exec:

store to addr 0x208

## MEMORY VIRTUALIZATION: MECHANISMS

## HOW TO VIRTUALIZE MEMORY

Problem: How to run multiple processes simultaneously?

Addresses are "hardcoded" into process binaries

How to avoid collisions? wile Protection, Maring etc.

Possible Solutions for Mechanisms (covered in this class):

- I. Time Sharing
- 2. Static Relocation
- 3. Base
- 4. Base+Bounds



## TIME SHARE MEMORY: EXAMPLE

Ly No shoring
of memory
Ly Also violates
protection

## PROBLEMS WITH TIME SHARING?

Ridiculously poor performance

Better Alternative: space sharing!

At same time, space of memory is divided across processes

Remainder of solutions all use space sharing



## 2) STATIC RELOCATION

Idea: OS rewrites each program before loading it as a process in memory Each rewrite for different process uses different addresses and pointers Change jumps, loads of static data



## STATIC: LAYOUT IN MEMORY





```
0x3010:movl 0x8(%rbp), %edi
0x3013:addl $0x3, %edi
0x3019:movl %edi, 0x8(%rbp)
```

## STATIC RELOCATION: DISADVANTAGES

#### No protection

- Process can destroy OS or other processes
- No privacy

Cannot move address space after it has been placed

May not be able to allocate new process

## 3) DYNAMIC RELOCATION

Goal: Protect processes from one another

Requires hardware support

Memory Management Unit (MMU)

MMU dynamically changes process address at every memory reference

- Process generates logical or virtual addresses (in their address space)
- Memory hardware uses physical or real addresses



### HARDWARE SUPPORT FOR DYNAMIC RELOCATION

Privileged (protected, kernel) mode: OS runs

- When enter OS (trap, system calls, interrupts, exceptions) \_\_\_\_\_ change contents of
- Allows certain instructions to be executed (Can manipulate contents of MMU)
- Allows OS to access all of physical memory

User mode: User processes run

Perform translation of logical address to physical address

mon does his

## IMPLEMENTATION OF DYNAMIC RELOCATION: BASE REG

Translation on every memory access of user process MMU adds base register to logical address to form physical address



## DYNAMIC RELOCATION WITH BASE REGISTER

Translate virtual addresses to physical by adding a fixed offset each time.
 Store offset in base register





## VISUAL EXAMPLE OF DYNAMIC RELOCATION: BASE REGISTER

## QUIZ 7

#### https://tinyurl.com/quiz7-sp20





Virtual Physical PI: load 100, RI load 1124, RI

P2: load 1000, R1 load 5096, R1

PI: store 3072, RI store 4096, RI (3072 + 1024)

## 4) DYNAMIC WITH BASE+BOUNDS

Idea: limit the address space with a bounds register

Base register: smallest physical addr (or starting location)

Bounds register: size of this process's virtual address space

Sometimes defined as largest physical address (base + size)

OS kills process if process loads/stores beyond bounds



## IMPLEMENTATION OF BASE+BOUNDS

Translation on every memory access of user process

- MMU compares logical address to bounds register if logical address is greater, then generate error
- MMU adds base register to logical address to form physical address

store 3072, R1
bound 1024 & size
of
addr
tress
space







## MANAGING PROCESSES WITH BASE AND BOUNDS

Context-switch: Add base and bounds registers to proc struct 705 Mores Steps

- Change to privileged mode
   Save base and bounds registers of old process
  - Load base and bounds registers of new process
  - Change to user mode and jump to new process

Protection requirement

- User process cannot change base and bounds registers
- User process cannot change to privileged mode

instructions which control MMV can only be called in servel model

## BASE AND BOUNDS

# Efficieny \* Transparency Protection

#### Advantages

- --- Provides protection (both read and write) across address spaces
- ->Supports dynamic relocation

Can place process at different locations initially and move address spaces

Simple, inexpensive implementation: Few registers, little logic in MMU

#### **Disadvantages**

Each process must be allocated contiguously in physical memory Must allocate memory that may not be used by process No partial sharing: Cannot share parts of address space



## 5) **SEGMENTATION**

Divide address space into logical segments

- Each segment corresponds to logical entity in address space

(code, stack, heap)

Each segment has separate base + bounds register



2<sup>n</sup>-1

## SEGMENTED ADDRESSING

Process now specifies segment and offset within segment segment

How does process designate a particular segment?

- Use part of logical address
  - Top bits of logical address select segment
  - Low bits of logical address select offset within segment

What if small address space, not enough bits?

- Implicitly by type of memory reference
- Special registers

## SEGMENTATION IMPLEMENTATION

MMU contains Segment Table (per process)

Each segment has own base and bounds, protection bits 3 Permission

Example: 14 bit logical address, 4 segments;

How many bits fc

| '             | Segment | Base / | Bounds | R W |
|---------------|---------|--------|--------|-----|
| for segment?  | 0       | 0x2000 | 0x6ff  | 1 0 |
| 2 61 1-       | 1       | 0x0000 | 0x4ff  | 1 1 |
| How many bits | 2       | 0x3000 | 0xfff  | 1 1 |
| for offset?   | 3       | 0x0000 | 0x000  | 0 0 |
| 14-2=126      | \A_1    |        |        |     |

remember:

I hex digit  $\rightarrow$  4 bits

access

## VISUAL INTERPRETATION



Segment numbers:

0: code+data

I: heap

2: stack



 $0 \times 1600 + 0 \times 010 = 0 \times 1610$ 

**Physical** 

load 0x1010, R1

 $0 \times 400 + 0 \times 010 = 0 \times 410$ 

load 0x1100, R1

 $0 \times 400 + 0 \times 100 = 0 \times 500$ 

Segment numbers:

0: code+data

I: heap

2: stack

## QUIZ 8!

### https://tinyurl.com/cs537-sp20-quiz8

14 bit addressing scheme

| Segment | Base   | Bounds | R W |
|---------|--------|--------|-----|
| 0       | 0x2000 | 0x6ff  | 1 0 |
| 1       | 0x0000 | 0x4ff  | 1 1 |
| 2       | 0x3000 | 0xfff  | 1 1 |
| 3       | 0x0000 | 0x000  | 0 0 |

| R                 | ۵m | em | hei | ŗ. |
|-------------------|----|----|-----|----|
| $\mathcal{L}_{i}$ | em | em | bei | •  |

I hex digit  $\rightarrow$  4 bits





Translate logical (in hex) to physical

of segment

0x0240: 0x2000 + 240 = 0x2240

0×1108: 0,0000 + 108 = 0,0008

0x265c: 0x3000 + 65c = 0x365c

0x3002: FAIL

Virtuel 0KB **Program Code** 1KB Heap 2KB (free)

15KB

16KB

## HOW DO STACKS GROW?

12 bits offset: 4 k

Stack goes 16K  $\rightarrow$  12K, in physical memory is 28K  $\rightarrow$  24K Segment base is at 28K

Virtual address 0x3C00 = 15K $\rightarrow$  top 2 bits (0x3) segment ref, offset is 0xC00 = 3K

How do we make CPU translate that?

Negative offset = subtract max segment from offset 3K = 3K - 4K = -1KStack - 1KStack - 1KStack - 1K 2K - 4K 2K - 4K

## HOW DOES THIS LOOK IN X86

Stack Segment (SS): Pointer to the stack

Code Segment (CS): Pointer to the code

Data Segment (DS): Pointer to the data

Extra Segment (ES): Pointer to extra data

F Segment (FS): Pointer to more extra data

G Segment (GS): Pointer to still more extra data

## ADVANTAGES OF SEGMENTATION

Enables sparse allocation of address space

Stack and heap can grow independently

- Heap: If no data on free list, dynamic memory allocator requests more from OS
   (e.g., UNIX: malloc calls sbrk())
- Stack: OS recognizes reference outside legal segment, extends stack implicitly

Different protection for different segments

- Enables sharing of selected segments
- Read-only status for code

Supports dynamic relocation of each segment

## DISADVANTAGES OF SEGMENTATION

|                                                             | 0KB r | Not Compacted    |
|-------------------------------------------------------------|-------|------------------|
| Each segment must be allocated contiguously                 |       |                  |
|                                                             | 8KB   | Operating System |
| May not have sufficient physical memory for large segments? | 16KB  |                  |
|                                                             |       | (not in use)     |
| External Fragmentation                                      | 24KB  | Allocated        |
| 20 KB Megnet                                                | 32KB  | (not in use)     |
| 2012                                                        | 40KB  | Allocated        |
|                                                             | 48KB  | (not in use)     |
|                                                             | 56KB  |                  |
|                                                             | 64KB  | Allocated        |

## **NEXT STEPS**

Project 1b: Due Wednesday!

Next class: Paging, TLBs and more!