1 Pre-Check

This section is designed as a conceptual check for you to determine if you conceptually understand and have any misconceptions about this topic. Please answer true/false to the following questions, and include an explanation:

1.1 Polling and interrupts are only relevant concepts for low level programming.

False. Similar concepts apply to almost all types of applications, including web apps, mobile, distributed systems, and so on.

1.2 Memory mapped IO only works with polling.

False. Not necessarily, the implementation backing the memory mapping can use interrupt-driven IO (for example, reading files).

1.3 The virtual and physical page number must be the same size.

False. There could be less possible physical pages than virtual pages. However, the page size does need to be the same.
2 Polling & Interrupts

Fill out this table that compares polling and interrupts.

<table>
<thead>
<tr>
<th>Operation</th>
<th>Definition</th>
<th>Pro/Good for</th>
<th>Con</th>
</tr>
</thead>
</table>
| **Polling** | Forces the hardware to wait on ready bit (alternatively, if timing of device is known, the ready bit can be polled at the frequency of the device). | • Low Latency  
• Low overhead when data is available  
• Good For: devices that are always busy or when you can’t make progress until the device replies | • Can’t do anything else while polling  
• Can’t sleep while polling (CPU always at full speed) |
| **Interrupts** | Hardware fires an “exception” when it becomes ready. CPU changes PC register to execute code in the interrupt handler when this occurs. | • Can do useful work while waiting for response  
• Can wait on many things at once  
• Good for: Devices that take a long time to respond, especially if you can do other work while waiting. | • Nondeterministic when interrupt occurs  
• interrupt handler has some overhead (e.g. saves all registers, flush pipeline, etc.)  
• Higher latency per event  
• Worse throughput |

3 Memory Mapped I/O

For this question, the following addresses correspond to registers in some I/O devices and not regular user memory.

- **0xFFFF0000**—Receiver Control: LSB is the ready bit (in the context of polling), there may be other bits set that we don’t need right now.
- **0xFFFF0004**—Receiver Data: Received data stored at lowest byte.
- **0xFFFF0008**—Transmitter Control: LSB is the ready bit (in the context of polling), there may be other bit set that we don’t need right now.
- **0xFFFF000C**—Transmitter Data: Transmitted data stored at lowest byte.

Recall that receiver will only have data for us when the corresponding ready bit is 1, and that we can only write data to the transmitter when its ready bit is 1. Write RISC-V code that reads byte from the receiver (busy-waiting if necessary) and writes that byte to the transmitter (busy-waiting if necessary).

```
lui t0 0xfffff0
receive_wait: lw t1 0(t0)
           andi t1 t1 1 # poll on ready of receiver
           beq t1 x0 receive_wait
           lb t2 4(t0)    # load data
```
transmit_wait: lw t1 8(t0)  # poll on ready of transmitter
andi t1 t1 1
beq t1 x0 transmit_wait  # write to transmitter
sb t2 12(t0)

4 Addressing

Virtual Address (VA) What your program uses

<table>
<thead>
<tr>
<th>Virtual Page Number (VPN)</th>
<th>Page Offset</th>
</tr>
</thead>
</table>

Physical Address (PA) What actually determines where in memory to go

<table>
<thead>
<tr>
<th>Physical Page Number (PPN)</th>
<th>Page Offset</th>
</tr>
</thead>
</table>

For example, with 4 KiB pages and byte addresses, there are 12 page offset bits since $4 \text{ KiB} = 2^{12} \times 2 = 4096B$.

Pages
A chunk of memory or disk with a set size. Addresses in the same virtual page map to addresses in the same physical page. The page table determines the mapping.

<table>
<thead>
<tr>
<th>Valid</th>
<th>Dirty</th>
<th>Permission Bits</th>
<th>PPN</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>— Page entry (VPN: 0) —</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>— Page entry (VPN: 1) —</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Each stored row of the page table is called a page table entry. There are $2^{VPN}$ bits such entries in a page table. Say you have a VPN of 5 and you want to use the page table to find what physical page it maps to; you’ll check the 5th (0-indexed) page table entry. If the valid bit is 1, then that means that the entry is valid (in other words, the physical page corresponding to that virtual page is in main memory as opposed to being only on disk) and therefore you can get the PPN from the entry and access that physical page in main memory. The page table is stored in memory: the OS sets a register (the Page Table Base Register) telling the hardware the address of the first entry of the page table. If you write to a page in memory, the processor updates the “dirty” bit in the page table entry corresponding to that page, which lets the OS know that updating that page on disk is necessary (remember: main memory contains a subset of what’s on disk). This is a similar concept as having a dirty bit for each cache block in a write-back cache, which we covered in
lecture and in Lab 9. Each process gets its own illusion of full memory to work with, and thereby its own page table.

**Protection Fault** The page table entry for a virtual page has permission bits that prohibit the requested operation. This is how a segmentation fault occurs.

**Page Fault** The page table entry for a virtual page has its valid bit set to false. This means that the entry is not in memory, so we pull it from disk, add the page to memory (evicting another page if necessary), and add the mapping to the page table and the TLB.

**Translation Lookaside Buffer**
A cache for the page table. Each block is a single page table entry. If an entry is not in the TLB, it’s a TLB miss. Assuming fully associative:

<table>
<thead>
<tr>
<th>TLB Valid</th>
<th>Tag (VPN)</th>
<th>Page Table Entry</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td>Page Dirty</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Permission Bits</td>
</tr>
<tr>
<td></td>
<td></td>
<td>PPN</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>TLB entry</th>
</tr>
</thead>
</table>

To access some memory location, we get the virtual page number (VPN) from the virtual address (VA) and first try to translate the VPN to a physical page number (PPN) using the translation lookaside buffer (TLB). If the TLB doesn’t contain the desired VPN, we check if the page table contains it (remember: the TLB is a subset of the page table!). If the page table doesn’t contain an entry for the VPN, then this is a page fault; memory doesn’t contain the corresponding physical page! This means we need to fetch the physical page from disk and put it into memory, update the page table entry, and load the entry into the TLB. Then, we use the physical page and the offset of the physical address in the page to access memory as the program intended.

4.1 What are three specific benefits of using virtual memory?

- **Illusion of infinite memory** (bridges memory and disk in memory hierarchy).
- **Simulates full address space** for each process so that the linker/loader don’t need to know about other programs.
- Enforces protection between processes and even within a process (e.g., read-only pages set up by the OS).

4.2 What should happen to the TLB when a new value is loaded into the page table address register?

The valid bits of the TLB should all be set to 0. The page table entries in the TLB corresponded to the old process/page table, so none of them are valid once the page table address register points to a different page table.