#### **Recall: Modern Process with Multiple Threads**

• Process: Operating system abstraction to represent **CS162** what is needed to run a single, multithreaded **Operating Systems and** program Systems Programming • Two parts: - Multiple Threads Lecture 4 » Each thread is a single, sequential stream of execution - Protected Resources: Thread Dispatching » Main Memory State (contents of Address Space) » I/O state (i.e. file descriptors) • Why separate the concept of a thread from that of September 11, 2006 a process? Prof. John Kubiatowicz - Discuss the "thread" part of a process (concurrency) - Separate from the "address space" (Protection) http://inst.eecs.berkeley.edu/~cs162 - Heavyweight Process  $\equiv$  Process with one thread 9/11/06 Kubiatowicz CS162 ©UCB Fall 2006 Lec 4.2 **Recall: Single and Multithreaded Processes Recall:** Classification



- $\boldsymbol{\cdot}$  Threads encapsulate concurrency
  - "Active" component of a process
- · Address spaces encapsulate protection
  - Keeps buggy program from trashing the system
  - "Passive" component of a process

9/11/06

Kubiatowicz CS162 ©UCB Fall 2006

Lec 4.3

| uppe<br>bogges<br># threads #<br>Per AS: | One                                                                                                                                             | Many             |
|------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------|------------------|
| One                                      | MS/DOS, early<br>Macintosh                                                                                                                      | Traditional UNIX |
| Many                                     | Embedded systems<br>(Geoworks, V×Works,<br>JavaOS,etc)<br>JavaOS, Pilot(PC)<br>Mach, OS/2, Li<br>Win 95?, Mac C<br>Win NT to X<br>Solaris, HP-L |                  |

• Real operating systems have either

- One or many address spaces
- One or many threads per address space
- Did Windows 95/98/ME have real memory protection?
   No: Users could overwrite process tables/System DLLs

Kubiatowicz CS162 ©UCB Fall 2006



#### MIPS: Software conventions for Registers

| 1   | at   | reserved for assembler                                  | (callee must save)                                                                                    |
|-----|------|---------------------------------------------------------|-------------------------------------------------------------------------------------------------------|
| 2   | v0   | expression evaluation &                                 | 23 s7                                                                                                 |
|     | v1   | function results                                        | 24 t8 temporary (cont'd)                                                                              |
|     | a0   | arguments                                               | 25 t9                                                                                                 |
|     | a1   |                                                         | 26 k0 reserved for OS kernel                                                                          |
|     | a2   |                                                         | 27 k1                                                                                                 |
|     | a3   |                                                         | 28 gp Pointer to global area                                                                          |
|     | t0   | temporary: caller saves                                 | 29 sp Stack pointer                                                                                   |
|     |      | (callee can clobber)                                    | 30 fp frame pointer                                                                                   |
| 5   | t7   |                                                         | 31 ra Return Address (HW)                                                                             |
| - 5 | Save | e calling procedure:<br>e caller-saves regs<br>e v0, v1 | <ul> <li>After return, assume</li> <li>Callee-saves reg OK</li> <li>gp,sp,fp OK (restored)</li> </ul> |

Kubiatowicz CS162 ©UCB Fall 2006

9/11/06

#### Single-Threaded Example

#### • Imagine the following C program:

```
main() {
   ComputePI("pi.txt");
   PrintClassList("clist.text");
}
```

- What is the behavior here?
  - Program would never print out class list
  - Why? ComputePI would never finish

Lec 4.7

#### Use of Threads



```
main() {
    CreateThread(ComputePI("pi.txt"));
    CreateThread(PrintClassList("clist.text"));
}
```

- What does "CreateThread" do?
  - Start independent thread running given procedure
- What is the behavior here?

CPU2

Time .

CPU1

9/11/06

- Now, you would actually see the class list

CPU1

- This should behave as if there are two separate CPUs

Kubiatowicz CS162 ©UCB Fall 2006

CPU2 CPU1

CPU2

# Memory Footprint of Two-Thread Example If we stopped this program and examined it with a

debugger, we would see - Two sets of CPU registers Stack 1 - Two sets of Stacks • Questions: - How do we position stacks relative to Stack 2 Address each other? - What maximum size should we choose : Space for the stacks? - What happens if threads violate this? Heap - How might you catch violations? Global Data Code 9/11/06 Kubiatowicz CS162 ©UCB Fall 2006 Lec 4.10

#### Per Thread State

- Each Thread has a *Thread Control Block* (TCB)
  - Execution State: CPU registers, program counter, pointer to stack
  - Scheduling info: State (more later), priority, CPU time
  - Accounting Info
  - Various Pointers (for implementing scheduling queues)
  - Pointer to enclosing process? (PCB)?
  - Etc (add stuff as you find a need)
- $\boldsymbol{\cdot}$  In Nachos: "Thread" is a class that includes the TCB
- OS Keeps track of TCBs in protected memory - In Array, or Linked List, or ...

# Lifecycle of a Thread (or Process)



- $\boldsymbol{\cdot}$  As a thread executes, it changes state:
  - new: The thread is being created
  - ready: The thread is waiting to run
  - running: Instructions are being executed
  - waiting: Thread waiting for some event to occur
  - terminated: The thread has finished execution
- $\cdot$  "Active" threads are represented by their TCBs
- TCBs organized into queues based on their state 9/11/06 Kubiatowicz C5162 @UCB Fall 2006

Lec 4.11

Lec 4.9

#### Ready Queue And Various I/O Device Queues

- Thread not running  $\Rightarrow$  TCB is in some scheduler queue
  - Separate queue for each device/signal/condition
  - Each queue can have a different scheduler policy



#### **Administrivia**

- Time to start Project 1
  - Go to Nachos page: start reading tasks and Nachos code
  - Java 1.5 now supported (let us know about bugs...)
- Nachos readers:
  - Available from Copy Central now (Required!)
  - Includes lectures and printouts of all of the code
- Warning: you should have produced an ssh key
  - you will be prompted for a passphrase
  - We need to autogenerate ssh keys for you
  - When prompted for a pass phrase, don't forget it!
  - This is needed for group collaboration tools
- Not everyone has run the register program!
  - This should happen automatically when you login, but you need to avoid hitting control-C

Lec 4.15

#### Administrivia

- Audio Podcasts are now available
  - RSS, stream, MP3 downloads
- Group assignments now posted on website
  - Check out the "Group/Section Assignment" link
  - Please attend your newly assigned section

| Section | Time            | Location        | ТА             |
|---------|-----------------|-----------------|----------------|
| 101     | Th 9:00-10:00P  | 3111 Etcheverry | Thomas Kho     |
| 102     | Th 11:00-12:00A | 85 Evans        | Thomas Kho     |
| 104     | Th 1:00-2:00P   | 405 Soda        | Subhransu Maji |
| 103     | Th 2:00-3:00P   | 87 Evans        | Steven Houston |
| 105     | Th 5:00-6:00P   | 405 Soda        | Subhransu Maji |

#### 9/11/06

Kubiatowicz CS162 ©UCB Fall 2006

#### **Dispatch Loop**

· Conceptually, the dispatching loop of the operating system looks as follows:

```
Loop {
   RunThread():
   ChooseNextThread();
   SaveStateOfCPU(curTCB);
   LoadStateOfCPU(newTCB);
```

• This is an *infinite* loop

}

- One could argue that this is all that the OS does
- Should we ever exit this loop???
  - When would that be?

Lec 4.14



#### **Internal Events**



| <pre>Switch(tCur,tNew) {    /* Unload old thread */    TCB[tCur].regs.r7 = CPU.r7;</pre> |              | <ul> <li>How many registers need to be saved/restored?</li> <li>MIPS 4k: 32 Int(32b), 32 Float(32b)</li> </ul>                                                                                  |  |  |
|------------------------------------------------------------------------------------------|--------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
|                                                                                          |              | ntium: 14 Int(32b), 8 Float(80b), 8 SSE(128b),                                                                                                                                                  |  |  |
| TCB[tCur].regs.r0 = CPU.r0;<br>TCB[tCur].regs.sp = CPU.sp;                               | - Sp         | - Sparc(v7): 8 Regs(32b), 16 Int regs (32b) * 8 windows =<br>136 (32b)+32 Float (32b)<br>- Itanium: 128 Int (64b), 128 Float (82b), 19 Other(64b<br>• retpc is where the return should jump to. |  |  |
| TCB[tCur].regs.retpc = CPU.retpc; /*return add                                           | ar*/         |                                                                                                                                                                                                 |  |  |
| <pre>/* Load and execute new thread */</pre>                                             | - In         | reality, this is implemented as a jump                                                                                                                                                          |  |  |
| CPU.r7 = TCB[tNew].regs.r7;<br>                                                          |              | <ul> <li>There is a real implementation of switch in Nachos.</li> <li>See switch.s</li> </ul>                                                                                                   |  |  |
| CPU.r0 = TCB[tNew].regs.r0;                                                              | ×            | Normally, switch is implemented as assembly!                                                                                                                                                    |  |  |
| CPU.sp = TCB[tNew].regs.sp;                                                              | - Of         | - Of course, it's magical!<br>- But you should be able to follow it!                                                                                                                            |  |  |
| CPU.retpc = TCB[tNew].regs.retpc;<br>return; /* Return to CPU.retpc */                   | - Bu         |                                                                                                                                                                                                 |  |  |
| }<br>6 Kubiatowicz CS162 ©UCB Fall 2006 Lec                                              | 4.21 9/11/06 | Kubiatowicz CS162 ©UCB Fall 2006 Lec 4.22                                                                                                                                                       |  |  |

### Switch Details (continued)

- What if you make a mistake in implementing switch?
  - Suppose you forget to save/restore register 4
  - Get intermittent failures depending on when context switch occurred and whether new thread uses register 4
  - System will give wrong result without warning
- $\cdot$  Can you devise an exhaustive test to test switch code?
  - No! Too many combinations and inter-leavings
- Cautionary tail:
  - For speed, Topaz kernel saved one instruction in switch()
  - Carefully documented!
    - » Only works As long as kernel size < 1MB
  - What happened?
    - » Time passed, People forgot
    - » Later, they added features to kernel (no one removes features!)
    - » Very weird behavior started happening
  - Moral of story: Design for simplicity

9/11/06

006

Lec 4.23

# What happens when thread blocks on I/O?



- What happens when a thread requests a block of data from the file system?
  - User code invokes a system call
  - Read operation is initiated
  - Run new thread/switch
- Thread communication similar
  - Wait for Signal/Join
  - Networking

Kubiatowicz CS162 ©UCB Fall 2006

#### **External Events**

#### Raise priority • What happens if thread never does any $I/O_{.}$ eenable All Ints? never waits, and never yields control? Save registers Jo Interrupt add \$r1,\$r2,\$r3 - Could the ComputePI program grab all resources Dispatch to Handle subi \$r4,\$r1,#4 2 and never release the processor? slli \$r4,\$r4,#2 and Transfer Network » What if it didn't print to console? Packet from hardware Pipeline Flush - Must find way that dispatcher can regain control! External to Kernel Buffers Interrupt Answer: Utilize External Events \$r2,0(\$r4) ٦ω 1w \$r3.4(\$r4) - Interrupts: signals from hardware or software Restore registers \$r2,\$r2,\$r3 add that stop the running code and jump to kernel Clear current Int 8(\$r4),\$r2 C 147 Disable All Ints - Timer: like an alarm clock that goes off every Restore priority some many milliseconds RTI • If we make sure that external events occur • An interrupt is a hardware-invoked context switch frequently enough, can ensure dispatcher runs - No separate step to choose what to run next - Always run the interrupt handler immediately 9/11/06 Kubiatowicz CS162 ©UCB Fall 2006 Lec 4.25 9/11/06 Kubiatowicz CS162 ©UCB Fall 2006 Lec 4.26

## Use of Timer Interrupt to Return Control

Solution to our dispatcher problem
Use the timer interrupt to force scheduling decisions



I/O interrupt: same as timer interrupt except that DoHousekeeping() replaced by ServiceIO().

#### 9/11/06

#### Choosing a Thread to Run

**Example: Network Interrupt** 



- Zero ready threads dispatcher loops
  - » Alternative is to create an "idle thread"
  - » Can put machine into low-power mode
- Exactly one ready thread easy
- More than one ready thread: use scheduling priorities
- · Possible priorities:
  - LIFO (last in, first out):
    - » put ready threads on front of list, remove from front
  - Pick one at random
  - FIFO (first in, first out):
    - » Put ready threads on back of list, pull them from front
    - » This is fair and is what Nachos does
  - Priority queue:
- % keep ready list sorted by TCB priority field

#### Summary

- $\cdot$  The state of a thread is contained in the TCB
  - Registers, PC, stack pointer
  - States: New, Ready, Running, Waiting, or Terminated
- Multithreading provides simple illusion of multiple CPUs
  - Switch registers and stack to dispatch new thread
  - Provide mechanism to ensure dispatcher regains control
- $\cdot$  Switch routine
  - Can be very expensive if many registers
  - Must be very carefully constructed!
- Many scheduling options
  - Decision of which thread to run complex enough for complete lecture

| 9/11/06 | Kubiatowicz CS162 ©UCB Fall 2006 | Lec 4.29 |
|---------|----------------------------------|----------|
|         |                                  |          |
|         |                                  |          |