## Processes and Threads Implementation



## **Learning Outcomes**

- An understanding of the typical implementation strategies of processes and threads
  - Including an appreciation of the trade-offs between the implementation approaches
    - · Kernel-threads versus user-level threads
- · A detailed understanding of "context switching"



2

## Summary: The Process Model One program counter Process switch Process switch Four program counters Summary: The Process Model One program counter Four program counters Summary: The Process Model Four program counters Four program counters Summary: The Process Model Four program counters Four program counters Summary: The Process Model Four program counters Four program

Only one program active at any instant



# Drocesses User-mode Processes (programs) scheduled by the kernel Isolated from each other No concurrency issues between each other System-calls transition into and return from the kernel Kernel-mode Nearly all activities still associated with a process Kernel memory shared between all processes Concurrency issues exist between processes concurrently executing in a system call















### **User-level Threads**

- Implementation at user-level
  - User-level Thread Control Block (TCB), ready queue, blocked queue, and dispatcher
  - Kernel has no knowledge of the threads (it only sees a single process)
  - If a thread blocks waiting for a resource held by another thread inside the same process, its state is saved and the dispatcher switches to another ready thread
  - Thread management (create, exit, yield, wait) are implemented in a runtime support library IST OF THE UNIVERSITY OF THE PROPERTY OF THE PROP

## **User-Level Threads**

- Pros
  - Thread management and switching at user level is much faster than doing it in kernel level
    - No need to trap (take syscall exception) into kernel and back to switch
  - Dispatcher algorithm can be tuned to the application
  - E.g. use priorities
  - Can be implemented on any OS (thread or non-thread aware)
  - Can easily support massive numbers of threads on a perapplication basis
    - Use normal application virtual memory
    - Kernel memory more constrained. Difficult to efficiently support wildly differing numbers of threads for different applications.



14

## **User-level Threads**

- Cons
  - Threads have to yield() manually (no timer interrupt delivery to user-level)
    - · Co-operative multithreading
      - A single poorly design/implemented thread can monopolise the available CPU time
    - There are work-arounds (e.g. a timer signal per second to enable pre-emptive multithreading), they are course grain and a kludge.
  - Does not take advantage of multiple CPUs (in reality, we still have a single threaded process as far as the kernel is concerned)



15

# User-Level Threads • Cons - If a thread makes a blocking system call (or takes a page fault), the process (and all the internal threads) blocks • Can't overlap I/O with computation User Mode Scheduler Scheduler Frocess Scheduler





## **Kernel Threads**

- Threads are implemented in the kernel
  - TCBs are stored in the kernel
    - · A subset of information in a traditional PCB
      - The subset related to execution context
    - · TCBs have a PCB associated with them
      - Resources associated with the group of threads (the process)
  - Thread management calls are implemented as system calls
    - · E.g. create, wait, exit



## **Kernel Threads**

- Cons
  - Thread creation and destruction, and blocking and unblocking threads requires kernel entry and exit.
    - More expensive than user-level equivalent
- Pros
  - Preemptive multithreading
  - Parallelism
    - Can overlap blocking I/O with computation
    - · Can take advantage of a multiprocessor





## Multiprogramming Implementation

- 1. Hardware stacks program counter, etc.
- 2. Hardware loads new program counter from interrupt vector.
- 3. Assembly language procedure saves registers
- Assembly language procedure sets up new stack.
   Cinterrupt service runs (typically reads and buffers input).
- 6. Scheduler decides which precess is to run next . O procedure returns to the assembly code
- 8. Assembly language procedure starts up new current process

Skeleton of what lowest level of OS does when an interrupt occurs - a context switch



22

## Context Switch Terminology

- · A context switch can refer to
  - A switch between threads
    - · Involving saving and restoring of state associated with a thread
  - A switch between processes
    - · Involving the above, plus extra state associated with a process.
      - E.g. memory maps



23

### Context Switch Occurrence

- · A switch between process/threads can happen any time the OS is invoked
  - On a system call
  - Mandatory if system call blocks or on exit();
  - On an exception
    - · Mandatory if offender is killed
  - On an interrupt
    - Triggering a dispatch is the main purpose of the timer

A thread switch can happen between any two instructions

Note instructions do not equal program statements





- ⇒OS must save all state that affects the thread
- This state is called the process/thread context
- Switching between process/threads consequently results in a context switch.



25































```
OS/161 switchframe_switch

switchframe_switch:

/*

a0 contains the address of the switchframe pointer in the old thread.

a1 contains the address of the switchframe pointer in the new thread.

The switchframe pointer is really the stack pointer. The other registers get saved on the stack, namely:

S0-s6, s8
gp. ra

The order must match <mips/switchframe.h>.

Note that while we'd ordinarily need to save s7 too, because we use it to hold curthread saving it would interfere with the way curthread is managed by thread.c. So we'll just let thread.c

manage it.

TILININISSITY OF NEW SOUTH WALES
```



```
OS/161 switchframe_switch

/* Get the new stack pointer from the new thread '/
|w sp. 0(a1)
|nop /* delay slot for load '/

/* Now, restore the registers '/
|w s0, 0(sp)
|w s1, 4(sp)
|w s2, 8(sp)
|w s2, 8(sp)
|w s3, 12(sp)
|w s4, 16(sp)
|w s5, 22(sp)
|w s6, 24(sp)
|w s6, 24(sp)
|w s8, 28(sp)
|w gp. 32(sp)
|w gp. 32(sp)
|w gp. 32(sp)
|w ra, 36(sp)
|nop /* delay slot for load '/
```



