Today: I/O Systems

• How does I/O hardware influence the OS?
• What I/O services does the OS provide?
• How does the OS implement those services?
• How can the OS improve the performance of I/O?

Architecture of I/O Systems

• Key components
  – **System bus**: allows the device to communicate with the CPU, typically shared by multiple devices.
  – A device **port** typically consisting of 4 registers:
    • **Status**: indicates a device busy, data ready, or error condition
    • **Control**: command to perform
    • **Data-in**: data being sent from the device to the CPU
    • **Data-out**: data being sent from the CPU to the device
  – **Controller**: receives commands from the system bus, translates them into device actions, and reads/writes data onto the system bus.
  – The device itself

• Traditional devices: disk drive, printer, keyboard, modem, mouse, display
• Non-traditional devices: joystick, robot actuators, flying surfaces of an airplane, fuel injection system of a car, ...
PCI Bus Structure

Kernel I/O Subsystem
Device I/O Port location on PCs

<table>
<thead>
<tr>
<th>I/O address range (hexadecimal)</th>
<th>device</th>
</tr>
</thead>
<tbody>
<tr>
<td>000-00F</td>
<td>DMA controller</td>
</tr>
<tr>
<td>020-021</td>
<td>interrupt controller</td>
</tr>
<tr>
<td>040-043</td>
<td>timer</td>
</tr>
<tr>
<td>200-20F</td>
<td>game controller</td>
</tr>
<tr>
<td>2F8-2FF</td>
<td>serial port (secondary)</td>
</tr>
<tr>
<td>320-32F</td>
<td>hard-disk controller</td>
</tr>
<tr>
<td>378-37F</td>
<td>parallel port</td>
</tr>
<tr>
<td>3D0-3DF</td>
<td>graphics controller</td>
</tr>
<tr>
<td>3F0-3F7</td>
<td>diskette-drive controller</td>
</tr>
<tr>
<td>3F8-3FF</td>
<td>serial port (primary)</td>
</tr>
</tbody>
</table>

I/O Services Provided by OS

- Naming of files and devices. (On Unix, devices appear as files in the /dev directory)
- Access control.
- Operations appropriate to the files and devices.
- Device allocation.
- Buffering, caching, and spooling to allow efficient communication with devices.
- I/O scheduling.
- Error handling and failure recovery associated with devices (command retries, for example).
- Device drivers to implement device-specific behaviors.
Communication using Polling

- CPU busy-waits until the status is idle.
- CPU sets the command register and data-out if it is an output operation.
- CPU sets status to command-ready => controller sets status to busy.
- Controller reads the command register and performs the command, placing a value in data-in if it is an input command.
- If the operation succeeds, the controller changes the status to idle.
- CPU observes the change to idle and reads the data if it was an input operation.
- Good choice if data must be handled promptly, like for a modem or keyboard.
- What happens if the device is slow compared to the CPU?

Communication using Interrupts

- Rather than using busy waiting, the device can interrupt the CPU when it completes an I/O operation.
- On an I/O interrupt:
  - Determine which device caused the interrupt.
  - If the last command was an input operation, retrieve the data from the device register.
  - Start the next operation for that device.
Intel X86 Event Vectors

<table>
<thead>
<tr>
<th>vector number</th>
<th>description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>divide error</td>
</tr>
<tr>
<td>1</td>
<td>debug exception</td>
</tr>
<tr>
<td>2</td>
<td>null interrupt</td>
</tr>
<tr>
<td>3</td>
<td>breakpoint</td>
</tr>
<tr>
<td>4</td>
<td>INTO-detected overflow</td>
</tr>
<tr>
<td>5</td>
<td>bound range exception</td>
</tr>
<tr>
<td>6</td>
<td>invalid opcode</td>
</tr>
<tr>
<td>7</td>
<td>device not available</td>
</tr>
<tr>
<td>8</td>
<td>double fault</td>
</tr>
<tr>
<td>9</td>
<td>coprocessor segment overrun (reserved)</td>
</tr>
<tr>
<td>10</td>
<td>invalid task state segment</td>
</tr>
<tr>
<td>11</td>
<td>segment not present</td>
</tr>
<tr>
<td>12</td>
<td>stack fault</td>
</tr>
<tr>
<td>13</td>
<td>general protection</td>
</tr>
<tr>
<td>14</td>
<td>page fault</td>
</tr>
<tr>
<td>15</td>
<td>(Intel reserved, do not use)</td>
</tr>
<tr>
<td>16</td>
<td>floating-point error</td>
</tr>
<tr>
<td>17</td>
<td>alignment check</td>
</tr>
<tr>
<td>18</td>
<td>machine check</td>
</tr>
<tr>
<td>19D31</td>
<td>(Intel reserved, do not use)</td>
</tr>
<tr>
<td>32D255</td>
<td>maskable interrupts</td>
</tr>
</tbody>
</table>

Direct Memory Access

- For devices that transfer large volumes of data at a time (like a disk block), it is expensive to have the CPU retrieve these one byte at a time.
- **Solution:** Direct memory access (DMA)
  - Use a sophisticated DMA controller that can write directly to memory. Instead of data-in/data-out registers, it has an address register.
  - The CPU tells the DMA the locations of the source and destination of the transfer.
  - The DMA controller operates the bus and interrupts the CPU when the entire transfer is complete, instead of when each byte is ready.
  - The DMA controller and the CPU compete for the memory bus, slowing down the CPU somewhat, but still providing better performance than if the CPU had to do the transfer itself.
Application Programmer's View of I/O Devices

- The OS provides a high-level interface to devices, greatly simplifying the programmer's job.
  - Standard interfaces are provided for related devices.
  - Device dependencies are encapsulated in device drivers.
  - New devices can be supported by providing a new device driver.

**Device characteristics:**
- Transfer unit: **character** or **block**
- Access method: sequential or random access
- Timing: synchronous or asynchronous.
  - Most devices are asynchronous, while I/O system calls are synchronous => The OS implements **blocking I/O**
- Sharable or dedicated
- Speed
- Operations: Input, output, or both
- Examples: keyboard (sequential, character), disk (block, random or sequential)

### Examples of I/O Device types

<table>
<thead>
<tr>
<th>aspect</th>
<th>variation</th>
<th>example</th>
</tr>
</thead>
<tbody>
<tr>
<td>data-transfer mode</td>
<td>character</td>
<td>terminal</td>
</tr>
<tr>
<td></td>
<td>block</td>
<td>disk</td>
</tr>
<tr>
<td>access method</td>
<td>sequential</td>
<td>modem</td>
</tr>
<tr>
<td></td>
<td>random</td>
<td>CD-ROM</td>
</tr>
<tr>
<td>transfer schedule</td>
<td>synchronous</td>
<td>tape</td>
</tr>
<tr>
<td></td>
<td>asynchronous</td>
<td>keyboard</td>
</tr>
<tr>
<td>sharing</td>
<td>dedicated</td>
<td>tape</td>
</tr>
<tr>
<td></td>
<td>sharable</td>
<td>keyboard</td>
</tr>
<tr>
<td>device speed</td>
<td>latency</td>
<td>CD-ROM</td>
</tr>
<tr>
<td></td>
<td>seek time</td>
<td>graphics controller</td>
</tr>
<tr>
<td></td>
<td>transfer rate</td>
<td>disk</td>
</tr>
<tr>
<td></td>
<td>delay between operations</td>
<td></td>
</tr>
</tbody>
</table>
Block and Character devices

- Block devices include disk drives
  - Commands include read, write, seek
  - Raw I/O or file-system access
  - Memory-mapped file access possible

- Character devices include keyboards, mice, serial ports
  - Commands include `get`, `put`
  - Libraries layered on top allow line editing

I/O Buffering

I/O devices typically contain a small on-board memory where they can store data temporarily before transferring to/from the CPU.

- A disk buffer stores a block when it is read from the disk.
- It is transferred over the bus by the DMA controller into a buffer in physical memory.
- The DMA controller interrupts the CPU when the transfer is done.
Why buffer on the OS side?

- To cope with speed mismatches between device and CPU.
  - Example: Compute the contents of a display in a buffer (slow) and then zap the buffer to the screen (fast)

- To cope with devices that have different data transfer sizes.
  - Example: ftp brings the file over the network one packet at a time. Stores to disk happen one block at a time.

- To minimize the time a user process is blocked on a write.
  - Writes => copy data to a kernel buffer and return control to the user program. The write from the kernel buffer to the disk is done later.

Caching

- Improve disk performance by reducing the number of disk accesses.
  - Idea: keep recently used disk blocks in main memory after the I/O call that brought them into memory completes.
  - Example: Read (diskAddress)
    If (block in memory) return value from memory
    Else ReadSector(diskAddress)
  - Example: Write (diskAddress)
    If (block in memory) update value in memory
    Else Allocate space in memory, read block from disk, and update value in memory

- What should happen when we write to a cache?
  - write-through policy (write to all levels of memory containing the block, including to disk). High reliability.
  - write-back policy (write only to the fastest memory containing the block, write to slower memories and disk sometime later). Faster.
Putting the Pieces Together - a Typical Read Call

1. User process requests a read from a device.
2. OS checks if data is in a buffer. If not,
   a) OS tells the device driver to perform input.
   b) Device driver tells the DMA controller what to do and blocks itself.
   c) DMA controller transfers the data to the kernel buffer when it has all been retrieved from the device.
   d) DMA controller interrupts the CPU when the transfer is complete.
3. OS transfers the data to the user process and places the process in the ready queue.
4. When the process gets the CPU, it begins execution following the system call.

Steps in DMA transfer

1. device driver is told to transfer disk data to buffer at address X
2. device driver tells disk controller to transfer C bytes from disk to buffer at address X
3. disk controller initiates DMA transfer
4. disk controller sends each byte to DMA controller
Summary

- I/O is expensive for several reasons:
  - Slow devices and slow communication links
  - Contention from multiple processes.
  - I/O is typically supported via system calls and interrupt handling, which are slow.

- Approaches to improving performance:
  - Reduce data copying by caching in memory
  - Reduce interrupt frequency by using large data transfers
  - Offload computation from the main CPU by using DMA controllers.
  - Increase the number of devices to reduce contention for a single device and thereby improve CPU utilization.
  - Increase physical memory to reduce amount of time paging and thereby improve CPU utilization.