DOC PREVIEW
UW CSE 378 - Study Notes

This preview shows page 1-2-3-24-25-26 out of 26 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 26 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 26 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 26 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 26 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 26 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 26 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 26 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

06/04/03 CSE 378 I/O 1Input-output•I/O is very much architecture/system dependent•I/O requires cooperation between–processor that issues I/O command (read, write etc.)–buses that provide the interconnection between processor, memory and I/O devices–I/O controllers that handle the specifics of control of each device and interfacing– devices that store data or signal events06/04/03 CSE 378 I/O 2Basic (simplified) I/O architectureCPUCacheM.Cont. D.Cont. N.InterfaceMain memoryDisks NetworkBus06/04/03 CSE 378 I/O 3Types of I/O devices•Input devices–keyboard, mouse• Output devices–screen, line printer•Devices for both input and output–disks, network interfaces06/04/03 CSE 378 I/O 4An important I/O device: the disktracksectorDisk surfaceRead-write headsplattersCylinder06/04/03 CSE 378 I/O 5Secondary memory (disks)•Physical characteristics–Platters (1 to 20) with diameters from 1.3 to 8 inches (recording on both sides)–Tracks (1,000 to 10,000)–Cylinders (all the tracks in the same position in the platters)– Sectors (e.g., 128-256 sectors/track with gaps and info related to sectors between them; typical sector 512 bytes)– Current trend: constant bit density, i.e., more info (sectors) on outer tracks06/04/03 CSE 378 I/O 6Example: IBM Ultrastar 146Z10•Disk for server–146 GB– 8 MB cache–10,000 RPM– 3 ms average latency– Up to 6 platters; Up to 12 heads– Average seek latency 4.7 ms– Sustained transfer rate 33-66 MB/s06/04/03 CSE 378 I/O 7Disk access time•Arm(s) with a reading/writing head• Four components in an access:– Seek time (to move the arm on the right cylinder). From 0 (if arm already positioned) to a maximum of 15-20 ms. Not a linear function. Smaller disks have smaller seek times. Ultrastar example: Average seek time = 4.7 ms; •My guess: track to track 0.5 ms; longest (inmost strack to outmost track) 8 ms•Rotation time (on the average 1/2 rotation). At 3600 RPM, 8.3 ms. Current disks are 3600 or 5400 or 7200 or even 10,000 RPM (e.g., the Ultrastar, hence average is 3 ms)06/04/03 CSE 378 I/O 8Disk access time (ct’d)– Transfer time depends on rotation time, amount to transfer (minimal size a sector), recording density, disk/memory connection. Today, transfer time occurs at 6 to 66 MB/second–Disk controller time. Overhead to perform an access (of the order of 1 ms)– But … many disk controllers have a cache that contains recently accessed sectors. If the I/O requests hits in the cache, the only components of access time are disk controller time and transfer time (which is then of the order of 40 MB/sec). Cache is used to prefetch on read.06/04/03 CSE 378 I/O 9Improvements in disks•Capacity (via density). Same growth rate as DRAMs•Price decrease has followed (today $5-$50/GB?)•Access times have decreased but not enormously–Higher density -> smaller drives -> smaller seek time– RPM has increased slightly 3600 upto 10,000 (rarely) –Transfer time has improved • CPU speed - DRAM access is one “memory wall”• DRAM access time - Disk access time is a “memory gap”– Technologies to fill the gap have not succeeded (currently the most promising is more DRAM backed up by batteries)06/04/03 CSE 378 I/O 10Connecting CPU, Memory and I/OCPUCacheCPU-Memory busI/O busBus adapterMain memoryI/O contr. I/O contr. I/O contr.disk GraphicsNetwork06/04/03 CSE 378 I/O 11Buses•Simplest interconnect– Low cost: set of shared wires–Easy to add devices (although variety of devices might make the design more complex or less efficient -- longer bus and more electrical load; hence the distinction between I/O buses and CPU/memory buses)–But bus is a single shared resource so can get saturated (both physically because of electrical load, and performance-wise because of contention to access it )•Key parameters:– Width (number of lines:data, addresses, control)– Speed (limited by length and electrical load)06/04/03 CSE 378 I/O 12Memory and I/O buses•CPU/memory bus: tailored to the particular CPU–Fast (separate address and data lines; of course separate control lines)–Often short and hence synchronous (governed by a clock)–Wide (64-128 and even 256 bits)– Expensive•I/O bus: follows some standard so many types of devices can be hooked on to it– Asynchronous (hand-shaking protocol)–Narrower06/04/03 CSE 378 I/O 13Bus transactions•Consists of arbitration and commands–Arbitration: who is getting control of the bus– Commands: type of transaction (read, write, ack, etc…)•Read, Write, Atomic Read-Modify-Write (atomic swap) –Read: send address and data is returned– Write: send address and data– Read-Modify-write : keep bus during the whole transaction. Used for synchronization between processes06/04/03 CSE 378 I/O 14Bus arbitration•Arbitration: who gets the bus if several requests occur at the same time–Only one master (processor): centralized arbitration– Multiple masters (most common case): centralized arbitration (FIFO, daisy-chain, round-robin, combination of those) vs. decentralized arbitration (each device knows its own priority)•Communication protocol between master and slave–Synchronous (for short buses - no clock skew - i.e. CPU/memory)– Asynchronous (hand-shaking finite-state machine; easier to accommodate many devices)06/04/03 CSE 378 I/O 15Hand-shaking protocol•Example : Master (CPU) requests data from Slave (Mem)1. Master transmits a read request (control lines) and address (address/data lines)2. Slave recognizes the request. Grabs the address and raises the Ack control line.3. Master sees the Ack line high. Releases the request and data lines4. Slave sees the Read request low. Releases the Ack line5. Slave is ready to transmit data. Places data on data lines and raises Data ready (control line)6. Master sees Data ready high. Grabs data and raises Ack7. Slave sees Ack high. Releases data line and Data Ready8. Master sees Data Ready low. Releases Ack. Transaction is finished06/04/03 CSE 378 I/O 16Split-transaction buses•Split a read transaction into–Send address (CPU is master)– Send data (Memory is master)–In between these two transactions (memory access time) the bus is freed– Requires “tagging” the transaction •Can even have more concurrency by having different transactions using the data and address lines concurrently•Useful for multiprocessor systems and for I/O06/04/03 CSE 378 I/O 17I/O Hardware-software interface•I/O is best left


View Full Document

UW CSE 378 - Study Notes

Documents in this Course
Encoding

Encoding

20 pages

Load more
Download Study Notes
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Study Notes and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Study Notes 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?