Berkeley COMPSCI 262A - Speculative Execution - D1638826

Home> Schools> University of California, Berkeley> Computer Science (COMPSCI) > COMPSCI 262A> Speculative Execution

DOC PREVIEW

Berkeley COMPSCI 262A - Speculative Execution

School name University of California, Berkeley

Course Compsci 262a- Advanced Topics in Computer Systems

Pages 4

This preview shows page 1 out of 4 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 4 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 4 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

Speculative ExecutionI. BackgroundII. SpeculatorIII. ImplementationIV. Using speculationV. Evaluation1SpeculatorAdvanced Topics in Computer Systems, CS262AProf. Eric Brewer11/23/09Speculative ExecutionI. BackgroundLong tradition of research on speculation for computer architecture: o branch predictiono value prediction (!)But these speculations have limited upside... what about speculation at the application level?Insight: OS controls side effects, so OS can limit the impact of speculation!o E.g., delay printf until the computation is really going to happen for sureo If cancelled, we just delete itStart with the file system: it is often synchronous for actions that work as expected such as cache hitso we can speculate that we will hit, and then recover if we are wrongo synchronization delays are the cost of consistencyo distributed file systems (DFS) have to choose between these delays and weakerconsistency: they generally weaken consistencyNFS provides “close-to-open” consistency (weaker):o when you open a file you will see all changes from others that have closed that fileo plus a 30-second delay!o do not see changes until the file is closed, which reduces the communication overheadStrict synchronization is pessimistic:o most file operations do not conflicto Speculator is optimisticBasic operation:o when the OS reaches a blocking operationo 1) checkpoint state (to enable recovery)o 2) speculate on the answero 3) continue execution immediately using this answero 4) when real answer comes in, either continue along, or abort and restart from thecheckpointSpeculator2II. SpeculatorKey invariants:o Know when a process is speculativeo Speculative processes cannot externalize output (just queue the output)o Track speculative operations as they affect other processes! (they become speculativetoo)• fork, exit, signals, pepes, fifos, sockets, local file, distributed files• other forms of IPC just block until speculation is resolvedo No modification of applications!Performance: 2x over local networks, 10x over wide-area networksNFS normal behavior:o async writes of the data to the servero sync commit RPC (=> round trip plus disk write)o speculator guesses that all writes succeed and so does the commito on open, speculator guesses that cached copy is still validThree characteristics:o Speculation is usually right: for DFS, only wrong in the case of conflicts, which arerareo checkpointing is faster than remote I/O• copy-on-write fork operation is fast; most pages not copied• child only executes if speculation was wrong.• take and discard a checkpoint for a small process = 52us !• big process = 6mso Spare CPU for speculation (low opportunity cost)III. ImplementationLinux 2.4.21, 7500 lines of Co create_speculation -> spec_ido commit/fail callsCreating a speculationo copy-on-write fork (save the child for abort)o save the state of open file descriptors, copy pending signals (to replay them)o child is not runnable3Speculatoro simply discard the child (reclaim pages) if speculation commitso aborted processes are killed next time they try to runo child process assumes the identity of the failed process• process id, thread group id• restore file desciptors and signals• reexcute system call that caused speculation (would it speculate again?)Two new data structures in the kernel:o 1) track set of kernel objects that depend on speculation (so you can reclaim them)o 2) an undo log for each kernel objecto On a nested speculation, you can reuse the undo log (but may undo more work thannecessary)• if more than 500ms has passed create a new undo log, so that you won’t undo too much work• if the first operation has side effects (like mkdir), then better to create a second log. If the first opera-tion succeeds its side effects are visible, so we don’t want to undo them!Single process correctness:o Invariant: speculative state never visible to external device or user• screen output, network packets, IPC, etc.o Invariant: a process should never see speculative state unless it is explicitly dependenton that state (or its producer).• If it does see the state, and that speculation fails, then this process must fail too• Easiest solution is to block new process until speculation resolves (but this limits parallelism)• Basic version: make a special syscall vector table that blocks all system calls that would externalize output• V2: allows syscalls that are read only• V3: allows syscalls that modify only private state to this process (example: dup2)• V4: allows syscalls to speculative file systems• V5: allows read/write to speculative files• V6: buffer write call output until speculation resolves (to screen or network)Multi-process speculation:o DFS: failure => invalidate cached copyo Ramdisk: rmdir => keep around old version so you can put it backo ext3: never write speculative data to disk (no steal!). On failure, can just delete thisbuffer page rather than write it out (no undo log for the disk)o problem with no steal: some pages may never get written back (e.g. superblock)• solution: use redo/undo functions and keep two version (spec and nonspec). update non-spec on com-mits and only write it outo reorder compound ext3 transactions so that speculative actions are postponedSpeculator4o pipes: need to track create/delete as well as read/writeo signals: can’t checkpoint the receiver at an arbitrary point (why?)• instead: cause receiver to checkpoint soon• deliver a “pending” signal, but take no action• when process returns from the kernel, the pending signals become real and are handled• checkpoint is taken a this time (which is safe)o exit: keep around pid until process exit is confirmedIV. Using speculationGeneral case:o speculate on something in cache (if not in cache, then don’t speculate)o convert sync call to verify cache entry into async callo start speculationo on reply to async call, decide commit/failHarder for mutating operations!o side effects of speculation could become visible to otherso key solution: server need not speculate! It knows if the hypothesis is wrong and can actaccordinglyo for each RPC, include list of hypotheses for server to validateGroup commit:o speculation leads to many outstanding “synchronous” operationso can do one write for the group, rather than one write each!o just delay a bit and the client will send you more work, which you can amortize withone actual write• great for journals, does it matter

View Full Document


School:
Email:
New Password:
Confirm Password:

This preview shows page 1 out of 4 pages.

Berkeley COMPSCI 262A - Speculative Execution

Sign up for free to view:

Please select your school