DOC PREVIEW
Berkeley COMPSCI 152 - CS 152 Section 9

This preview shows page 1-2-23-24 out of 24 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 24 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 24 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 24 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 24 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 24 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

CS 152 Computer Architecture & EngineeringSlide 2Mystery DieSlide 4Alpha 21264 PipelineBranch PredictionSlide 721264 FetchSlide 921264 Register RenamingSlide 11Slide 12Slide 13Slide 14Slide 1521264 Superscalar ExecutionSlide 17Slide 18Slide 19Slide 20Slide 2121264 Instruction ReorderingMemory Ordering in the 21264Speculation in the 21264CS 152Computer Architecture & EngineeringAndrew WatermanUniversity of California, BerkeleySection 9Spring 2010•Hit rate vs. miss rate, AMAT•One writeup only per group on open-endedMystery Die•DEC Alpha 21264•15M transistors•600 MHz in 350 nm•Highly speculative OoO superscalarMystery Die•DEC Alpha 21264•15M transistors•600 MHz in 350 nm•Highly speculative OoO superscalarAlpha 21264 PipelineBranch Prediction•Two kinds of correlating branch predictors:Local GlobalPCPCLocal History TableLocal History TableBranch History TableBranch History TableGlobal HistoryGlobal HistoryBranch History TableBranch History TableBranch Prediction•21264 uses both! (tournament predictor)Local GlobalPredictionPCPCLocal History TableLocal History TableBranch History TableBranch History TableGlobal HistoryGlobal HistoryBranch History TableBranch History TableTournament PredictorTournament Predictor21264 Fetch•Line/way prediction keeps fetch loop shortAlpha 21264 Pipeline21264 Register Renaming•Registers are renamed, then instructions are inserted into the issue queue•Map table backed up on every in-flight insn21264 Register Renaming•What hazards does renaming obviate?•In what situations is renaming useful?•If you had to choose between branch prediction and renaming, which would you pick?21264 Register Renaming•What hazards does renaming obviate?–WAR, WAW•In what situations is renaming useful?•If you had to choose between branch prediction and renaming, which would you pick?21264 Register Renaming•What hazards does renaming obviate?–WAR, WAW•In what situations is renaming useful?–Code with ILP and name dependencies: loops•If you had to choose between branch prediction and renaming, which would you pick?21264 Register Renaming•What hazards does renaming obviate?–WAR, WAW•In what situations is renaming useful?–Code with ILP and name dependencies: loops•If you had to choose between branch prediction and renaming, which would you pick?–Not much ILP within a basic block, so renaming isn’t too useful without branch predictionAlpha 21264 Pipeline21264 Superscalar Execution•The 21264 can decode, rename, issue, execute, and commit 4 insns/cycle•How does circuit complexity scale with W in the following operations?–Instruction decode–Register renaming–Result bypassing21264 Superscalar Execution•The 21264 can decode, rename, issue, execute, and commit 4 insns/cycle•How does circuit complexity scale with W in the following operations?–Instruction decode: O(W)–Register renaming–Result bypassing21264 Superscalar Execution•The 21264 can decode, rename, issue, execute, and commit 4 insns/cycle•How does circuit complexity scale with W in the following operations?–Instruction decode: O(W)–Register renaming: O(W2)–Result bypassing21264 Superscalar Execution•The 21264 can decode, rename, issue, execute, and commit 4 insns/cycle•How does circuit complexity scale with W in the following operations?–Instruction decode: O(W)–Register renaming: O(W2)–Result bypassing: O(W2)21264 Superscalar Execution•The 21264 can decode, rename, issue, execute, and commit 4 insns/cycle•How does circuit complexity scale with W in the following operations?–Instruction decode: O(W)–Register renaming: O(W2)–Result bypassing: O(W2)•What about issue window complexity?21264 Superscalar Execution•21264 couldn’t fit full bypassing into one clock cycle•Instead, they fully bypass within each of two clusters; inter-cluster bypass takes another cycle21264 Instruction Reordering•As mentioned earlier, 21264 uses explicit renaming, as opposed to data-in-ROB design•What does ROB hold?Memory Ordering in the 21264•To execute the critical instruction path quickly, want to execute loads ASAP•Initially, loads speculatively bypass stores•On a misspeculation, set a “wait” bit for that load’s PC, so it will behave conservatively from then on•Clear wait bits periodicallySpeculation in the 21264•What does the 21264 speculate on?–Next I$ line/way–Branches, indirect jumps–Exceptions–Load/Store ordering–Load hit/miss•Shortens hit time by a cycle–Anything


View Full Document

Berkeley COMPSCI 152 - CS 152 Section 9

Documents in this Course
Quiz 5

Quiz 5

9 pages

Memory

Memory

29 pages

Quiz 5

Quiz 5

15 pages

Memory

Memory

29 pages

Memory

Memory

35 pages

Memory

Memory

15 pages

Quiz

Quiz

6 pages

Midterm 1

Midterm 1

20 pages

Quiz

Quiz

12 pages

Memory

Memory

33 pages

Quiz

Quiz

6 pages

Homework

Homework

19 pages

Quiz

Quiz

5 pages

Memory

Memory

15 pages

Load more
Download CS 152 Section 9
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view CS 152 Section 9 and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view CS 152 Section 9 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?