
CS 152 Computer Architecture and Engineering
Section 9, Spring 2010
Andrew Waterman, University of California, Berkeley

  Hit rate vs. miss rate; AMAT
  One writeup only per group on open-ended

Mystery Die
  DEC Alpha 21264
  15M transistors
  600 MHz in 350 nm
  Highly speculative OoO superscalar

Alpha 21264 Pipeline

Branch Prediction
  Two kinds of correlating branch predictors: local and global
  [Diagram labels: PC, Local History Table, Branch History Table, Global History]
  21264 uses both: tournament predictor
  [Diagram labels: PC, Local History Table, Branch History Table, Global History, Branch History Table, Tournament Predictor, Prediction]

21264 Fetch
  Line/way prediction keeps fetch loop short

21264 Register Renaming
  Registers are renamed, then instructions are inserted into the issue queue
  Map table backed up on every in-flight insn

21264 Register Renaming
  What hazards does renaming obviate?
    WAR, WAW
  In what situations is renaming useful?
    Code with ILP and name dependencies (loops)
  If you had to choose between branch prediction and renaming, which would you pick?
    Not much ILP within a basic block, so renaming isn't too useful without branch prediction

21264 Superscalar Execution
  The 21264 can decode, rename, issue, execute, and commit 4 insns/cycle
  How does circuit complexity scale with W in the following operations?
    Instruction decode: O(W)
    Register renaming: O(W^2)
    Result bypassing: O(W^2)
  What about issue window complexity?

21264 Superscalar Execution
  21264 couldn't fit full bypassing into one clock cycle
  Instead, they fully bypass within each of two clusters; inter-cluster bypass takes another cycle

21264 Instruction Reordering
  As mentioned earlier, 21264 uses explicit renaming, as opposed to a data-in-ROB design
  What does the ROB hold?

Memory Ordering in the 21264
  To execute the critical instruction path quickly, want to execute loads ASAP
  Initially, loads speculatively bypass stores
  On a misspeculation, set a wait bit for that load's PC so it will behave conservatively from then on
  Clear wait bits periodically

Speculation in the 21264
  What does the 21264 speculate on?
    Next I-cache line/way
    Branches, indirect jumps
    Exceptions
    Load/store ordering
    Load hit/miss (shortens hit time by a cycle)
    Anything else?
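The Register Renaming slides say that renaming removes WAR and WAW hazards and that the map table is backed up for every in-flight instruction. The sketch below shows one way that could look, assuming a simple map table and free list. The `rename` function, the `(dst, src1, src2)` instruction format, and the register counts are hypothetical. Physical registers are never freed here, which a real design would do at commit.

```python
def rename(instrs, num_arch=4, num_phys=16):
    """Rename (dst, src1, src2) architectural triples to physical registers.

    Returns the renamed triples and the map-table snapshot taken before each
    instruction, which is what a recovery mechanism could restore.
    Raises IndexError if the (never refilled) free list runs out.
    """
    map_table = list(range(num_arch))            # arch reg -> phys reg
    free_list = list(range(num_arch, num_phys))  # unallocated phys regs
    renamed, snapshots = [], []
    for dst, src1, src2 in instrs:
        snapshots.append(list(map_table))        # back up before changing
        p1, p2 = map_table[src1], map_table[src2]  # read sources first
        pd = free_list.pop(0)                    # fresh destination register
        map_table[dst] = pd
        renamed.append((pd, p1, p2))
    return renamed, snapshots


program = [
    (1, 2, 3),  # r1 = r2 op r3
    (2, 0, 0),  # r2 = r0 op r0   (WAR on r2 with the first instruction)
    (1, 2, 3),  # r1 = r2 op r3   (WAW on r1; true RAW on the new r2)
]
renamed, snapshots = rename(program)
```

After renaming, every destination is a distinct physical register, so only the true dependence (the third instruction reading the second one's result) remains.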
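The Memory Ordering slide describes the wait-bit policy: loads bypass stores speculatively, a misspeculating load's PC gets a wait bit, and the bits are cleared periodically. A minimal sketch of that bookkeeping follows. The `LoadWaitTable` name, the table size, the PC indexing, and the clearing interval are all assumptions made for illustration.

```python
class LoadWaitTable:
    """Toy table of per-PC wait bits for load/store speculation.

    Sizes and the clearing interval are illustrative assumptions.
    """

    def __init__(self, entries=1024, clear_interval=16384):
        self.bits = [False] * entries
        self.clear_interval = clear_interval
        self.cycles = 0

    def _index(self, pc):
        # Index by low PC bits; different loads may alias to the same entry.
        return (pc >> 2) % len(self.bits)

    def may_bypass_stores(self, load_pc):
        """True if this load may issue ahead of older stores."""
        return not self.bits[self._index(load_pc)]

    def record_misspeculation(self, load_pc):
        """The load bypassed a store it depended on: be conservative next time."""
        self.bits[self._index(load_pc)] = True

    def tick(self, cycles=1):
        """Advance time and clear all wait bits once per interval."""
        self.cycles += cycles
        if self.cycles >= self.clear_interval:
            self.bits = [False] * len(self.bits)
            self.cycles = 0
```

Clearing periodically gives a load a second chance to speculate if the store it once conflicted with is no longer nearby.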



