Cache Memories 15 213 Cache memories are small fast SRAM based memories managed automatically in hardware The course that gives CMU its Zip Cache Memories Feb 25 2003 Hold frequently accessed blocks of main memory CPU looks first for data in L1 then in L2 then in main memory Typical bus structure Topics CPU chip Generic cache memory organization Direct mapped caches Set associative caches Impact of caches on performance cache bus L2 cache Inserting an L1 Cache Between the CPU and Main Memory The tiny very fast CPU register file has room for four 4 byte words The transfer unit between the CPU register file and the cache is a 4 byte block 3 memory bus system bus main memory I O bridge bus interface 15 213 S 03 2 class13 ppt The transfer unit between the cache and main memory is a 4 word block 16 bytes register file L1 ALU cache line 0 line 1 The small fast L1 cache has room for two 4 word blocks General Org of a Cache Memory Each set contains one or more lines set 0 Each line holds a block of data S block 10 t tag bits per line Cache is an array of sets 2s set 1 sets abcd block 21 pqrs block 30 wxyz tag valid tag valid tag valid tag 0 0 1 B 1 1 B 1 0 0 1 B 1 1 B 1 1 B 1 1 B 1 E lines per set The big slow main memory has room for many 4 word blocks set S 1 15 213 S 03 valid B 2b bytes per cache block 4 1 valid bit per line valid tag valid tag 0 0 Cache size C B x E x S data bytes 15 213 S 03 Addressing Caches Addressing Caches Address A t bits set 0 set 1 v tag v tag v tag v tag v tag v tag 0 1 B 1 0 1 B 1 m 1 0 tag set index block offset 0 1 B 1 0 1 B 1 set S 1 s bits Address A t bits b bits 0 1 B 1 0 1 B 1 set 0 set 1 The word at address A is in the cache if the tag bits in one of the valid lines in set set index match tag v tag v tag v tag v tag v tag v tag 0 1 B 1 0 1 B 1 0 1 B 1 0 1 B 1 The word contents begin at offset block offset bytes from the beginning of the block 15 213 S 03 5 b bits m 1 0 tag set index block offset 0 1 B 1 0 1 B 1 set S 1 s bits 1 Locate the set based on set index 2 Locate the line in the set based on tag 3 Check that the line is valid 4 Locate the data in the line based on block offset 15 213 S 03 6 Direct Mapped Cache Accessing Direct Mapped Caches Simplest kind of cache Set selection Characterized by exactly one line per set set 0 valid tag cache block set 1 valid tag cache block E 1 lines per set Use the set index bits to determine the set of interest selected set set 0 valid tag cache block set 1 valid tag cache block set S 1 valid tag set S 1 valid cache block t bits Cache size C B x S data bytes 7 15 213 S 03 m 1 8 tag s bits 00 001 set index cache block tag b bits block offset 0 15 213 S 03 Accessing Direct Mapped Caches Accessing Direct Mapped Caches Line matching and word selection Line matching and word selection Line matching Find a valid line in the selected set with a matching tag Word selection Then extract the word Line matching Find a valid line in the selected set with a matching tag Word selection Then extract the word 1 1 The valid bit must be set 0 selected set i 1 2 The tag bits in the cache line must match the tag bits in the address m 1 1 2 3 4 5 6 7 0110 w0 If 1 and 2 then cache hit t bits 0110 tag w1 w2 w3 15 213 S 03 M 16 byte addresses B 2 bytes block S 4 sets E 1 entry set Address trace reads 0 1 7 8 0 00002 00012 01112 10002 00002 v tag data 0 1 1 0 M 8 9 M 0 1 1 0 M 6 7 1 1 2 0110 3 4 w0 5 6 7 w1 w2 w3 3 If cache hit block offset selects starting byte m 1 Direct Mapped Cache Simulation 11 selected set i s bits b bits i 100 set index block offset0 9 t 1 s 2 b 1 x xx x 0 t bits 0110 tag s bits b bits i 100 set index block offset0 15 213 S 03 10 Set Associative Caches Characterized by more than one line per set miss hit miss miss set 0 set 1 miss valid tag cache block valid tag cache block valid tag cache block valid tag cache block E 2 lines per set set S 1 valid tag cache block valid tag cache block E way associative cache 15 213 S 03 12 15 213 S 03 Accessing Set Associative Caches Accessing Set Associative Caches Set selection Line matching and word selection identical to direct mapped cache set 0 selected set set 1 valid tag valid tag valid tag valid tag cache block cache block cache block cache block set S 1 t bits m 1 valid tag valid tag s bits block offset Accessing Set Associative Caches Line matching and word selection 0 1 1001 0110 selected set i m 1 15 1 0110 1 2 3 4 w0 5 6 w1 w2 3 4 5 6 w1 w2 7 w3 If 1 and 2 then cache hit s bits b bits i 100 set index block offset0 15 213 S 03 2 Way Associative Cache Simulation M 16 byte addresses B 2 bytes block S 2 sets E 2 entry set Address trace reads 00002 miss 00012 hit 01112 miss 10002 miss hit 00002 0 1 7 8 0 7 w3 v 0 1 0 1 0 1 0 s bits b bits i 100 set index block offset0 15 213 S 03 2 w0 t bits 0110 tag 1 14 3 If cache hit block offset selects starting byte t bits 0110 tag 1001 2 The tag bits in one of the cache lines must match the tag bits in the address t 2 s 1 b 1 xx x x Word selection is the same as in a direct mapped cache 1 0 1 m 1 15 213 S 03 selected set i 1 1 The valid bit must be set 0 13 must compare the tag in each valid line in the selected set b bits 00 001 set index tag cache block …
View Full Document