ETD system

Electronic theses and dissertations repository

 

Tesi etd-04162009-145440


Thesis type
Tesi di dottorato di ricerca
Author
SOLINAS, MARCO
URN
etd-04162009-145440
Title
Cache Architectures for Wire-Delay Dominated CMP Systems
Settore scientifico disciplinare
ING-INF/05
Corso di studi
INGEGNERIA DELL'INFORMAZIONE
Supervisors
Relatore Prof. Dini, Gianluca
Relatore Ing. Foglia, Pierfrancesco
Relatore Prof. Prete, Cosimo Antonio
Parole chiave
  • wire-delay
  • NUCA
  • coherence protocol
  • CMP
  • cache memory
Data inizio appello
29/05/2009;
Consultabilità
Parziale
Data di rilascio
29/05/2049
Riassunto analitico
Increasing on-chip wire delay and growing off-chip miss latency, present two key
challenges in designing large Level-2 (L2) CMP caches. Currently, some CMPs
use a shared L2 cache to maximize cache capacity and minimize off-chip misses.
Others use private L2 caches, replicating data to limit the delay from slow on-chip
wires and minimize cache access time. Ideally, to improve performance for a wide
variety of workloads, CMPs prefer both the capacity of a shared cache and the
access latency of private caches. In this context, NUCA caches have been proved
to be able to tolerate wire delay effects while maintaining a huge on-chip storage
capacity.
In this thesis, we investigate the choice of the coherence strategy (MESI and
MOESI) and the whole system topology as design tradeoffs for S-NUCA based
CMP system, and propose and evaluate a novel block migration scheme for DNUCA
based systems, in which are addressed two specific problems that can arise
due to the presence of multiple traffic sources.
Results show that, in S-NUCA based CMP systems, choosing between MESI and
MOESI has not a significant impact on performance, while the system topology can
lead to very different behaviors.
Block migration is introduced in NUCA cache to reduce access latency in a shared
cache. Our results show that the migration mechanism is effective in reducing the
average L1 miss latency, but the impact on performance is smaller, as a
consequence of the very little L1 miss rate.
File