logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-04162009-145440


Tipo di tesi
Tesi di dottorato di ricerca
Autore
SOLINAS, MARCO
URN
etd-04162009-145440
Titolo
Cache Architectures for Wire-Delay Dominated CMP Systems
Settore scientifico disciplinare
ING-INF/05
Corso di studi
INGEGNERIA DELL'INFORMAZIONE
Relatori
Relatore Prof. Dini, Gianluca
Relatore Ing. Foglia, Pierfrancesco
Relatore Prof. Prete, Cosimo Antonio
Parole chiave
  • cache memory
  • CMP
  • coherence protocol
  • NUCA
  • wire-delay
Data inizio appello
29/05/2009
Consultabilità
Parziale
Data di rilascio
29/05/2049
Riassunto
Increasing on-chip wire delay and growing off-chip miss latency, present two key
challenges in designing large Level-2 (L2) CMP caches. Currently, some CMPs
use a shared L2 cache to maximize cache capacity and minimize off-chip misses.
Others use private L2 caches, replicating data to limit the delay from slow on-chip
wires and minimize cache access time. Ideally, to improve performance for a wide
variety of workloads, CMPs prefer both the capacity of a shared cache and the
access latency of private caches. In this context, NUCA caches have been proved
to be able to tolerate wire delay effects while maintaining a huge on-chip storage
capacity.
In this thesis, we investigate the choice of the coherence strategy (MESI and
MOESI) and the whole system topology as design tradeoffs for S-NUCA based
CMP system, and propose and evaluate a novel block migration scheme for DNUCA
based systems, in which are addressed two specific problems that can arise
due to the presence of multiple traffic sources.
Results show that, in S-NUCA based CMP systems, choosing between MESI and
MOESI has not a significant impact on performance, while the system topology can
lead to very different behaviors.
Block migration is introduced in NUCA cache to reduce access latency in a shared
cache. Our results show that the migration mechanism is effective in reducing the
average L1 miss latency, but the impact on performance is smaller, as a
consequence of the very little L1 miss rate.
File