(just add at and dots to get address)
The end of Dennard scaling is expected to shrink the range of DVFS in future nodes, limiting the energy savings of this technique. This paper evaluates how much we can increase the effectiveness of DVFS by using a software decoupled access-execute approach. Decoupling the data access from execution allows us to apply optimal voltage-frequency selection for each phase and therefore improve energy efficiency over standard coupled execution.
Virtual-cache coherence: Stefanos Kaxiras and Alberto Ros, "A New Perspective for Efficient Virtual-Cache Coherence" ISCA 2013
Directory-less NoC coherence: Alberto Ros, Stefanos Kaxiras, "Complexity-Effective Multicore Coherence" Parallel Architectures and Compilation Techniques (PACT) 2012
Snoop-less bus coherence: Stefanos Kaxiras, Alberto Ros "Efficient, Snoopless, System-On-Chip Coherence" IEEE System on Chip Conference (SOCC) 2012
Submitted for review: "Managing Power Constraints in a Single-core Scenario through Power Tokens" Juan M. Cebrian, Juan L. Aragon, Jose M. Garcia, Pavlos Petoumenos and Stefanos Kaxiras
"Leakage-efficient Design of Value Predictors through State and Non-state Preserving Techniques" Juan M. Cebrian, Juan L. Aragon, Jose M. Garcia and Stefanos Kaxiras. To appear in The Journal of Supercomputing [doi:10.1007/s11227-010-0396-0]
Juan M. Cebrian, Juan L. Aragon, Jose M. Garcia, Pavlos Petoumenos and Stefanos Kaxiras. "Efficient Microarchitecture Policies for Accurately Adapting to Power Constraints." IEEE International Parallel and Distributed Processing Symposium, May 25 - Friday, May 29, 2009, Rome.
Daniele Ludovici, Georgios Keramidas, Georgi N. Gaydadjiev, and Stefanos Kaxiras, "Integration of Power Saving Techniques in the UNISIM Simulation Framework Through the Shadow Module Design Paradigm." Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools In conjunction with the 4th International Conference on High-Performance Embedded Architectures and Compilers, Jan., 2009.
G. Keramidas, A. Antonopoulos, D.N. Serpanos, S. Kaxiras, "Non Deterministic Caches: A Simple and Effective Way to Hedge against Side Channel Attacks." 2nd Workshop on Embedded Systems Security (WESS'2007) A Workshop of ACM EMSOFT'2007.
G. Keramidas, P. Xekalakis, S. Kaxiras, "Recruiting Decay for Dynamic Power Reduction in Set-Associative Caches." Transactions on High-Performance Embedded Architectures and Compilers, 2(1):2-20, 2007.
Juan M. Cebrian, Juan L. Aragon, Jose M. Garcia, Stefanos Kaxiras, "Adaptive VP decay: making value predictors leakage-efficient designs for high performance processors." Conf. Computing Frontiers, 2007: 113-122.
Pavlos Petoumenos, Georgios Keramidas, Stefanos Kaxiras, "Cache Replacement Based on Reuse-Distance Prediction." International Conference on Computer Design 2007.
Georgios Keramidas, Polychronis Xekalakis, Stefanos Kaxiras, "Applying Decay to Reduce Dynamic Power in Set-Associative Caches." HiPEAC Conference 2007.
Georgios Keramidas, Pavlos Petoumenos, Stefanos Kaxiras, Alexandros Antonopoulos, Dimitrios Serpanos, "Preventing Denial-of-Service Attacks in Shared CMP Caches." SAMOS VI: Embedded Computer Systems: Architectures, MOdeling, and Simulation, Samos, Greece, July 17-20, 2006.
Modeling Cache Sharing on Chip Multiprocessor Architectures
Pavlos Petoumenos, Georgios Keramidas, Håkan Zeffer, Stefanos Kaxiras, Erik Hagersten
IISWC 2006.
STATSHARE: A Statistical Model for Managing Cache
Sharing via Decay (pdf 2.5MB!)
Pavlos Petoumenos, Georgios Keramidas, Håkan Zeffer, Stefanos Kaxiras, Erik Hagersten
MoBS 2006 with ISCA 33.
Dynamic Dictionary-Based Data Compression for Level-1 Caches
Georgios Keramidas, Konstantinos Aisopos, Stefanos Kaxiras
ARCS 2006.
A Simple Mechanism to Adapt Leakage-Control Policies to
Temperature
Stefanos Kaxiras, Polychronis Xekalakis, Georgios Keramidas
ISLPED 2005.
4T-Decay Sensors: A New Class of Small, Fast, Robust,
and Low-Power, Temperature/Leakage Sensors
Stefanos Kaxiras, Polychronis Xekalakis
ISLPED 2004.
IPSTASH-2:
IPStash: A Set-Associative Memory Approach for
Efficient IP-lookup
Stefanos Kaxiras, Georgios Keramidas
INFOCOM 2005.
IPStash: a Power-Efficient Memory Architecture for IP-lookup (pdf)
Stefanos Kaxiras, Georgios Keramidas
IEEE MICRO-36.
TCP: TAG Correlating Prefetchers (pdf)
Zhigang Hu, Margaret Martonosi, Stefanos Kaxiras,
HPCA-9, Feb. 2003.
Timekeeping Techniques for Predicting and Optimizing Memory Behavior
Zhigang Hu, Stefanos Kaxiras and Margaret Martonosi
IEEE International Solid-State Circuits Conference 2002
Timekeeping in the Memory System: Predicting and Optimizing Memory Behavior (pdf)
Zhigang Hu, Stefanos Kaxiras and Margaret Martonosi
29th International Symposium on Computer Architecture (ISCA-29), May 2002
Implementing Decay Techniques using 4T Quasi-Static Memory Cells (pdf)
P. Juang, P. Diodato, S. Kaxiras, K. Skadron, Z. Hu, M. Martonosi, D. W. Clark.
Computer Architecture Letters, Volume 1, Sep. 2002
Managing Leakage for Transient Data: Decay and Quasi-Static 4T Memory Cell (pdf)
Zhigang Hu, Philo Juang, Phil Diodato, Stefanos Kaxiras, Kevin Skadron, Margaret Martonosi and Douglas W. Clark
International Symposium on Low Power Electronics and Design (ISLPED 2002), August 2002
Let Caches Decay: Reducing Leakage Energy via Exploitation of Cache Generational Behavior (pdf)
Zhigang Hu, Stefanos Kaxiras, Margaret Martonosi
ACM Transactions on Computer Systems, May 2002
Cache Decay: Exploiting Generational Behavior to Reduce Cache Leakage Power
(pdf)
(ps)
Stefanos Kaxiras, Zhigang Hu, Margaret Martonosi
ISCA 2001
This paper also includes Adaptive Decay techniques and
Instruction-based profiling for leakage control.
Cache Decay: A Mechanism to Reduce Cache
Leakage Power (pdf)
(ps)
Stefanos Kaxiras, Zhigang Hu, Girija Narlikar, Rae McLellan
Power Aware Computer Systems Workshop with ASPLOS IX, 2000
Comparing Power Consumption of an SMT and a CMP DSP for Mobile Phone Workloads (pdf)
(ps)
Stefanos Kaxiras, Girija Narlikar, Alan D. Berenbaum, Zhigang Hu
International Conference on Compilers, Architecture, and Synthesis for Embedded Systems (CASES) 2001
Improving Cache Power Efficiency with an Asymmetric Set-Associative Cache (pdf)
(ps)
Zhigang Hu, Margaret Martonosi, Stefanos Kaxiras
Workshop on Memory Performance Issues, 2001.
Updated and expanded journal version (pdf)
Simultaneous Multithreaded DSPs: Scaling from High performance to Low Power (pdf)
(ps)
Stefanos Kaxiras, Alan D. Berenbaum, Girija Narlikar
Bell Laboratories Technical Memorandum 10009639-001024-06TM
Coherence Communication Prediction in Shared Memory Multiprocessors (pdf)
(ps)
Stefanos Kaxiras and Cliff Young
HPCA-6, Jan. 2000
Improving CC-NUMA Performance Using Instruction-Based Prediction (pdf)
(ps)
Stefanos Kaxiras and James R. Goodman.
HPCA-5, Jan 1999.
The Use of Instruction-based Prediction in Hardware Shared-Memory (pdf)
(ps)
Stefanos Kaxiras
University of Wisconsin Computer Sciences Dept. TR-1368, April 1998
Dynamic Optimizations in Linda Systems (pdf)
An unpublished paper on prediction and Linda
Stefanos Kaxiras & Ioannis Schoinas, 1993
For more on instruction-based prediction see also my thesis below
Distributed Vector Architecture: Beyond a Single Vector-IRAM (pdf)
(ps)
Stefanos Kaxiras, Rabin Sugumar, Jim Schwarzmeier
Workshop on Mixing Logic and DRAM: Chipdf that Compute and Remember, Denver, Colorado, June 1, 1997
Distributed Vector Architecture: Fine Grain Parallelism with Efficient Communication (pdf)
(ps)
Stefanos Kaxiras and Rabin Sugumar.
University of Wisconsin Computer Sciences Dept. TR-1339, February 1997
Also available from CRAY Research
DataScalar Architectures (pdf)
(ps)
Doug Burger, Stefanos Kaxiras, and James R. Goodman
24th International Symposium on Computer Architecture (ISCA), June, 1997.
DataScalar Architectures and the SPSD Execution Model (pdf)
(ps)
Douglas Burger, Stefanos Kaxiras, and James R. Goodman
University of Wisconsin Computer Sciences Dept. TR-1317, June 1996
A Study of Three Dynamic Approaches to Handle Widely Shared Data in Shared-Memory Multiprocessors (pdf)
(ps)
Stefanos Kaxiras, Stein Gjessing, and James R. Goodman.
International Conference on Supercomputing, July 1998.
Improving Request-Combining for Widely Shared Data in Shared-Memory Multiprocessors (pdf)
(ps)
Stefanos Kaxiras and James R. Goodman.
Euromicro MPCS, April 1998.
The GLOW Cache Coherence Protocol Extensions for Widely Shared Data (pdf)
(ps)
Stefanos Kaxiras and James R. Goodman
Proceedings International Conference on Supercomputing, May 1996
(Extended version as technical report TR-1305)
Kiloprocessor Extensions to SCI (pdf)
(ps)
Stefanos Kaxiras
Proceedings of the 10th International Parallel Processing Symposium, April 1996
(single sided pdf) (double sided pdf) Advisor: Jim Goodman
A Hardware Synthesis Methodology Using Prolog,
P. Tsanakas, G. Papakonstantinou, S. Kaxiras
Microprocessing and Microprogramming 32 (1991) 307-314, North-Holland