Skip to main content

By Year:

    2024 2023 2022 2021 2020

    2019 2018 2017 2016 2015 2012 2011 2010

    2009

2024

  • Building Better Tools To Enable Power- and Sustainability-Aware Co-Design. Matthew D. Sinclair, Brandon Tran, and Akanksha Chaudhari. To appear in NSF Workshop on Sustainable Computing for Sustainability (NSF-WSCS). April 2024.
  • T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives. Suchita Pati, Shaizeen Aga, Mahzabeen Islam, Nuwan Jayasena, and Matthew D. Sinclair. To appear in International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). April 2024.

2023

  • Architecture 2.0. Siddharth Garg, Brian Hirano, Jenny Huang, Yingyan (Celine) Lin, Vijay Janapa Reddi, Tushar Krishna, Srivatsan Krishnan, Benjamin Lee, Jason Lowe-Power, Martin Maas, Shvetank Prakash, Matthew D. Sinclair, Srinivas Sridharan, Amir Yazdanbakhsh, Jason Yik, and Cliff Young. December 2023. Download: Preliminary Summary SIGARCH Blog Post
  • Fifty Years of ISCA: A data-driven retrospective on key trends. Matthew D. Sinclair, Parthasarathy Ranganathan, Gaurang Upasani, Adrian Sampson, David Patterson, Rutwik Jain, Shaan Shah, Nidhi Parthasarathy. In IEEE Micro, vol. 43, no. 6, pp. 109-124, Nov.-Dec. 2023, doi: 10.1109/MM.2023.3324465. Download: PDF Data
  • Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware. Suchita Pati, Shaizeen Aga, Mahzabeen Islam, Nuwan Jayasena, Matthew D. Sinclair. In 2023 IEEE International Symposium on Workload Characterization (IISWC), October 2023.
  • Fifty Years of ISCA: A data-driven retrospective on key trends. Gaurang Upasani, Matthew D. Sinclair, Adrian Sampson, Parthasarathy Ranganathan, David Patterson, Shaan Shah, Nidhi Parthasarathy, Rutwik Jain. Preprint on ArXiV, June 2023. Download: PDF
  • Closing the Gap: Improving the Accuracy of gem5’s GPU Models. Vishnu Ramadas, Daniel Kouchekinia, Ndubuisi Osuji, and Matthew D. Sinclair. In 5th gem5 Users' Workshop, June 2023. Download: Abstract
  • Improving gem5’s GPUFS Support. Vishnu Ramadas, Matthew Poremba, Bradford M. Beckmann, and Matthew D. Sinclair. In 5th gem5 Users' Workshop, June 2023. Download: Abstract
  • Analyzing the Benefits of More Complex Cache Replacement Policies in Moderns GPU LLCs. Jarvis Jia and Matthew D. Sinclair. In 5th gem5 Users' Workshop, June 2023. Download: Abstract
  • Improving the Speed of gem5’s GPU Regression Tests. James Braun and Matthew D. Sinclair. In 5th gem5 Users' Workshop, June 2023. Download: Abstract
  • Integrating Per-Stream Stat Tracking into Accel-Sim. Shichen (Justin) Qiao, Xin (Cassie) Su, Matthew D. Sinclair. Preprint on ArXiV, April 2023. Download: PDF
  • Computation vs. Communication Scaling for Future Transformers on Future Hardware. Suchita Pati, Shaizeen Aga, Mahzabeen Islam, Nuwan Jayasena, Matthew D. Sinclair. Preprint on ArXiV, February 2023. Download: PDF

2022

  • Improving the Scalability of GPU Synchronization Primitives. Preyesh Dalmia, Rohan Mahapatra, Jeremy Intan, Dan Negrut, and Matthew D. Sinclair. In Transactions on Parallell and Distributed Computing (TPDS), 2022. Download: PDF Artifact Code
  • Demystifying BERT: System Design Implications. Suchita Pati, Shaizeen Aga, Nuwan Jayasena, and Matthew D. Sinclair. In IEEE International Symposium on Workload Characterization (IISWC), November 2022.
  • Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems. Prasoon Sinha, Akhil Guliani, Rutwik Jain, Brandon Tran, Matthew D. Sinclair, and Shivaram Venkataraman. In the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), November 2022. Download: PDF Artifact
  • Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems. Prasoon Sinha, Akhil Guliani, Rutwik Jain, Brandon Tran, Matthew D. Sinclair, and Shivaram Venkataraman. Preprint on ArXiV, August 2022. Download: PDF
  • gem5 GPU Accuracy Profiler (GAP). Charles Jamieson, Anushka Chandrashekar, Ian McDougall, and Matthew D. Sinclair. In 4th gem5 Users' Workshop, June 2022. Download: Abstract PDF Presentation
  • Johnathan Alsop, Weon Taek Na, Samuel Grayson, Matthew D. Sinclair, and Sarita V. Adve. A Case for Fine-grained Coherence Specialization in Heterogeneous Systems, in Transactions on Architecture and Code Optimizations (TACO), 2022. Download: PDF
  • Only Buffer When You Need To: Reducing On-chip GPU Traffic with Reconfigurable Local Atomic Buffers. Preyesh Dalmia, Rohan Mahapatra, and Matthew D. Sinclair. In 28th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2022), February 2022. Download: PDF

2021

  • DENNI: Distributed Neural Network Inference on Severely Resource Constrained Edge Devices. Rohit Sanu, Ryan J. Toepfer, Matthew D. Sinclair, and Henry Duwe III. In 40th IEEE International Performance Computing and Communications Conference (IPCCC), October 2021. Download: PDF
  • Understanding GPU Variability in Supercomputers. Prasoon Sinha, Matthew Sinclair, and Shivaram Venkataraman. In MICRO Student Research Competition (SRC), October 2021.
  • A Case for Fine-grain Coherence Specialization in Heterogeneous Systems. Johnathan Alsop, Weon Taek Na, Matthew D. Sinclair, Samuel Grayson, and Sarita V. Adve. Preprint on ArXiV, April 2021. Download: PDF
  • Demystifying BERT: Implications for Accelerator Design. Suchita Pati, Shaizeen Aga, Nuwan Jayasena, and Matthew D. Sinclair. Preprint on ArXiV, April 2021. Download: PDF
  • Improving GPU Utilization in ML Workloads Through Finer-Grained Synchronization. Reese Kuper, Suchita Pati, and Matthew D. Sinclair. At 3rd Young Architects Workshop (YArch), April 2021.
  • Enabling Reproducible and Agile Full-System Simulation. Bobby R. Bruce, Ayaz Akram, Hoa Nguyen, Kyle Roarty, Mahyar Samani, Marjan Fariborz, Trivikram Reddy, Matthew D. Sinclair, and Jason Lowe-Power. In 2021 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), March 2021. Best Paper Nominee! Details: Materials PDF Talk
  • Co-designing Power Management with Job Scheduling for Efficient Exascale Computing. Matthew D. Sinclair and Shivaram Venkataraman. At DOE ASCR Workshop on Reimagining Codesign, March 2021. Details: PDF
  • Leveraging open source simulators for HPC codesign. Bobby Bruce, Jason Lowe-Power, and Matthew D. Sinclair. At DOE ASCR Workshop on Reimagining Codesign, March 2021. Details: PDF
  • Deadline-Aware Offloading for High-Throughput Accelerators. Tsung Tai Yeh, Matthew D. Sinclair, Bradford M. Beckmann, and Timothy G. Rogers. In Proceedings of the 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA), March 2021. Download: PDF Presentation Short Presentation

2020

  • Deterministic Atomic Buffering. Yuan Hsi Chou*, Christopher Ng*, Shaylin Cattell, Jeremy Intan, Matthew D. Sinclair, Joseph Devietti, Timothy G. Rogers, and Tor M. Aamodt. In 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO), October 2020. Download: PDF Talk Lightning Talk Source Code
    * NOTE: First two authors contributed equally and are listed alphabetically
  • SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks. Suchita Pati, Shaizeen Aga, Matthew D. Sinclair, and Nuwan Jayasena. In 2020 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), August 2020. Download: PDF
    Preprint on ArXiV, July 2020. Download: PDF
  • Specializing Coherence, Consistency, and Push/Pull for GPU Graph Analytics. Giordano Salvador, Johnathan Alsop, Wesley H. Darvin, Muhammad Huzaifa, Matthew D. Sinclair, and Sarita V. Adve. An extended abstract in 2020 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), August 2020.
    Extended version on ArXiV: PDF
  • Inter-Kernel Reuse-Aware Thread Block Scheduling. Muhammad Huzaifa, Johnathan Alsop, Abdulrahman Mahmoud, Giordano Salvador, Matthew D. Sinclair, and Sarita V. Adve. In ACM Transactions on Architecture and Code Optimization (TACO). 17,3, Article 24 (August 2020), 27 pages. Download: PDF
  • The gem5 Simulator: Version 20.0+. Jason Lowe-Power, Abdul Mutaal Ahmad, Ayaz Akram, Mohammad Alian, Rico Amslinger, Matteo Andreozzi, Adria Armejach, Nils Asmussen, Srikant Bharadwaj, Gabe Black, Gedare Bloom, Bobby R. Bruce, Daniel Rodrigues Carvalho, Jeronimo Castrillon, Lizhong Chen, Nicolas Derumigny, Stephan Diestelhorst, Wendy Elsasser, Marjan Fariborz, Amin Farmahini-Farahani, Pouya Fotouhi, Ryan Gambord, Jayneel Gandhi, Dibakar Gope, Thomas Grass, Bagus Hanindhito, Andreas Hansson, Swapnil Haria, Austin Harris, Timothy Hayes, Adrian Herrera, Matthew Horsnell, Syed Ali Raza Jafri, Radhika Jagtap, Hanhwi Jang, Reiley Jeyapaul, Timothy M. Jones, Matthias Jung, Subash Kannoth, Hamidreza Khaleghzadeh, Yuetsu Kodama, Tushar Krishna, Tommaso Marinelli, Christian Menard, Andrea Mondelli, Tiago Muck, Omar Naji, Krishnendra Nathella, Hoa Nguyen, Nikos Nikoleris, Lena E. Olson, Marc Orr, Binh Pham, Pablo Prieto, Trivikram Reddy, Alec Roelke, Mahyar Samani, Andreas Sandberg, Javier Setoain, Boris Shingarov, Matthew D. Sinclair, Tuan Ta, Rahul Thakur, Giacomo Travaglini, Michael Upton, Nilay Vaish, Ilias Vougioukas, Zhengrong Wang, Norbert Wehn, Christian Weis, David A. Wood, Hongil Yoon, and Eder F. Zulian. Preprint on ArXiV, July 2020. Download: PDF
  • Modeling Modern GPU Applications in gem5. Kyle Roarty and Matthew D. Sinclair. In 3rd gem5 Users' Workshop, June 2020. Download: HTML Presentation
  • Enabling Multi-GPU Support in gem5. Bobbi W. Yogatama, Matthew D. Sinclair, and Michael M. Swift. In 3rd gem5 Users' Workshop, June 2020. Download: HTML Presentation
  • Independent Forward Progress of Work-groups. Alexandru Dutu, Matthew D. Sinclair, Bradford M. Beckmann, David A. Wood, and Marcus Chow. In 47th International Symposium on Computer Architecture (ISCA), May 2020. Download: PDF

2019

  • Optimizing GPU Cache Policies for MI Workloads. Johnathan Alsop, Matthew D. Sinclair, Anthony Gutierrez, Srikant Bharadwaj, Xianwei Zhang, Bradford Beckmann, Alexandru Dutu, Onur Kayiran, Michael LeBeane, Brandon Potter, Sooraj Puthoor, and Tsung Tai Yeh. Short paper in 2019 IEEE International Symposium on Workload Characterization (IISWC), November 2019.
    Extended version on ArXiV: PDF
  • Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. Jonathan Lew, Deval Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, and Tor Aamodt. Extended abstract and poster in 2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), March 2019.
    Extended version on ArXiV: PDF

2018

  • Spandex: A Generalized Interface for Flexible Heterogeneous Coherence. Johnathan Alsop, Matthew D. Sinclair, and Sarita V. Adve. In 45th International Symposium on Computer Architecture (ISCA), June 2018. Download: BibTeX PDF
  • Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level. Anthony Gutierrez, Bradford Beckmann, Alexandru Dutu, Joseph Gross, John Kalamatianos, Onur Kayiran, Michael LeBeane, Matthew Poremba, Brandon Potter, Sooraj Puthoor, Matthew D. Sinclair, Mark Wyse, Jieming Yin, Xianwei Zhang, Akshay Jain, and Timothy G. Rogers. In 24th IEEE International Symposium on High Performance Computer Architecture (HPCA), February 2018. Download: BibTeX PDF
  • HPVM: Heterogeneous Parallel Virtual Machine. Maria Kotsifakou*, Prakalp Srivastava*, Matthew D. Sinclair, Rakesh Komuravelli, Vikram Adve, and Sarita Adve. In 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), February 2018. Download: BibTeX PDF
    * NOTE: First two authors contributed equally and are listed alphabetically

2017

  • Efficient Coherence and Consistency for Specialized Memory Hierarchies. Matthew D. Sinclair, Ph.D. Thesis, August 2017. Selected as 2018 David J. Kuck Outstanding PhD Thesis Award Winner and ACM SIGARCH - IEEE Computer Society TCCA 2018 Outstanding Dissertation Award Honorable Mention! Download: BibTeX PDF
  • HeteroSync: A Benchmark Suite for Fine-Grained Synchronization on Tightly Coupled GPUs. Matthew D. Sinclair, Johnathan Alsop, and Sarita V. Adve, in the IEEE International Symposium on Workload Characterization (IISWC), October 2017. Download: BibTeX PDF Presentation
  • Chasing Away RAts: Semantics and Evaluation for Relaxed Atomics on Heterogeneous Systems. Matthew D. Sinclair, Johnathan Alsop, and Sarita V. Adve, in the 44th International Symposium on Computer Architecture (ISCA), June 2017. Download: BibTeX PDF Presentation Lightning Presentation

2016

  • POSTER: hVISC: A Portable Virtual Instruction Set for Heterogeneous Parallel Systems. Prakalp Srivastava, Maria Kotsifakou, Matthew D. Sinclair, Rakesh Komuravelli, Vikram Adve, and Sarita Adve. In the 25th International Conference on Parallel Architecture and Compilation (PACT), September 2016. Download: BibTeX PDF
  • GSI: A GPU Stall Inspector to Characterize the Source of Memory Stalls for Tightly Coupled GPUs. Johnathan Alsop, Matthew D. Sinclair, Rakesh Komuravelli, and Sarita V. Adve. In IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), April 2016. Download: BibTeX PDF

2015

  • Efficient GPU Synchronization without Scopes: Saying No to Complex Consistency Models. Matthew D. Sinclair, Johnathan Alsop, and Sarita V. Adve, in the 48th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), December 2015. Selected as an IEEE MICRO Top Picks Honorable Mention from the 2015 Computer Architecture Conferences! Download: BibTeX PDF Presentation
  • Stash: Have Your Scratchpad and Cache it Too. Rakesh Komuravelli*, Matthew D. Sinclair*, Johnathan Alsop, Muhammad Huzaifa, Maria Kotsifakou, Prakalp Srivastava, Sarita V. Adve, and Vikram Adve. In 42nd International Symposium on Computer Architecture (ISCA), June 2015. Selected as an IEEE MICRO Top Picks Honorable Mention from the 2015 Computer Architecture Conferences! Download: BiBTeX PDF Presentation
    * NOTE: First two authors contributed equally and are listed alphabetically

2012

  • Vikram Adve, Sarita Adve, Rakesh Komuravelli, Matthew D. Sinclair, and Prakalp Srivastava. Virtual Instruction Set Computing for Heterogeneous Systems. In 4th USENIX Workshop on Hot Topics in Parallelism (HotPar), June 2012. Download: PDF

2011

  • Porting CMP Benchmarks to GPUs. Matthew Sinclair, Henry Duwe, and Karthikeyan Sankaralingam. Technical Report TR-1693, Department of Computer Sciences, The University of Wisconsin-Madison, 2011. Download: BibTeX PDF
  • Challenge Benchmarks that Must Conquered to Sustain the GPU Revolution. Emily Blem, Matthew Sinclair, and Karthikeyan Sankaralingam. In Proceedings of 4th Annual Workshop on Emerging Applications and Many-Core Architecture (EAMA), June 2011. Download: Details BibTeX PDF Presentation
  • Enabling New Uses for GPUs. Matthew D. Sinclair. Masters Thesis, University of Wisconsin-Madison, May 2011. Download: BibTeX PDF
  • Sampling + DMR: Practical and Low-overhead Permanent Fault Detection. Shuou Nomura, Matthew D. Sinclair, Chen-han Ho, Venkatraman Govindaraju, Marc de Kruijf, and Karthikeyan Sankaralingam. In Proceedings of 38th International Symposium on Computer Architecture (ISCA 2011). Download: Details BibTeX PDF

2010

  • GRASSY: Leveraging GPU Texture Units for Asteroseismic Data Analysis. Karthikeyan Sankaralingam, Richard Townsend, and Matthew D. Sinclair. In Proceedings of GPU Technology Conference (GTC), 2010. Download: Details BibTeX Presentation Video
  • Leveraging the untapped computation power of GPUs: fast spectral synthesis using texture interpolation. Richard Townsend, Karthikeyan Sankaralingam, and Matthew D. Sinclair. Addison-Wesley, 2010. Download: Details BibTeX

2009

  • Bitonic-MapReduce: Optimization of MapReduce on the Cell B.E. Architecture with a Bitonic Sort. Matthew D. Sinclair. Published in UW-Madison 2009-2010 Trewartha Papers. Download: PDF BibTeX

Student Publications

  • Reducing Synchronization and Communication Overhead in GPUs. Preyesh Dalmia. PhD Thesis, September 2023. Download: PDF
  • Reducing Synchronization Overhead for Persistent RNNs. Qinjun Jiang. Published in UW-Madison 2020-2021 Trewartha Papers, May 2021. Download: PDF
  • Designing Efficient Barriers and Semaphores for Graphics Processing Units. Rohan Mahapatra, MS Project, August 2020. Download: PDF
  • Exploring GPU Architectural Optimizations for RNNs. Suchita Pati, at 1st Young Architects Workshop (YArch), February 2019. Download: PDF