Conference Papers
              2024
              
                  - Realizing the AMD Exascale Heterogeneous Processor Vision
                  
Alan Smith, Gabriel H. Loh, Michael J. Schulte, Mike Ignatowski, Samuel Naffziger, Mike Mantor, Mark Fowler, Nathan Kalyanasundharam, Vamsi Alla, Nicholas Malaya, Joseph L. Greathouse, Eric Chapman, Raja Swaminathan
                  
Published in the Proceedings of the 51st Annual International Symposium on Computer Architecture
                  (ISCA 2024), June, 2024 (industry session acceptance rate: 4/26 ≈ 15%)
                  
Abstract: HTML
                  
Paper: PDF
                   
              
              2023
              
                  - A Research Retrospective on AMD’s Exascale Computing Journey
                  
Gabriel H. Loh, Michael J. Schulte, Mike Ignatowski, Vignesh Adhinarayanan, Shaizeen Aga, Derrick Aguren, Varun Agrawal, Ashwin M. Aji, John Alsop, Paul Bauman, Bradford M. Beckmann, Majed Valad Beigi, Sergey Blagodurov, Travis Boraten, Michael Boyer, William Brantley, Noel Chalmers, Shaoming Chen, Kevin Cheng, Michael L. Chu, David Cownie, Nicholas Curtis, Joris Del Pino, Nam Duong, Alexandru Dutu, Yasuko Eckert, Christopher Erb, Chip Freitag, Joseph L. Greathouse, Sudhanva Gurumurthi, Anthony Gutierrez, Khaled Hamidouche, Sachin Hossamani, Wei Huang, Mahzabeen Islam, Nuwan Jayasena, John Kalamatianos, Onur Kayiran, Jagadish Kotra, Alan Lee, Daniel Lowell, Niti Madan, Abhinandan Majumdar, Nicholas Malaya, Srilatha Manne, Susumu Mashimo, Damon McDougall, Elliott Mednick, Michael Mishkin, Mark Nutter, Indrani Paul, Matthew Poremba, Brandon Potter, Kishore Punniyamurthy, Sooraj Puthoor, Steven E. Raasch, Karthik Rao, Greg Rodgers, Marko Scrbak, Mohammad Seyedzadeh, John Slice, Vilas Sridharan, Rene van Oostrum, Eric van Tassell, Abhinav Vishnu, Samuel Wasmundt, Mark Wilkening, Noah Wolfe, Mark Wyse, Adithya Yalavarti, Dmitri Yudanov
                  
Published in the
                  Proceedings of the 50th Annual International Symposium on Computer Architecture
                  (ISCA 2023), June, 2023 (industry session acceptance rate: 5/11 ≈ 45%)
                  
Abstract: HTML
                  
Paper: ACM Author-Izer Free Download |
                  ACM |
                  PDF
                   
              
              2019
              
              2018
              
                  - Machine Learning for Performance and Power Modeling of Heterogeneous Systems (Invited Paper)
                  
Joseph L. Greathouse, Gabriel H. Loh
                  
Published in the
                  Proceedings of the 2018 International Conference on Computer Aided Design
                  (ICCAD 2018), November, 2018
                  
Abstract: HTML
                  
Paper: ACM Author-Izer Free Download |
                  ACM |
                  IEEE |
                  PDF
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
                  - Interference from GPU System Service Requests
                  
Arkaprava Basu, Joseph L. Greathouse, Guru Venkataramani, Ján Veselý
                  
Published in the
                  Proceedings of the 2018 IEEE International Symposium on Workload Characterization
                  (IISWC 2018), September, 2018 (acceptance rate: 17/47 ≈ 36%)
                  
Nominated for Best Paper at IISWC'18
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
                  - 3D Numerical Analysis of Two-Phase Immersion Cooling for Electronic Components
                  
Xudong An, Manish Arora, Wei Huang, William C. Brantley, Joseph L. Greathouse
                  
Published in the
                  Proceedings of the 17th IEEE Intersociety Conference on Thermomechanical Phenomena in Electronic Systems
                  (ITherm 2018), May, 2018
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
              
              2017
              
                  - Accelerating Matrix Processing with GPUs
                  
Nicholas Malaya, Shuai Che, Joseph L. Greathouse, René van Oostrum, Michael J. Schulte
                  
Published in the
                  Proceedings of the 24th IEEE Symposium on Computer Arithmetic
                  (ARITH 24), July, 2017 (acceptance rate: 22/50 = 40%)
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
Presentation: PDF
                   
                  
                  - DVFS Space Exploration in Power Constrained Processing-in-Memory Systems
                  
Marko Ščrbak, Joseph L. Greathouse, Nuwan Jayasena, Krishna Kavi
                  
Published in the
                  Proceedings of the 30th International Conference on Architecture of Computing Systems
                  (ARCS 2017), April, 2017 (acceptance rate: 19/42 ≈ 45%)
                  
Abstract: HTML
                  
Paper: PDF | The final publication is available at Springer via 
http://dx.doi.org/10.1007/978-3-319-54999-6_17
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
                  - Dynamic GPGPU Power Management using Adaptive Model Predictive Control
                  
Abhinandan Majumdar, Leonardo Piga, Indrani Paul, Joseph L. Greathouse, Wei Huang, David H. Albonesi
                  
Published in the
                  Proceedings of the 23rd IEEE International Symposium on High Performance Computer Architecture
                  (HPCA 2017), February, 2017 (acceptance rate: 50/224 ≈ 22%)
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
                  - Design and Analysis of an APU for Exascale Computing
                  
Thiruvengadam Vijayaraghavan, Yasuko Eckert, Gabriel H. Loh, Michael J. Schulte, Mike Ignatowski, Bradford M. Beckmann, William C. Brantley, Joseph L. Greathouse, Wei Huang, Arun Karunanithi, Onur Kayiran, Mitesh Meswani, Indrani Paul, Matthew Poremba, Steven Raasch, Steven K. Reinhardt, Greg Sadowski, Vilas Sridharan
                  
Published in the
                  Proceedings of the 23rd IEEE International Symposium on High Performance Computer Architecture
                  (HPCA 2017), February, 2017 (industry session acceptance rate: 5/15 ≈ 33%)
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                   
                  - Dynamic Buffer Overflow Detection for GPGPUs
                  
Christopher Erb, Mike Collins, Joseph L. Greathouse
                  
Published in the
                  Proceedings of the 2017 IEEE/ACM International Symposium on Code Generation and Optimization
                  (CGO 2017), February, 2017 (acceptance rate: 26/116 ≈ 22%)
                  
Abstract: HTML
                  
Paper: ACM |
                  IEEE |
                  PDF
                  
Presentation: PPTX |
                  PPT |
                  PDF
                  
Software: GitHub
                   
              
              2016
              
                  - Measuring and Modeling On-Chip Interconnect Power on Real Hardware
                  
Vignesh Adhinarayanan, Indrani Paul, Joseph L. Greathouse, Wei Huang, Ashutosh Pattnaik, Wu-chun Feng
                  
Published in the
                  Proceedings of the 2016 IEEE International Symposium on Workload Characterization
                  (IISWC 2016), September, 2016 (acceptance rate: 21/69 ≈ 30%)
                  
Awarded Best Paper at IISWC 2016              
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
Presentation: PPTX |
                  PPTX |
                  PDF
                   
                  - Horton Tables: Fast Hash Tables for In-Memory Data-Intensive Computing
                  
Alex D. Breslow, Dong Ping Zhang, Joseph L. Greathouse, Nuwan Jayasena, Dean M. Tullsen
                  
Published in the
                  Proceedings of the 2016 USENIX Annual Technical Conference
                  (USENIX ATC 2016), June, 2016 (acceptance rate: 47/247 ≈ 19%)
                  
Abstract: HTML
                  
Paper: USENIX | 
                  PDF
                  
Presentation: USENIX |
                  PPTX | 
                  PPT |
                  PDF
                   
              
              2015
              
                  - Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices
                  
Mayank Daga, Joseph L. Greathouse
                  
Published in the
                  Proceedings of the 2015 IEEE International Conference on High Performance Computing
                  (HiPC 2015), December, 2015 (acceptance rate: 48/201  ≈ 24%)
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
                  - A Taxonomy of GPGPU Performance Scaling
                  
Abhinandan Majumdar, Gene Wu, Kapil Dev, Joseph L. Greathouse, Indrani Paul, Wei Huang, Arjun Karthik Venugopal, Leonardo Piga, Chip Freitag, Sooraj Puthoor
                  
Published in the
                  Proceedings of the 2015 IEEE International Symposium on Workload Characterization
                  (IISWC 2015), October, 2015 (acceptance rate: 29/61  ≈ 48%)
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
                  
Poster: PDF
                   
                  - GPGPU Performance and Power Estimation Using Machine Learning
                  
Gene Wu, Joseph L. Greathouse, Alexander Lyashevsky, Nuwan Jayasena, Derek Chiou
                  
Published in the
                  Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture
                  (HPCA 2015), February, 2015 (acceptance rate: 51/226 ≈ 23%)
                  
Abstract: HTML
                  
Paper: IEEE | 
                  PDF
                  
                  
Presentation: PPTX | 
                  PPT |
                  PDF
                   
              
              2014
              
                  - PPEP: Online Performance, Power, and Energy Prediction Framework and DVFS Space Exploration
                  
Bo Su, Junli Gu, Li Shen, Wei Huang, Joseph L. Greathouse, Zhiying Wang
                  
Published in the
                  Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture
                  (MICRO-47), December, 2014 (acceptance rate: 53/273 ≈ 19%)
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
                  
Presentation: PPTX | 
                  PPT |
                  PDF
                  
Lightning Talk: PPTX | 
                  PPT |
                  PDF
                  
Poster: PDF
                   
                  - Efficient Sparse Matrix-Vector Multiplication on GPUs using the CSR Storage Format
                  
Joseph L. Greathouse, Mayank Daga
                  
Published in the
                  Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
                  (SC14), November, 2014 (acceptance rate: 83/394 ≈ 21%)
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
                  
Presentation: PPTX | 
                  PPT |
                  PDF
                   
                  - TOP-PIM: Throughput-Oriented Programmable Processing in Memory
                  
Dong Ping Zhang, Nuwan Jayasena, Alexander Lyashevsky, Joseph L. Greathouse, Lifan Xu, Michael Ignatowski
                  
Published in the
                  Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing
                  (HPDC'14), June, 2014 (acceptance rate: 21/130 ≈ 16%)
                  
Nominated for Best Paper at HPDC'14
                  
Abstract: HTML
                  
Paper: ACM Author-Izer Free Download |
                  ACM |
                  PDF
                  
                  
Presentation: PPTX | 
                  PPT |
                  PDF
                   
                  - Implementing a Leading Loads Performance Predictor on Commodity Processors
                  
Bo Su, Joseph L. Greathouse, Junli Gu, Michael Boyer, Li Shen, Zhiying Wang
                  
Published in the
                  Proceedings of the 2014 USENIX Annual Technical Conference
                  (USENIX ATC 2014), June, 2014 (acceptance rate: 44/241 ≈ 18%)
                  
Abstract: HTML
                  
Paper: USENIX | 
                  PDF
                  
                  
Presentation: USENIX |
                  PPTX | 
                  PPT |
                  PDF
                  
Video available at USENIX
                   
              
              2012
              
              
              
              2011
              
                  - Demand-Driven Software Race Detection using Hardware Performance Counters
                  
Joseph L. Greathouse, Zhiqiang Ma, Matthew I. Frank, Ramesh Peri, Todd Austin
                  
Published in the
                  Proceedings of the 38th Annual International Symposium on Computer Architecture
                  (ISCA 2011), June, 2011 (acceptance rate: 40/208 ≈ 19%)
                  
Abstract: HTML
                  
Paper: ACM Author-Izer Free Download |
                  ACM |
                  PDF
                  
                  
Presentation: PPTX |
                  PPT |
                  PDF
                  
Video available at the ACM.
                   
                  
                  - Highly Scalable Distributed Dataflow Analysis
                  
Joseph L. Greathouse, Chelsea LeBlanc, Todd Austin, Valeria Bertacco
                  
Published in the
                  Proceedings of the 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization
                  (CGO 2011), April, 2011 (acceptance rate: 28/105 ≈ 27%)
                  
Awarded Best Student Presentation at CGO 2011
                  
Abstract: HTML
                  
Paper: IEEE |
                  PDF
                  
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
              
              
              2008
              
          Workshop Papers
              2018
              
              2016
              
              
              2014
              
                  - Adaptive GPU Cache Bypassing
                  
Yingying Tian, Sooraj Puthoor, Joseph L. Greathouse, Bradford M. Beckmann, Daniel Jiménez
                  
Published in the
                  Proceedings of the 8th Workshop on General Purpose Processing on GPUs
                  (GPGPU-8), June, 2014 (acceptance rate: 11/17 ≈ 65%)
                  
Abstract: HTML
                  
Paper: ACM Author-Izer Free Download |
                  ACM | 
                  PDF
                  
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
                  
                  - A Power Characterization and Management of GPU Graph Traversal
                  
Adam McLaughlin, Indrani Paul, Joseph L. Greathouse, Srilatha Manne, Sudhakar Yalamanchili
                  
Published at the Fourth Workshop on Architectures and Systems for Big Data
                  (ASBD 2014), June, 2014 (acceptance rate: 6/13 ≈ 46%)
                  
Abstract: HTML
                  
Paper: PDF
                  
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
              
              
              2013
              
                  - Simulation of Exascale Nodes through Runtime Hardware Monitoring
                  
Joseph L. Greathouse, Alexander Lyashevsky, Mitesh Meswani, Nuwan Jayasena, Michael Ignatowski
                  
Published at the ASCR Workshop on Modeling & Simulation of Exascale Systems & Applications
                  (ModSim 2013), September, 2013 (acceptance rate: 32/74 ≈ 43%)
                  
Abstract: HTML
                  
Paper: PDF
                  
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
                  
                  - A New Perspective on Processing-in-memory Architecture Design
                  
Dong Ping Zhang, Nuwan Jayasena, Alexander Lyashevsky, Joseph Greathouse, Mitesh Meswani, Mark Nutter, Mike Ignatowski
                  
Published in the
                  Proceedings of the ACM SIGPLAN Workshop on Memory Systems Performance and Correctness
                  (MSPC 2013), June, 2013
                  
Abstract: HTML
                  
Paper: ACM Author-Izer Free Download |
                  ACM |
                  PDF
                  
                  
Presentation: PPTX |
                  PPT |
                  PDF
                   
              
              
              2011
              
          Ph.D. Dissertation
          
              - Hardware Mechanisms for Distributed Dynamic Software Analysis
              
Joseph L. Greathouse
              
Abstract: HTML
              
Paper: DeepBlue |
              PDF
              
              
Presentation: PPTX |
              PPT |
              PDF