Publications

Filtered as: all types - all years - all authors - all keywords
Sorted by: author - year
Corresponding
bibtex list 

Aaltonen, P. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Ahmad, Mohammad 

2006 

  1. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006

Akin, Berkin 

2016 

  1. Berkin Akin, Franz Franchetti and James C. Hoe
    HAMLeT Architecture for Parallel Data Reorganization in Memory
    IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016

2015 

  1. Berkin Akin
    A Formal Approach to Memory Access Optimization: Data Layout, Reorganization, and Near-Data Processing
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2015
  2. Berkin Akin, Franz Franchetti and James C. Hoe
    Data Reorganization in Memory Using 3D-stacked DRAM
    Proc. International Symposium on Computer Architectur (ISCA), 2015
  3. Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    Enabling Portable Energy Efficiency with Memory Accelerated Library
    Proc. MICRO, 2015
  4. Berkin Akin, Franz Franchetti and James C. Hoe
    FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
    Journal of Signal Processing Systems, 2015

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014
  2. F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  3. Berkin Akin, Franz Franchetti and James C. Hoe
    FFTs with Near-Optimal Memory Access Through Block Data Layouts
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
  4. Berkin Akin, James C. Hoe and Franz Franchetti
    HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  5. Berkin Akin, Franz Franchetti and James C. Hoe
    Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014

2013 

  1. Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013

2012 

  1. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
    ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
  2. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012

Alachiotis, N. 

2015 

  1. Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    Enabling Portable Energy Efficiency with Memory Accelerated Library
    Proc. MICRO, 2015

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014

Al Badawi, A. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Almasi, G. 

2012 

  1. Franz Franchetti, Yevgen Voronenko and G. Almasi
    Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
    Proc. High Performance Computing for Computational Science (VECPAR), 2012

Andrews, Michael 

2008 

  1. Jeremy Johnson and Michael Andrews
    Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

2007 

  1. Jeremy Johnson and Michael Andrews
    Performance Analysis of a Family of WHT Algorithms
    Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium , IEEE, pp. 450, 2007

Angelopoulos, C. 

2012 

  1. C. Angelopoulos, Franz Franchetti and Markus Püschel
    Automatic Generation of FFT Libraries for GPUs
    NVIDIA Research Summit at the GPU Technology Conference, 2012

2010 

  1. C. Angelopoulos, Franz Franchetti and Markus Püschel
    DFT Transform on the Fermi (GTX480): Automatic Program Generation
    NVIDIA Research Summit at the GPU Technology Conference, 2010

Arbatov, Volodymyr 

2011 

  1. Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    Proc. International Conference on Supercomputing (ICS), 2011
  2. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011

2010 

  1. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010

Asheghi, M. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Austel, V. 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

Bader, D. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Balog, Mike 

2002 

  1. Mike Balog
    A Flexible Framework for Implementing FFT Processors
    Master thesis, Electrical and Computer Engineering, Drexel University, 2002

Beck, Calvin 

2021 

  1. Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
    Modular, Compositional, and Executable Formal Semantics for LLVMIR
    Proc. International Conference on Functional Programming (ICFP), 2021

Becker, Peter 

2001 

  1. Peter Becker
    A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
    Master thesis, Electrical and Computer Engineering, Drexel University, 2001

Benlachtar, Yannis 

2011 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011

2010 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  2. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  3. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Berger, Christian 

2012 

  1. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
  2. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
    Journal of Signal Processing Systems, 2012
  3. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012

2011 

  1. Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
    Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
    Proc. High Performance Extreme Computing (HPEC), 2011
  2. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011

2010 

  1. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010
  2. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  3. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010

Beth, Thomas 

2002 

  1. Markus Püschel, Sebastian Egner and Thomas Beth
    AREP
    in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002

Bientinesi, Paolo 

2018 

  1. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018

Blanco, Mark 

2021 

  1. Mark Blanco, S. McMillan and Tze-Meng Low
    Delayed Asynchronous Iterative Graph Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2021

2020 

  1. Mark Blanco, S. McMillan and Tze-Meng Low
    Towards an Objective Metric for the Performance of Exact Triangle Count
    Proc. High Performance Extreme Computing (HPEC), 2020

2019 

  1. Mark Blanco, Tze-Meng Low and Kyungjoo Kim
    Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
    Proc. High Performance Extreme Computing (HPEC), 2019

Bokor, J. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Bolten, Matthias 

2017 

  1. Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
    Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
    Concurrency and Computation: Practice and Experience, 2017

Bondalapati, Kiran 

2000 

  1. Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
    Dynamic Data Layouts for Cache-conscious Factorization of DFT
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000

Bonelli, Andreas 

2006 

  1. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  2. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

Bouziane, Rachid 

2011 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011

2010 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  2. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  3. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Breitzman, Anthony F. 

2004 

  1. Anthony F. Breitzman and Jeremy Johnson
    Automatic Derivation and Implementation of Fast Convolution Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Computer Algebra and Signal Processing'', Vol. 37, No. 2, pp. 157-186, 2004

2003 

  1. Anthony F. Breitzman
    Automatic Derivation and Implementation of Fast Convolution Algorithms
    PhD. thesis, Computer Science, Drexel University, 2003

Brinich, P. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023
  3. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Broderick, P. 

2023 

  1. B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
    FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
    to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
  2. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Brouwer, Paul 

2020 

  1. Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
    GBTLX: A First Look
    Proc. IEEE High Performance Extreme Computing (HPEC), 2020

Buluc, A. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Cai, M. 

2022 

  1. J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
    LIMA: Hardware for FFT based Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

Canida, K. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Canning, A. 

2020 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster

2019 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019

2018 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018

Caparrós Cabezas, Victoria 

2014 

  1. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
  2. Victoria Caparrós Cabezas and Markus Püschel
    Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
    Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014

Cartolano, Anthony 

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Chagnon, Tim 

2008 

  1. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

Chang, Lawrence C. 

2004 

  1. Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
    Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2004

Chang, Y.-J. 

2010 

  1. W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
    Fast Bilateral Filtering By Adapting Block Size
    Proc. IEEE International Conference on Image Processing (ICIP), 2010

Chellappa, Srinivas 

2010 

  1. Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Efficient Software Viterbi Decoders
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
  2. Srinivas Chellappa
    Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010

2009 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009

2008 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
  2. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  4. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    How To Write Fast Numerical Code: A Small Introduction
    Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
  5. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

Chen, Kang 

2005 

  1. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Jeremy Johnson and Kang Chen
    A Self-Adapting Distributed Memory Package for Fast Signal Transforms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004

2002 

  1. Kang Chen
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Master thesis, Computer Science, Drexel University, 2002
  2. Kang Chen and Jeremy Johnson
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002

Chen, S. 

2024 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
    A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
    Proc. Hot Chips Symposium (HCS), 2024

2022 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
    A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022

Chen, Tianshi 

2016 

  1. Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
    Accelerating Architectural Simulation Via Statistical Techniques: A Survey
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016

Chen, Tsuhan 

2012 

  1. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Highly Efficient Performance Portable Tracking of Evolving Surfaces
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012

2011 

  1. W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
    Performance Portable Tracking of Evolving Surfaces
    Proc. High Performance Extreme Computing (HPEC), 2011

2010 

  1. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Fast and Robust Active Contours for Image Segmentation
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  2. W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
    Fast Bilateral Filtering By Adapting Block Size
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  3. W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
    High Performance Stereo Vision Designed for Massively Data Parallel Platforms
    IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010

2003 

  1. Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
    Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
    Proc. Design Automation Conference (DAC), pp. 496-501, 2003

Chen, Y. 

2016 

  1. Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
    Accelerating Architectural Simulation Via Statistical Techniques: A Survey
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016

Cheng, Z. 

2023 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Chuangsuwanich, Ekapol 

2006 

  1. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

Colella, P. 

2023 

  1. B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
    FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
    to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023

2022 

  1. H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
    ProtoX: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022

2020 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster

2019 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019

2018 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018

Cousins, D. B. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Cui, Tao 

2014 

  1. Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
    Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014

2013 

  1. Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
    An Information-Theoretic Approach to PMU Placement in Electric Power Systems
    IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
  2. Tao Cui and Franz Franchetti
    A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
    Proc. IEEE Innovative Smart Grid Technology Conference, 2013
  3. Tao Cui and Franz Franchetti
    A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
    IEEE PES General Meeting, 2013
  4. Tao Cui and Franz Franchetti
    Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
    Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
  5. Tao Cui
    Power System Probabilistic and Security Analysis Using Commodity High Performance Computing Systems
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
  6. Cory Thoma, Tao Cui and Franz Franchetti
    Privacy Preserving Smart Meter System Based Retail Level Electricity Market
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013

2012 

  1. Tao Cui and Franz Franchetti
    A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
  2. Tao Cui and Franz Franchetti
    Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
    Proc. IEEE High Performance Extreme Computing (HPEC), 2012
  3. Cory Thoma, Tao Cui and Franz Franchetti
    Secure Multiparty Computation Based Privacy Preserving Smart Metering System
    Proc. North American Power Symposium (NAPS), pp. 1-6, 2012

2011 

  1. Tao Cui and Franz Franchetti
    A Monte Carlo Framework for Probabilistic Distribution Power Flow
    Carnegie Mellon Conference on the Electricity Industry, 2011
  2. Tao Cui and Franz Franchetti
    A Multi-core High Performance Computing Framework for Distribution Power Flow
    Proc. North American Power Symposium (NAPS), 2011
  3. Tao Cui and Franz Franchetti
    Autotuning a Random Walk Boolean Satisfiability Solver
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011

2010 

  1. Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
    On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
    , 2010

D'Alberto, Paolo 

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008

2007 

  1. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  2. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007

2006 

  1. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Extreme Computing (HPEC), 2006

de Mesmay, Frédéric 

2010 

  1. Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Efficient Software Viterbi Decoders
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
  2. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
  3. Frédéric de Mesmay
    On the Computer Generation of Adaptive Numerical Libraries
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010

2009 

  1. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
  2. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  3. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2008 

  1. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Adaptive Libraries for Matrix-Multiplication
    Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
  2. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
    Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
  3. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  4. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  5. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008
  6. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008

Deri, Joya 

2016 

  1. Joya Deri, Franz Franchetti and José M. F. Moura
    Big Data Computation of Taxi Movement in New York City
    Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016

de Supinski, B. R. 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

2005 

  1. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005

Dongarra, Jack 

2005 

  1. José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
    Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005

Draeger, E. W. 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

2005 

  1. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005

Duff, B. 

2014 

  1. B. Duff, J. Larkin, M. Franusich and Franz Franchetti
    Automatic Generation of 3-D FFTs
    submitted for publication

Ebel, A. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

Egner, Sebastian 

2002 

  1. Markus Püschel, Sebastian Egner and Thomas Beth
    AREP
    in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002

2001 

  1. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  2. Sebastian Egner and Markus Püschel
    Automatic Generation of Fast Discrete Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001

Eum, Y. 

2024 

  1. Y. Eum, N. Zhang, L. Tang and Franz Franchetti
    Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2024

Fabregat-Traver, Diego 

2018 

  1. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018

Fahringer, T. 

2003 

  1. T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
    On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
    Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003

Fang, Fang 

2003 

  1. Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
    Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
    Proc. Design Automation Conference (DAC), pp. 496-501, 2003

2002 

  1. Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
    Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
    Proc. High Performance Extreme Computing (HPEC), 2002

Feeder, G. 

2013 

  1. H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
    Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
    Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013

Fraguela, Basilio B. 

2009 

  1. Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
    Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009

Franchetti, Franz 

2024 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
    A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
    Proc. Hot Chips Symposium (HCS), 2024
  2. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
  3. E. Tang, James C. Hoe and Franz Franchetti
    Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
    Proc. SRC TECHCON, 2024
  4. T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
    Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
    Proc. High Performance Extreme Computing (HPEC), 2024
  5. Y. Eum, N. Zhang, L. Tang and Franz Franchetti
    Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2024
  6. Sanil Rao, A. Prakash and Franz Franchetti
    Towards LibraryX: A Framework for Cross-Library Call Optimization
    Proc. High Performance Extreme Computing (HPEC), 2024

2023 

  1. G. Xu, Franz Franchetti and James C. Hoe
    An Approach to Generating Customized Load-Store Architectures
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
  2. Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
    FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
    Proc. Supercomputing (SC), 2023
  3. B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
    FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
    to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
  4. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  5. N. Zhang and Franz Franchetti
    Generating Number Theoretic Transforms for Multi-Word Integer Data Types
    Proc. International Symposium on Code Generation and Optimization (CGO), 2023
  6. D. Sun, N. Zhang and Franz Franchetti
    Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  7. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  8. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  9. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023
  10. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

2022 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    A Compiler for Sound Floating-Point Computations using Affine Arithmetic
    Proc. International Symposium on Code Generation and Optimization (CGO), 2022
  2. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    A Framework for Low Communication Approaches for Large Scale 3D Convolution
    Proc. International Conference on Parallel Processing (ICPP), 2022
  3. L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
    A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022
  4. P. Oostema and Franz Franchetti
    Approximating Manifolds and Geodesics with Curved Surfaces
    Proc. High Performance Extreme Computing (HPEC), 2022
  5. J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
    Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
    Proc. IEEE High Performance Extreme Computing (HPEC), 2022
  6. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022
  7. J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
    LIMA: Hardware for FFT based Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022
  8. E. Tang and Franz Franchetti
    Magic Memory: A Programming Model For Big Data Analytics
    Proc. High Performance Extreme Computing (HPEC), 2022
  9. P. Oostema and Franz Franchetti
    On Manifold Graph Embedding
    Proc. High Performance Extreme Computing (HPEC), 2022
  10. H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
    ProtoX: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022
  11. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

2021 

  1. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    Adaptation of A64 Scalable Vector Extension for Spiral
    Proc. High Performance Computing Conference (HPCC), Japan, 2021
  2. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
  3. Joao Rivera, Franz Franchetti and Markus Püschel
    An Interval Compiler for Sound Floating Point Computations
    Proc. International Symposium on Code Generation and Optimization (CGO), 2021
  4. P. Oostema and Franz Franchetti
    Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
    Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
  5. Scott Mionis, Franz Franchetti and J. Larkin
    Optimized Quantum Circuit Generation with SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2021

2020 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Multi-Dimensional DFTs
    SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
  2. Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
    FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
    Proc. IEEE International Conference on Data Engineering (ICDE), 2020
  3. Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
    FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
    North East Database Day (NEDB), 2020, Poster
  4. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
  5. Daisuke Takahashi and Franz Franchetti
    FFTE on SVE: SPIRAL-Generated Kernels
    Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020
  6. Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
    GBTLX: A First Look
    Proc. IEEE High Performance Extreme Computing (HPEC), 2020
  7. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
    Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
  8. Scott Mionis, Franz Franchetti and J. Larkin
    Quantum Circuit Optimization with SPIRAL: A First Look
    Proc. Supercomputing (SC), 2020
  9. Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
    Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
    Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020

2019 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Parallel Multi-Dimensional DFTs
    arXiv (Technical Report), 2019
  2. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
    SIAM Conference on Computational Science and Engineering (CSE), 2019
  3. Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
    Design and Specification of Large-scale Simulations for GPUs using FFTX
    Supercomputing, 2019
  4. F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
    Proc. MICRO, 2019
  5. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019
  6. Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
    FFTX for Micromechanical Stress-Strain Analysis
    IEEE High Performance Extreme Computing Conference (HPEC), 2019
  7. Yoko Franchetti, Thomas Nolin and Franz Franchetti
    Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
    SIAM Conference on Computational Science and Engineering (CSE), 2019

2018 

  1. Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
    Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
    , 2018
  2. Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
    Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
    Proc. High Performance Extreme Computing (HPEC), 2018
  3. Vit Ruzicka and Franz Franchetti
    Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
    Proc. IEEE High Performance Extreme Computing (HPEC), 2018
  4. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
  5. Vadim Zaliva and Franz Franchetti
    HELIX: A Case Study of a Formal Verification of High Performance Program Generation
    Proc. Workshop on Functional High Performance Computing (FHPC), 2018
  6. Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
    High Performance Zero-Memory Overhead Direct Convolutions
    Proc. International Conference on Machine Learning (ICML), 2018
  7. Thom Popovici, Tze-Meng Low and Franz Franchetti
    Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
  8. Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
    Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
    Proc. High Performance Extreme Computing (HPEC), 2018
  9. F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
    Proc. High Performance Extreme Computing (HPEC), 2018
  10. Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
    Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
    Proc. IEEE High Performance Extreme Computing (HPEC), 2018
  11. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
    Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
    Concurrency and Computation: Practice and Experience, 2017
  2. F. Sadi, Lawrence Pileggi and Franz Franchetti
    Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
  3. Richard Veras and Franz Franchetti
    A Scale-free Structure for Real World Networks
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
  4. Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
    First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
  5. Tze-Meng Low and Franz Franchetti
    High Assurance Code Generation for Cyber-Physical Systems
    Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
  6. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017
  7. Thom Popovici, Franz Franchetti and Tze-Meng Low
    Mixed Data Layout Kernels for Vectorized Complex Arithmetic
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
  8. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
    Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
  9. G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
    Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
    High Performance Extreme Computing Conference (HPEC), 2017
  10. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
    Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017

2016 

  1. F. Sadi, Lawrence Pileggi and Franz Franchetti
    3D DRAM Based Application Specific Hardware Accelerator for SpMV
    High Performance Extreme Computing Conference (HPEC), 2016
  2. Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
    Accelerating Architectural Simulation Via Statistical Techniques: A Survey
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
  3. Richard Veras, Tze-Meng Low and Franz Franchetti
    A Scale-Free Structure for Power-Law Graphs
    Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
  4. Joya Deri, Franz Franchetti and José M. F. Moura
    Big Data Computation of Taxi Movement in New York City
    Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016
  5. Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
    Compilers, Hands-Off My Hands-On Optimizations
    Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
  6. Berkin Akin, Franz Franchetti and James C. Hoe
    HAMLeT Architecture for Parallel Data Reorganization in Memory
    IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
  7. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

2015 

  1. Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
    A 3D-Stacked Memory Manycore Stencil Accelerator System
    Proc. Workshop on Near Data Processing (WONDP), 2015
  2. H. V. Koops and Franz Franchetti
    An Ensemble Technique for Estimating Vehicle Speed and Gear Position from Acoustic Data
    Proc. International Conference on Digital Signal Processing (DSP), 2015
  3. H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
    Proc. Design Automation Conference (DAC), 2015
  4. Berkin Akin, Franz Franchetti and James C. Hoe
    Data Reorganization in Memory Using 3D-stacked DRAM
    Proc. International Symposium on Computer Architectur (ISCA), 2015
  5. Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    Enabling Portable Energy Efficiency with Memory Accelerated Library
    Proc. MICRO, 2015
  6. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015
  7. Berkin Akin, Franz Franchetti and James C. Hoe
    FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
    Journal of Signal Processing Systems, 2015
  8. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
  9. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
  10. Tze-Meng Low, Qi Guo and Franz Franchetti
    Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
    Proc. High Performance Extreme Computing (HPEC), 2015
  11. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
    Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014
  2. Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
    Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
  3. F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  4. B. Duff, J. Larkin, M. Franusich and Franz Franchetti
    Automatic Generation of 3-D FFTs
    submitted for publication
  5. Vadim Zaliva and Franz Franchetti
    Barometric and GPS Altitude Sensor Fusion
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
  6. K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
    Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
  7. T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
    Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
    Proc. Materials Science & Technology, 2014
  8. Berkin Akin, Franz Franchetti and James C. Hoe
    FFTs with Near-Optimal Memory Access Through Block Data Layouts
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
  9. Berkin Akin, James C. Hoe and Franz Franchetti
    HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  10. Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
    High Assurance SPIRAL
    Proc. SPIE, Proceedings of SPIE 2014, 2014
  11. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
    Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication
  12. Berkin Akin, Franz Franchetti and James C. Hoe
    Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014

2013 

  1. Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
  2. Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
    Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
  3. Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
    An Information-Theoretic Approach to PMU Placement in Electric Power Systems
    IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
  4. Tao Cui and Franz Franchetti
    A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
    Proc. IEEE Innovative Smart Grid Technology Conference, 2013
  5. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    A Smart Memory Accelerated Computed Tomography Parallel Backprojection
    in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
  6. Tao Cui and Franz Franchetti
    A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
    IEEE PES General Meeting, 2013
  7. Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
    A Stencil Compiler for Short-Vector SIMD Architectures
    Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
  8. Tao Cui and Franz Franchetti
    Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
    Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
  9. Cory Thoma, Tao Cui and Franz Franchetti
    Privacy Preserving Smart Meter System Based Retail Level Electricity Market
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
  10. H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
    Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
    Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
  11. Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
    When Polyhedral Transformations Meet SIMD Code Generation
    Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013

2012 

  1. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
    ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
  2. Tao Cui and Franz Franchetti
    A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
  3. C. Angelopoulos, Franz Franchetti and Markus Püschel
    Automatic Generation of FFT Libraries for GPUs
    NVIDIA Research Summit at the GPU Technology Conference, 2012
  4. Franz Franchetti, Yevgen Voronenko and G. Almasi
    Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
    Proc. High Performance Computing for Computational Science (VECPAR), 2012
  5. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Computer Generation of Hardware for Linear Digital Signal Processing Transforms
    ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
  6. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
    Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
  7. Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
    Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
  8. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Highly Efficient Performance Portable Tracking of Evolving Surfaces
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
  9. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
    Journal of Signal Processing Systems, 2012
  10. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
  11. Tao Cui and Franz Franchetti
    Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
    Proc. IEEE High Performance Extreme Computing (HPEC), 2012
  12. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
  13. Cory Thoma, Tao Cui and Franz Franchetti
    Secure Multiparty Computation Based Privacy Preserving Smart Metering System
    Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
  14. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
    Proc. SRC TECHCON, 2012

2011 

  1. Tao Cui and Franz Franchetti
    A Monte Carlo Framework for Probabilistic Distribution Power Flow
    Carnegie Mellon Conference on the Electricity Industry, 2011
  2. Tao Cui and Franz Franchetti
    A Multi-core High Performance Computing Framework for Distribution Power Flow
    Proc. North American Power Symposium (NAPS), 2011
  3. Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
    Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
    Proc. High Performance Extreme Computing (HPEC), 2011
  4. Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    Proc. International Conference on Supercomputing (ICS), 2011
  5. Tao Cui and Franz Franchetti
    Autotuning a Random Walk Boolean Satisfiability Solver
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011
  6. Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
    Data Layout Transformation for Stencil Computations on Short SIMD Architectures
    Proc. International Conference on Compiler Construction (CC), 2011
  7. Franz Franchetti and Markus Püschel
    Fast Fourier Transform
    in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
  8. W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
    Performance Portable Tracking of Evolving Surfaces
    Proc. High Performance Extreme Computing (HPEC), 2011
  9. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
  10. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011

2010 

  1. Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Efficient Software Viterbi Decoders
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
  2. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010
  3. C. Angelopoulos, Franz Franchetti and Markus Püschel
    DFT Transform on the Fermi (GTX480): Automatic Program Generation
    NVIDIA Research Summit at the GPU Technology Conference, 2010
  4. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Fast and Robust Active Contours for Image Segmentation
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  5. W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
    Fast Bilateral Filtering By Adapting Block Size
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  6. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
  7. W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
    High Performance Stereo Vision Designed for Massively Data Parallel Platforms
    IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
  8. Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
    On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
    , 2010
  9. Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
    Spiral-Generated Modular FFT Algorithms
    Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010

2009 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Extreme Computing (HPEC), 2009
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  3. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  4. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  5. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009
  6. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  7. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2008 

  1. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Adaptive Libraries for Matrix-Multiplication
    Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
  2. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
    Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
  4. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  5. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  6. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  7. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  8. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  9. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    How To Write Fast Numerical Code: A Small Introduction
    Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
  10. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  11. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008
  12. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008

2007 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Fast Fourier Transform on FPGA: Design Choices and Evaluation
    Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  4. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  5. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
  6. Franz Franchetti and Markus Püschel
    SIMD Vectorization of Non-Two-Power Sized FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007

2006 

  1. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  2. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  3. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Extreme Computing (HPEC), 2006
  4. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  5. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006
  6. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
  7. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
  8. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Spiral: Generating Signal Processing Kernels for New Commodity Architectures
    Proc. EDGE Workshop, pp. D49-D50, 2006

2005 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
    Efficient Utilization of SIMD Extensions
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
  3. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005
  4. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
  5. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
  6. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
  7. Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
    Vectorization Techniques for the BlueGene/L Double FPU
    IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005

2004 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004

2003 

  1. Franz Franchetti
    A Portable Short Vector Version of FFTW
    Proc. IMACS Symposium on Mathematical Modelling (MATHMOD), Vol. 2, pp. 1539-1548, 2003
  2. T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
    On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
    Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
  3. Franz Franchetti
    Performance Portable Short Vector Transforms
    PhD. thesis, Vienna University of Technology, 2003
  4. Franz Franchetti and Markus Püschel
    Short Vector Code Generation and Adaptation for DSP Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
  5. Franz Franchetti and Markus Püschel
    Short Vector Code Generation for the Discrete Fourier Transform
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
  6. Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
    SIMD Vectorization of Straight Line Code
    Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
  7. Franz Franchetti
    Top Performance in Signal Processing
    Proc. International Workshop on Numerical and Symbolic Scientific Computing, 2003

2002 

  1. Franz Franchetti and Markus Püschel
    A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
  2. Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
    FFT Kernels with FMA Utilization
    Proc. APLIMAT Conference, pp. 333-339, 2002
  3. Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Short Vector SIMD Code Generation for DSP Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2002

2001 

  1. Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
    Architecture Independent Short Vector FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001

2000 

  1. Franz Franchetti
    Short Vector FFTs
    Master thesis, Vienna University of Technology, 2000

Franchetti, Yoko 

2019 

  1. Yoko Franchetti, Thomas Nolin and Franz Franchetti
    Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
    SIAM Conference on Computational Science and Engineering (CSE), 2019

Franusich, M. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

2020 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster

2019 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019

2018 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

2014 

  1. B. Duff, J. Larkin, M. Franusich and Franz Franchetti
    Automatic Generation of 3-D FFTs
    submitted for publication

French, M. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Fu, S. 

2024 

  1. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024

Furis, Mihai 

2005 

  1. Mihai Furis, Pawel Hitczenko and Jeremy Johnson
    Cache Miss Analysis of WHT Algorithms
    Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005

2003 

  1. Mihai Furis
    Cache Miss Analysis of Walsh-Hadamard Transform Algorithms
    Master thesis, Computer Science, Drexel University, 2003

Gacic, Aca 

2005 

  1. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
  2. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Aca Gacic, Markus Püschel and José M. F. Moura
    Automatically Generated High-Performance Code for Discrete Wavelet Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
  2. Aca Gacic
    Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004

2003 

  1. Aca Gacic, Markus Püschel and José M. F. Moura
    Fast Automatic Implementations of FIR Filters
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003

Gamil, H. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Gao, M. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Garg, Kashish 

2017 

  1. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
    Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
  2. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
    Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017

Garzarán, María J. 

2005 

  1. Xiaoming Li, María J. Garzarán and David Padua
    Optimizing Sorting with Genetic Algorithm
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005

2004 

  1. Xiaoming Li, María J. Garzarán and David Padua
    A Dynamically Tuned Sorting Library
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004

Geissler, M. 

2003 

  1. T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
    On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
    Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003

Gilbert, J. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Glick, Madeleine 

2012 

  1. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012

2011 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011

2010 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  2. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  3. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Gong, Z. 

2022 

  1. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

Goodson, K. E. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Gross, Thomas 

2014 

  1. Benjamin Hess, Thomas Gross and Markus Püschel
    Automatic Locality-Friendly Interface Extension of Numerical Functions
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014

Gui, Liangyan 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

Gunnels, J. A. 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

2005 

  1. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005

Guo, Qi 

2016 

  1. Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
    Accelerating Architectural Simulation Via Statistical Techniques: A Survey
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016

2015 

  1. Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
    A 3D-Stacked Memory Manycore Stencil Accelerator System
    Proc. Workshop on Near Data Processing (WONDP), 2015
  2. Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    Enabling Portable Energy Efficiency with Memory Accelerated Library
    Proc. MICRO, 2015
  3. Tze-Meng Low, Qi Guo and Franz Franchetti
    Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
    Proc. High Performance Extreme Computing (HPEC), 2015

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014

Gygi, F. 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

2005 

  1. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005

Haentjens, Gavin 

2000 

  1. Gavin Haentjens
    An Investigation of Recursive FFT Implementations
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000

Han, Sung-Chul 

2007 

  1. Sung-Chul Han
    A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007

2006 

  1. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006

Harisrikanth, K. 

2024 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
    A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
    Proc. Hot Chips Symposium (HCS), 2024

2022 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
    A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022

Henretty, Tom 

2013 

  1. Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
    A Stencil Compiler for Short-Vector SIMD Architectures
    Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013

2011 

  1. Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
    Data Layout Transformation for Stencil Computations on Short SIMD Architectures
    Proc. International Conference on Compiler Construction (CC), 2011

Hess, Benjamin 

2014 

  1. Benjamin Hess, Thomas Gross and Markus Püschel
    Automatic Locality-Friendly Interface Extension of Numerical Functions
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014

2013 

  1. Benjamin Hess
    Automatic Refactoring: Locality Friendly Interface Enhancements for Numerical Functions
    Master thesis, Computer Science, ETH Zurich, Switzerland, 2013

Hess, Benjamin 

2014 

  1. Benjamin Hess, Thomas Gross and Markus Püschel
    Automatic Locality-Friendly Interface Extension of Numerical Functions
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014

2013 

  1. Benjamin Hess
    Automatic Refactoring: Locality Friendly Interface Enhancements for Numerical Functions
    Master thesis, Computer Science, ETH Zurich, Switzerland, 2013

Heyman, B. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023

Hills, G. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Hitczenko, Pawel 

2006 

  1. Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
    Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
    Theoretical Computer Science, Vol. 352, pp. 8-30, 2006

2005 

  1. Mihai Furis, Pawel Hitczenko and Jeremy Johnson
    Cache Miss Analysis of WHT Algorithms
    Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005

Hoe, James C. 

2024 

  1. E. Tang, James C. Hoe and Franz Franchetti
    Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
    Proc. SRC TECHCON, 2024

2023 

  1. G. Xu, Franz Franchetti and James C. Hoe
    An Approach to Generating Customized Load-Store Architectures
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023

2019 

  1. F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
    Proc. MICRO, 2019

2018 

  1. F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
    Proc. High Performance Extreme Computing (HPEC), 2018
  2. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
    Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
    High Performance Extreme Computing Conference (HPEC), 2017

2016 

  1. Berkin Akin, Franz Franchetti and James C. Hoe
    HAMLeT Architecture for Parallel Data Reorganization in Memory
    IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016

2015 

  1. Berkin Akin, Franz Franchetti and James C. Hoe
    Data Reorganization in Memory Using 3D-stacked DRAM
    Proc. International Symposium on Computer Architectur (ISCA), 2015
  2. Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    Enabling Portable Energy Efficiency with Memory Accelerated Library
    Proc. MICRO, 2015
  3. Berkin Akin, Franz Franchetti and James C. Hoe
    FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
    Journal of Signal Processing Systems, 2015

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014
  2. F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  3. Berkin Akin, Franz Franchetti and James C. Hoe
    FFTs with Near-Optimal Memory Access Through Block Data Layouts
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
  4. Berkin Akin, James C. Hoe and Franz Franchetti
    HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  5. Berkin Akin, Franz Franchetti and James C. Hoe
    Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014

2013 

  1. Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
  2. Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
    Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013

2012 

  1. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
    ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Computer Generation of Hardware for Linear Digital Signal Processing Transforms
    ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
  3. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Highly Efficient Performance Portable Tracking of Evolving Surfaces
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
  4. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
  5. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012

2011 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
  2. W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
    Performance Portable Tracking of Evolving Surfaces
    Proc. High Performance Extreme Computing (HPEC), 2011

2010 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  2. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Fast and Robust Active Contours for Image Segmentation
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  3. W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
    Fast Bilateral Filtering By Adapting Block Size
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  4. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
  5. W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
    High Performance Stereo Vision Designed for Massively Data Parallel Platforms
    IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
  6. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  7. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
    Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
  3. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
  4. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  4. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

2007 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Fast Fourier Transform on FPGA: Design Choices and Evaluation
    Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  4. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  5. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007

2006 

  1. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Extreme Computing (HPEC), 2006
  2. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006
  3. Marek Telgarsky, James C. Hoe and José M. F. Moura
    Spiral: Joint Runtime and Energy Optimization of Linear Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006

2005 

  1. Roland Wunderlich, Markus Püschel and James C. Hoe
    Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
    Proc. High Performance Extreme Computing (HPEC), 2005
  2. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005

2004 

  1. Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
    Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
  2. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
  3. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Extreme Computing (HPEC), 2004
  4. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004

2003 

  1. Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
    Custom Reduction of Arithmetic in Linear DSP Transforms
    Proc. High Performance Extreme Computing (HPEC), 2003

2002 

  1. Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
    Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
    Proc. High Performance Extreme Computing (HPEC), 2002

Hong, Inpyo 

2004 

  1. Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
    Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2004

Horowitz, M. 

2012 

  1. Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
    Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012

Huang, Hung-Jen 

2006 

  1. Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
    Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
    Theoretical Computer Science, Vol. 352, pp. 8-30, 2006

2002 

  1. Hung-Jen Huang
    Performance Analysis of an Adaptive Algorithm for the Walsh-Hadamard Transform
    Master thesis, Computer Science, Drexel University, 2002

Hug, Gabriela 

2014 

  1. Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
    Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014

Hutchison, D. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Ilic, Marija 

2013 

  1. Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
    An Information-Theoretic Approach to PMU Placement in Electric Power Systems
    IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013

2010 

  1. Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
    On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
    , 2010

Jacob, A. 

2023 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Jaiswal, A. 

2023 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Johnson, Jeremy 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023
  3. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

2018 

  1. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

2014 

  1. Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
    High Assurance SPIRAL
    Proc. SPIE, Proceedings of SPIE 2014, 2014
  2. Lingchuan Meng and Jeremy Johnson
    Towards parallel general-size library generation for polynomial multiplication
    ACM Communications in Computer Algebra, 2014

2013 

  1. Lingchuan Meng and Jeremy Johnson
    Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
    Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013

2010 

  1. Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
    Spiral-Generated Modular FFT Algorithms
    Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010

2008 

  1. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  2. Jeremy Johnson and Michael Andrews
    Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

2007 

  1. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  2. Jeremy Johnson and Xu Xu
    Generating Symmetric DFTs and Equivariant FFT Algorithms
    Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
  3. Jeremy Johnson and Michael Andrews
    Performance Analysis of a Family of WHT Algorithms
    Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium , IEEE, pp. 450, 2007

2006 

  1. Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
    Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
    Theoretical Computer Science, Vol. 352, pp. 8-30, 2006

2005 

  1. Mihai Furis, Pawel Hitczenko and Jeremy Johnson
    Cache Miss Analysis of WHT Algorithms
    Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
  2. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Jeremy Johnson and Kang Chen
    A Self-Adapting Distributed Memory Package for Fast Signal Transforms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
  2. Anthony F. Breitzman and Jeremy Johnson
    Automatic Derivation and Implementation of Fast Convolution Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Computer Algebra and Signal Processing'', Vol. 37, No. 2, pp. 157-186, 2004
  3. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2003 

  1. Jeremy Johnson and Xu Xu
    A Recursive Implementation of the Dimensionless FFT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003

2002 

  1. Kang Chen and Jeremy Johnson
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002

2001 

  1. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001
  3. Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
    SPL: A Language and Compiler for DSP Algorithms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001

2000 

  1. Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
    Design, optimization, and implementation of a universal FFT processor
    Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
  2. Jeremy Johnson and Markus Püschel
    In Search of the Optimal Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
  3. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

Johnson, Robert W. 

2005 

  1. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2001 

  1. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001
  2. Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
    SPL: A Language and Compiler for DSP Algorithms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001

2000 

  1. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

Jones, Douglas F. 

2010 

  1. Douglas F. Jones
    Data Pump Architecture Simulator and Performance Model
    Master thesis, Computer Science, Drexel University, 2010

Kaltenberger, F. 

2002 

  1. Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
    FFT Kernels with FMA Utilization
    Proc. APLIMAT Conference, pp. 333-339, 2002

Kang, Dongsoo 

2000 

  1. Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
    Dynamic Data Layouts for Cache-conscious Factorization of DFT
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000

Kar, Soummya 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

Karner, H. 

2001 

  1. Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
    Architecture Independent Short Vector FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001

Katagiri, T. 

2021 

  1. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    Adaptation of A64 Scalable Vector Extension for Spiral
    Proc. High Performance Computing Conference (HPCC), Japan, 2021
  2. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021

Kelly, P. H. J. 

2017 

  1. Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
    Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
    Concurrency and Computation: Practice and Experience, 2017

2015 

  1. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
  2. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015

Kepner, J. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Killey, Robert I. 

2011 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011

2010 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  2. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  3. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Kim, Kyungjoo 

2019 

  1. Mark Blanco, Tze-Meng Low and Kyungjoo Kim
    Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
    Proc. High Performance Extreme Computing (HPEC), 2019

Kim, Munsung (Bill) 

2017 

  1. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
    Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
  2. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
    Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017

Kitai, N. 

2021 

  1. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    Adaptation of A64 Scalable Vector Extension for Spiral
    Proc. High Performance Computing Conference (HPCC), Japan, 2021
  2. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021

Kong, Martin 

2013 

  1. Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
    When Polyhedral Transformations Meet SIMD Code Generation
    Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013

Koops, H. V. 

2017 

  1. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
    Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
  2. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
    Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017

2015 

  1. H. V. Koops and Franz Franchetti
    An Ensemble Technique for Estimating Vehicle Speed and Gear Position from Acoustic Data
    Proc. International Conference on Digital Signal Processing (DSP), 2015

Koutsoyannis, Robert 

2012 

  1. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012

2011 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011

2010 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  2. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  3. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Kovacevic, Jelena 

2022 

  1. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    A Framework for Low Communication Approaches for Large Scale 3D Convolution
    Proc. International Conference on Parallel Processing (ICPP), 2022

2020 

  1. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
    Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020

2019 

  1. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
    SIAM Conference on Computational Science and Engineering (CSE), 2019

2018 

  1. Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
    Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
    , 2018
  2. Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
    Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
    Proc. High Performance Extreme Computing (HPEC), 2018

Kozyrakis, C. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Kral, Stefan 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

2005 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
    Efficient Utilization of SIMD Extensions
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
  2. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005
  3. Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
    Vectorization Techniques for the BlueGene/L Double FPU
    IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005

2004 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004

2003 

  1. Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
    SIMD Vectorization of Straight Line Code
    Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003

2001 

  1. Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
    Architecture Independent Short Vector FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001

Krause, Andreas 

2016 

  1. Marcela Zuluaga, Andreas Krause and Markus Püschel
    e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
    Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016

2013 

  1. Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
    Active Learning for Multi-Objective Optimization
    Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
  2. Marcela Zuluaga, Andreas Krause and Markus Püschel
    Multi-Objective Optimization for High-Level Synthesis
    Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013

2012 

  1. Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
    "Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
    Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012

Kulkarni, Anuva 

2022 

  1. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    A Framework for Low Communication Approaches for Large Scale 3D Convolution
    Proc. International Conference on Parallel Processing (ICPP), 2022

2020 

  1. Anuva Kulkarni
    An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
  2. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
  3. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
    Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020

2019 

  1. Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
    Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
    SIAM Conference on Computational Science and Engineering (CSE), 2019
  2. Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
    Design and Specification of Large-scale Simulations for GPUs using FFTX
    Supercomputing, 2019
  3. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019
  4. Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
    FFTX for Micromechanical Stress-Strain Analysis
    IEEE High Performance Extreme Computing Conference (HPEC), 2019

2018 

  1. Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
    Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
    , 2018
  2. Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
    Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
    Proc. High Performance Extreme Computing (HPEC), 2018
  3. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018

Kumar, M. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Kumhom, Pinit 

2001 

  1. Pinit Kumhom
    Design, Optimization, and Implementation of a Universal FFT Processor
    PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001

2000 

  1. Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
    Design, optimization, and implementation of a universal FFT processor
    Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000

Kutuluru, A. 

2020 

  1. Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
    GBTLX: A First Look
    Proc. IEEE High Performance Extreme Computing (HPEC), 2020

2018 

  1. Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
    Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
    Proc. High Performance Extreme Computing (HPEC), 2018

Kyrtatas, Nikolaos 

2015 

  1. Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Embedded Processors
    Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015

2014 

  1. Nikolaos Kyrtatas
    A Basic Linear Algebra Compiler for Embedded Processors
    Master thesis, Computer Science, ETH Zurich, Switzerland, 2014

Larkin, J. 

2021 

  1. Scott Mionis, Franz Franchetti and J. Larkin
    Optimized Quantum Circuit Generation with SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2021

2020 

  1. Scott Mionis, Franz Franchetti and J. Larkin
    Quantum Circuit Optimization with SPIRAL: A First Look
    Proc. Supercomputing (SC), 2020

2014 

  1. B. Duff, J. Larkin, M. Franusich and Franz Franchetti
    Automatic Generation of 3-D FFTs
    submitted for publication

Lee, C.-S. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Lee, Matthew 

2017 

  1. Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
    First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017

Lee, Yu-Chiang 

2006 

  1. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

Lengauer, Christian 

2017 

  1. Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
    Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
    Concurrency and Computation: Practice and Experience, 2017

Li, Jonathan 

2017 

  1. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
    Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
  2. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
    Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017

Li, Q. 

2013 

  1. Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
    An Information-Theoretic Approach to PMU Placement in Electric Power Systems
    IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013

2010 

  1. Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
    On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
    , 2010

Li, Xiaoming 

2005 

  1. Xiaoming Li, María J. Garzarán and David Padua
    Optimizing Sorting with Genetic Algorithm
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005

2004 

  1. Xiaoming Li, María J. Garzarán and David Padua
    A Dynamically Tuned Sorting Library
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004

Liu, R. 

2014 

  1. K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
    Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014

Lorenz, Juergen 

2006 

  1. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  2. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006
  3. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

2005 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
    Efficient Utilization of SIMD Extensions
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
  2. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005
  3. Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
    Vectorization Techniques for the BlueGene/L Double FPU
    IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005

2004 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004

2003 

  1. Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
    SIMD Vectorization of Straight Line Code
    Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003

Low, Tze-Meng 

2021 

  1. Mark Blanco, S. McMillan and Tze-Meng Low
    Delayed Asynchronous Iterative Graph Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2021

2020 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Multi-Dimensional DFTs
    SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
  2. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
  3. Mark Blanco, S. McMillan and Tze-Meng Low
    Towards an Objective Metric for the Performance of Exact Triangle Count
    Proc. High Performance Extreme Computing (HPEC), 2020

2019 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Parallel Multi-Dimensional DFTs
    arXiv (Technical Report), 2019
  2. F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
    Proc. MICRO, 2019
  3. Mark Blanco, Tze-Meng Low and Kyungjoo Kim
    Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
    Proc. High Performance Extreme Computing (HPEC), 2019
  4. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019

2018 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
  2. Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
    High Performance Zero-Memory Overhead Direct Convolutions
    Proc. International Conference on Machine Learning (ICML), 2018
  3. Thom Popovici, Tze-Meng Low and Franz Franchetti
    Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
  4. Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
    Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
    Proc. High Performance Extreme Computing (HPEC), 2018
  5. F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
    Proc. High Performance Extreme Computing (HPEC), 2018
  6. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
    First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
  2. Tze-Meng Low and Franz Franchetti
    High Assurance Code Generation for Cyber-Physical Systems
    Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
  3. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017
  4. Thom Popovici, Franz Franchetti and Tze-Meng Low
    Mixed Data Layout Kernels for Vectorized Complex Arithmetic
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
  5. G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
    Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
    High Performance Extreme Computing Conference (HPEC), 2017

2016 

  1. Richard Veras, Tze-Meng Low and Franz Franchetti
    A Scale-Free Structure for Power-Law Graphs
    Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
  2. Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
    Compilers, Hands-Off My Hands-On Optimizations
    Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016

2015 

  1. Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
    A 3D-Stacked Memory Manycore Stencil Accelerator System
    Proc. Workshop on Near Data Processing (WONDP), 2015
  2. Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    Enabling Portable Energy Efficiency with Memory Accelerated Library
    Proc. MICRO, 2015
  3. Tze-Meng Low, Qi Guo and Franz Franchetti
    Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
    Proc. High Performance Extreme Computing (HPEC), 2015

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014

Lu, Yi 

2020 

  1. Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
    FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
    Proc. IEEE International Conference on Data Engineering (ICDE), 2020

Lumsdaine, A. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Madsen, G. 

2003 

  1. T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
    On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
    Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003

Mai, K. 

2024 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
    A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
    Proc. Hot Chips Symposium (HCS), 2024

2022 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
    A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022
  2. J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
    LIMA: Hardware for FFT based Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

Maniatakos, M. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Mankad, H. 

2023 

  1. Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
    FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
    Proc. Supercomputing (SC), 2023

2022 

  1. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022
  2. H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
    ProtoX: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022

Markov, I. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Mathew, C. 

2023 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Mattson, T. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Maza, Marc Moreno 

2010 

  1. Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
    Spiral-Generated Modular FFT Algorithms
    Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010

McCorquodale, P. 

2020 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster

2019 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019

2018 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018

McFarlin, Daniel 

2011 

  1. Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    Proc. International Conference on Supercomputing (ICS), 2011

2009 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Extreme Computing (HPEC), 2009
  2. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  3. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2008 

  1. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

McMillan, S. 

2021 

  1. Mark Blanco, S. McMillan and Tze-Meng Low
    Delayed Asynchronous Iterative Graph Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2021

2020 

  1. Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
    GBTLX: A First Look
    Proc. IEEE High Performance Extreme Computing (HPEC), 2020
  2. Mark Blanco, S. McMillan and Tze-Meng Low
    Towards an Objective Metric for the Performance of Exact Triangle Count
    Proc. High Performance Extreme Computing (HPEC), 2020

2018 

  1. Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
    Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
    Proc. High Performance Extreme Computing (HPEC), 2018
  2. F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
    Proc. High Performance Extreme Computing (HPEC), 2018
  3. Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
    Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
    Proc. IEEE High Performance Extreme Computing (HPEC), 2018

2017 

  1. Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
    First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Mendoza, Juan Pablo 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

Meng, Lingchuan 

2014 

  1. Lingchuan Meng and Jeremy Johnson
    Towards parallel general-size library generation for polynomial multiplication
    ACM Communications in Computer Algebra, 2014

2013 

  1. Lingchuan Meng and Jeremy Johnson
    Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
    Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013

2010 

  1. Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
    Spiral-Generated Modular FFT Algorithms
    Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010

Meyerhenke, H. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Milder, Peter A. 

2016 

  1. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Streaming Sorting Networks
    ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016

2012 

  1. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
    ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Computer Generation of Hardware for Linear Digital Signal Processing Transforms
    ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
  3. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Computer Generation of Streaming Sorting Networks
    Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
  4. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
  5. Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
    Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
  6. Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
    "Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
    Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012

2011 

  1. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011

2010 

  1. Peter A. Milder
    A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
  2. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
  4. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  5. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
    Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
  3. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
  4. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  4. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

2007 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Fast Fourier Transform on FPGA: Design Choices and Evaluation
    Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  4. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007

2006 

  1. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Extreme Computing (HPEC), 2006
  2. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006

2005 

  1. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005

Mionis, Scott 

2021 

  1. Scott Mionis, Franz Franchetti and J. Larkin
    Optimized Quantum Circuit Generation with SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2021

2020 

  1. Scott Mionis, Franz Franchetti and J. Larkin
    Quantum Circuit Optimization with SPIRAL: A First Look
    Proc. Supercomputing (SC), 2020

Misra, Smarahara 

2004 

  1. Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
    Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004

2003 

  1. Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
    Custom Reduction of Arithmetic in Linear DSP Transforms
    Proc. High Performance Extreme Computing (HPEC), 2003
  2. Smarahara Misra
    Custom Reduction of Arithmetic in Multiplierless implementations of DSP Transforms
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2003

2002 

  1. Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
    Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
    Proc. High Performance Extreme Computing (HPEC), 2002

Mitra, S. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Mitsch, Stefan 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

Mohr, Marcus 

2017 

  1. Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
    Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
    Concurrency and Computation: Practice and Experience, 2017

Monil, M. A. H. 

2023 

  1. Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
    FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
    Proc. Supercomputing (SC), 2023

Moopan, M. N. T. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023

Moreira, J. 

2022 

  1. J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
    Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
    Proc. IEEE High Performance Extreme Computing (HPEC), 2022

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Moritsch, H. 

2003 

  1. T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
    On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
    Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003

Moura, José M. F. 

2018 

  1. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

2016 

  1. Joya Deri, Franz Franchetti and José M. F. Moura
    Big Data Computation of Taxi Movement in New York City
    Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016

2011 

  1. W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
    Performance Portable Tracking of Evolving Surfaces
    Proc. High Performance Extreme Computing (HPEC), 2011

2009 

  1. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  2. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009

2008 

  1. Markus Püschel and José M. F. Moura
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
    IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
  2. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  3. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

2007 

  1. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007

2006 

  1. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Extreme Computing (HPEC), 2006
  2. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
  3. Marek Telgarsky, James C. Hoe and José M. F. Moura
    Spiral: Joint Runtime and Energy Optimization of Linear Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006

2005 

  1. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
  2. José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
    Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
  3. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Aca Gacic, Markus Püschel and José M. F. Moura
    Automatically Generated High-Performance Code for Discrete Wavelet Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
  2. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2003 

  1. Aca Gacic, Markus Püschel and José M. F. Moura
    Fast Automatic Implementations of FIR Filters
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
  2. Markus Püschel and José M. F. Moura
    SPIRAL: An Overview
    Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
  3. Markus Püschel and José M. F. Moura
    The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
    SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003

2002 

  1. Markus Püschel and José M. F. Moura
    Generation and Manipulation of DSP Transform Algorithms
    Proc. Digital Signal Processing Workshop, 2002
  2. Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Short Vector SIMD Code Generation for DSP Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2002

2001 

  1. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001

2000 

  1. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

Nabeel, M. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023

Nagai, T. 

2021 

  1. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    Adaptation of A64 Scalable Vector Extension for Spiral
    Proc. High Performance Computing Conference (HPCC), Japan, 2021
  2. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021

Nagvajara, Prawat 

2008 

  1. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

2000 

  1. Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
    Design, optimization, and implementation of a universal FFT processor
    Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000

Neda, N. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  3. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  4. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Neda, N. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  3. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  4. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Negi, Rohit 

2013 

  1. Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
    An Information-Theoretic Approach to PMU Placement in Electric Power Systems
    IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013

2010 

  1. Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
    On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
    , 2010

Ngaw, M. 

2022 

  1. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

Nguyen, J. 

2022 

  1. J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
    Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
    Proc. IEEE High Performance Extreme Computing (HPEC), 2022
  2. J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
    LIMA: Hardware for FFT based Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

Nolin, Thomas 

2019 

  1. Yoko Franchetti, Thomas Nolin and Franz Franchetti
    Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
    SIAM Conference on Computational Science and Engineering (CSE), 2019

Nordin, Grace 

2005 

  1. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005

2004 

  1. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Extreme Computing (HPEC), 2004

Nwankpa, Chika 

2008 

  1. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

Odersky, Martin 

2014 

  1. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Language Support for the Construction of High Performance Code Generators
    Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014

2013 

  1. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

Ofenbeck, Georg 

2017 

  1. Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Staging for Generic Programming in Space and Time
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017

2014 

  1. Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
    Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
  2. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
  3. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Language Support for the Construction of High Performance Code Generators
    Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014

2013 

  1. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

Ohshima, S. 

2021 

  1. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    Adaptation of A64 Scalable Vector Extension for Spiral
    Proc. High Performance Computing Conference (HPCC), Japan, 2021
  2. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021

Olukoton, K. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Oostema, P. 

2022 

  1. P. Oostema and Franz Franchetti
    Approximating Manifolds and Geodesics with Curved Surfaces
    Proc. High Performance Extreme Computing (HPEC), 2022
  2. P. Oostema and Franz Franchetti
    On Manifold Graph Embedding
    Proc. High Performance Extreme Computing (HPEC), 2022

2021 

  1. P. Oostema and Franz Franchetti
    Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
    Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021

Owens, J. D. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Ozturk, T. 

2015 

  1. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
    Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

2014 

  1. T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
    Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
    Proc. Materials Science & Technology, 2014
  2. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
    Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

Padua, David 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

2005 

  1. Xiaoming Li, María J. Garzarán and David Padua
    Optimizing Sorting with Genetic Algorithm
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
  2. José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
    Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
  3. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Xiaoming Li, María J. Garzarán and David Padua
    A Dynamically Tuned Sorting Library
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
  2. Nicholas Rizzolo and David Padua
    HiLO: High Level Optimization of FFTs
    Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004
  3. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2001 

  1. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001
  3. Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
    SPL: A Language and Compiler for DSP Algorithms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001

2000 

  1. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

Park, Neungsoo 

2001 

  1. Neungsoo Park and Viktor K. Prasanna
    Cache Conscious Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001

2000 

  1. Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
    Dynamic Data Layouts for Cache-conscious Factorization of DFT
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000

Patterson, A. 

2013 

  1. H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
    Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
    Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013

Pedram, M. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Peng, Ronghui 

2010 

  1. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010

Peter, Thomas 

2006 

  1. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

Phaosawasdi, Amarin 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

Piazza, G. 

2013 

  1. H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
    Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
    Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013

Pileggi, Lawrence 

2019 

  1. F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
    Proc. MICRO, 2019

2018 

  1. F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
    Proc. High Performance Extreme Computing (HPEC), 2018

2017 

  1. F. Sadi, Lawrence Pileggi and Franz Franchetti
    Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017

2016 

  1. F. Sadi, Lawrence Pileggi and Franz Franchetti
    3D DRAM Based Application Specific Hardware Accelerator for SpMV
    High Performance Extreme Computing Conference (HPEC), 2016

2015 

  1. H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
    Proc. Design Automation Conference (DAC), 2015
  2. Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    Enabling Portable Energy Efficiency with Memory Accelerated Library
    Proc. MICRO, 2015
  3. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014
  2. F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  3. K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
    Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014

2013 

  1. Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
  2. Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
    Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
  3. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    A Smart Memory Accelerated Computed Tomography Parallel Backprojection
    in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
  4. H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
    Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
    Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013

2012 

  1. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
    Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
  2. Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
    Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
  3. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
    Journal of Signal Processing Systems, 2012
  4. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
  5. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
    Proc. SRC TECHCON, 2012

2011 

  1. Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
    Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
    Proc. High Performance Extreme Computing (HPEC), 2011

Pipatsrisawat, Thammanit 

2005 

  1. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005

Pitner, G. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Platzer, Andre' 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

Pokharel, R. 

2015 

  1. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
    Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

2014 

  1. T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
    Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
    Proc. Materials Science & Technology, 2014
  2. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
    Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

Polyakov, Y. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Pop, E. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Popovici, Thom 

2020 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Multi-Dimensional DFTs
    SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
  2. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster

2019 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Parallel Multi-Dimensional DFTs
    arXiv (Technical Report), 2019
  2. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019

2018 

  1. Thom Popovici
    An Approach to Specifying and Automatically Optimizing Fourier Transform Based Operations
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
  2. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
  3. Thom Popovici, Tze-Meng Low and Franz Franchetti
    Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
  4. Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
    Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
    Proc. High Performance Extreme Computing (HPEC), 2018
  5. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
    First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
  2. Thom Popovici, Franz Franchetti and Tze-Meng Low
    Mixed Data Layout Kernels for Vectorized Complex Arithmetic
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017

2016 

  1. Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
    Compilers, Hands-Off My Hands-On Optimizations
    Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016

2015 

  1. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
  2. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
  3. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
    Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

2014 

  1. F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014
  2. T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
    Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
    Proc. Materials Science & Technology, 2014
  3. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
    Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

Pouchet, Louis-Noël 

2013 

  1. Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
    A Stencil Compiler for Short-Vector SIMD Architectures
    Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
  2. Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
    When Polyhedral Transformations Meet SIMD Code Generation
    Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013

2011 

  1. Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
    Data Layout Transformation for Stencil Computations on Short SIMD Architectures
    Proc. International Conference on Compiler Construction (CC), 2011

Prakash, A. 

2024 

  1. Sanil Rao, A. Prakash and Franz Franchetti
    Towards LibraryX: A Framework for Cross-Library Call Optimization
    Proc. High Performance Extreme Computing (HPEC), 2024

Prasanna, Viktor K. 

2001 

  1. Neungsoo Park and Viktor K. Prasanna
    Cache Conscious Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001

2000 

  1. Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
    Dynamic Data Layouts for Cache-conscious Factorization of DFT
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

Prodan, R. 

2003 

  1. T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
    On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
    Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003

Püschel, Markus 

2022 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    A Compiler for Sound Floating-Point Computations using Affine Arithmetic
    Proc. International Symposium on Code Generation and Optimization (CGO), 2022

2021 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    An Interval Compiler for Sound Floating Point Computations
    Proc. International Symposium on Code Generation and Optimization (CGO), 2021

2018 

  1. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
  2. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Staging for Generic Programming in Space and Time
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017

2016 

  1. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Structured Matrices
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
  2. Marcela Zuluaga, Andreas Krause and Markus Püschel
    e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
    Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
  3. Francois Serre and Markus Püschel
    Optimal Circuits for Streamed Linear Permutations using RAM
    Proc. FPGA, pp. 215-223, 2016
  4. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Streaming Sorting Networks
    ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016

2015 

  1. Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Embedded Processors
    Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015

2014 

  1. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
  2. Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
    Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
  3. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
  4. Benjamin Hess, Thomas Gross and Markus Püschel
    Automatic Locality-Friendly Interface Extension of Numerical Functions
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
  5. Victoria Caparrós Cabezas and Markus Püschel
    Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
    Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
  6. Jörn Schumacher and Markus Püschel
    High-performance sparse fast Fourier transforms
    Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014
  7. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Language Support for the Construction of High Performance Code Generators
    Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014

2013 

  1. Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
    Active Learning for Multi-Objective Optimization
    Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
  2. Marcela Zuluaga, Andreas Krause and Markus Püschel
    Multi-Objective Optimization for High-Level Synthesis
    Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
  3. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

2012 

  1. C. Angelopoulos, Franz Franchetti and Markus Püschel
    Automatic Generation of FFT Libraries for GPUs
    NVIDIA Research Summit at the GPU Technology Conference, 2012
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Computer Generation of Hardware for Linear Digital Signal Processing Transforms
    ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
  3. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Computer Generation of Streaming Sorting Networks
    Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
  4. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
  5. Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
    "Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
    Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012

2011 

  1. Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    Proc. International Conference on Supercomputing (ICS), 2011
  2. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
  3. Franz Franchetti and Markus Püschel
    Fast Fourier Transform
    in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
  4. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
  5. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011

2010 

  1. Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Efficient Software Viterbi Decoders
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
  2. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010
  3. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  4. C. Angelopoulos, Franz Franchetti and Markus Püschel
    DFT Transform on the Fermi (GTX480): Automatic Program Generation
    NVIDIA Research Summit at the GPU Technology Conference, 2010
  5. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
  6. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
  7. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  8. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  3. Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
    Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
  4. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Extreme Computing (HPEC), 2009
  5. Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
    Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
  6. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
  7. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  8. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  9. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  10. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  11. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
  12. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009
  13. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  14. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
  15. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009

2008 

  1. Markus Püschel and José M. F. Moura
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
    IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
  2. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Adaptive Libraries for Matrix-Multiplication
    Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
  3. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
    Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
  4. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
  5. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  6. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  7. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  8. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  9. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  10. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    How To Write Fast Numerical Code: A Small Introduction
    Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
  11. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  12. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008
  13. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008

2007 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Fast Fourier Transform on FPGA: Design Choices and Evaluation
    Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  4. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  5. Yevgen Voronenko and Markus Püschel
    Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
    IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
  6. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  7. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
  8. Franz Franchetti and Markus Püschel
    SIMD Vectorization of Non-Two-Power Sized FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
  9. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007

2006 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  3. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  4. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Extreme Computing (HPEC), 2006
  5. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006
  6. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  7. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
  8. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
  9. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Spiral: Generating Signal Processing Kernels for New Commodity Architectures
    Proc. EDGE Workshop, pp. D49-D50, 2006

2005 

  1. Roland Wunderlich, Markus Püschel and James C. Hoe
    Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
    Proc. High Performance Extreme Computing (HPEC), 2005
  2. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005
  3. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
  4. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
  5. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
  6. José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
    Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
  7. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
    Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2004
  2. Aca Gacic, Markus Püschel and José M. F. Moura
    Automatically Generated High-Performance Code for Discrete Wavelet Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
  3. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
  4. Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
    Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
  5. Yevgen Voronenko and Markus Püschel
    Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
  6. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
  7. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Extreme Computing (HPEC), 2004
  8. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004
  9. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2003 

  1. Markus Püschel
    Cooley-Tukey FFT like Algorithms for the DCT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
  2. Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
    Custom Reduction of Arithmetic in Linear DSP Transforms
    Proc. High Performance Extreme Computing (HPEC), 2003
  3. Aca Gacic, Markus Püschel and José M. F. Moura
    Fast Automatic Implementations of FIR Filters
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
  4. Franz Franchetti and Markus Püschel
    Short Vector Code Generation and Adaptation for DSP Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
  5. Franz Franchetti and Markus Püschel
    Short Vector Code Generation for the Discrete Fourier Transform
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
  6. Markus Püschel and José M. F. Moura
    SPIRAL: An Overview
    Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
  7. Markus Püschel and José M. F. Moura
    The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
    SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
  8. Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
    Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
    Proc. Design Automation Conference (DAC), pp. 496-501, 2003

2002 

  1. Markus Püschel, Sebastian Egner and Thomas Beth
    AREP
    in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002
  2. Franz Franchetti and Markus Püschel
    A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
  3. Markus Püschel and José M. F. Moura
    Generation and Manipulation of DSP Transform Algorithms
    Proc. Digital Signal Processing Workshop, 2002
  4. Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
    Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
    Proc. High Performance Extreme Computing (HPEC), 2002
  5. Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Short Vector SIMD Code Generation for DSP Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2002

2001 

  1. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  2. Sebastian Egner and Markus Püschel
    Automatic Generation of Fast Discrete Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001
  3. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
  4. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001

2000 

  1. Jeremy Johnson and Markus Püschel
    In Search of the Optimal Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

Rabaey, J. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Ramanujam, J. 

2013 

  1. Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
    A Stencil Compiler for Short-Vector SIMD Architectures
    Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013

2011 

  1. Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
    Data Layout Transformation for Stencil Computations on Short SIMD Architectures
    Proc. International Conference on Compiler Construction (CC), 2011

Rangaraj, Deepak 

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Rao, Sanil 

2024 

  1. Sanil Rao, A. Prakash and Franz Franchetti
    Towards LibraryX: A Framework for Cross-Library Call Optimization
    Proc. High Performance Extreme Computing (HPEC), 2024

2023 

  1. Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
    FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
    Proc. Supercomputing (SC), 2023

2022 

  1. J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
    Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
    Proc. IEEE High Performance Extreme Computing (HPEC), 2022
  2. H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
    ProtoX: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022

2020 

  1. Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
    GBTLX: A First Look
    Proc. IEEE High Performance Extreme Computing (HPEC), 2020

Rao, Varun 

2017 

  1. Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
    First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017

Re, C. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Reagen, B. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  3. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  4. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Reynwar, B. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  3. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  4. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Rimmel, Arpad 

2009 

  1. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009

Rivera, Joao 

2022 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    A Compiler for Sound Floating-Point Computations using Affine Arithmetic
    Proc. International Symposium on Code Generation and Optimization (CGO), 2022
  2. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

2021 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    An Interval Compiler for Sound Floating Point Computations
    Proc. International Symposium on Code Generation and Optimization (CGO), 2021

Rizzolo, Nicholas 

2005 

  1. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Nicholas Rizzolo and David Padua
    HiLO: High Level Optimization of FFTs
    Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004

Rollett, Anthony 

2015 

  1. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
    Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

2014 

  1. T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
    Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
    Proc. Materials Science & Technology, 2014
  2. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
    Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

Rompf, Tiark 

2017 

  1. Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Staging for Generic Programming in Space and Time
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017

2014 

  1. Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
    Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
  2. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Language Support for the Construction of High Performance Code Generators
    Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014

2013 

  1. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

Russell, F. 

2015 

  1. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
  2. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015

Rutenbar, Rob A. 

2003 

  1. Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
    Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
    Proc. Design Automation Conference (DAC), pp. 496-501, 2003

Ruzicka, Vit 

2018 

  1. Vit Ruzicka and Franz Franchetti
    Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
    Proc. IEEE High Performance Extreme Computing (HPEC), 2018

Sabry Aly, M. M. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Sadayappan, P. 

2013 

  1. Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
    A Stencil Compiler for Short-Vector SIMD Architectures
    Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
  2. Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
    When Polyhedral Transformations Meet SIMD Code Generation
    Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013

2011 

  1. Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
    Data Layout Transformation for Stencil Computations on Short SIMD Architectures
    Proc. International Conference on Compiler Construction (CC), 2011

Sadi, F. 

2019 

  1. F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
    Proc. MICRO, 2019

2018 

  1. F. Sadi
    Accelerating Sparse Matrix Kernels with Co-optimized Architecture
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
  2. F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
    Proc. High Performance Extreme Computing (HPEC), 2018

2017 

  1. F. Sadi, Lawrence Pileggi and Franz Franchetti
    Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017

2016 

  1. F. Sadi, Lawrence Pileggi and Franz Franchetti
    3D DRAM Based Application Specific Hardware Accelerator for SpMV
    High Performance Extreme Computing Conference (HPEC), 2016

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014
  2. F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
    Proc. IEEE High Performance Extreme Computing (HPEC), 2014

2013 

  1. Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
  2. Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
    Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013

Sandryhaila, Aliaksei 

2014 

  1. Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
    High Assurance SPIRAL
    Proc. SPIE, Proceedings of SPIE 2014, 2014

2007 

  1. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007

Schatz, Martin 

2020 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Multi-Dimensional DFTs
    SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020

2019 

  1. Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
    A Flexible Framework for Parallel Multi-Dimensional DFTs
    arXiv (Technical Report), 2019

Schmidt, A. G. 

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  3. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  4. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Schulz, M. 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

Schumacher, Jörn 

2014 

  1. Jörn Schumacher and Markus Püschel
    High-performance sparse fast Fourier transforms
    Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014

2013 

  1. Jörn Schumacher
    High Performance Sparse Fast Fourier Transform
    Master thesis, Computer Science, ETH Zurich, Switzerland, 2013

Sepiashvili, David 

2000 

  1. David Sepiashvili
    Performance Models and Search Methods for Optimal FFT Implementations
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000

Sergent, Guillaume 

2013 

  1. Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
    Active Learning for Multi-Objective Optimization
    Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013

Serre, Francois 

2016 

  1. Francois Serre and Markus Püschel
    Optimal Circuits for Streamed Linear Permutations using RAM
    Proc. FPGA, pp. 215-223, 2016

Sexton, J. C. 

2006 

  1. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006

2005 

  1. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005

Shacham, O. 

2012 

  1. Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
    Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012

Shen, Hao 

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

2006 

  1. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

Shenoy, Pranab 

2007 

  1. Pranab Shenoy
    Universal FFT Core Generator
    Master thesis, Computer Science, Drexel University, 2007

Shulaker, M. M. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Siddique, F. 

2024 

  1. T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
    Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
    Proc. High Performance Extreme Computing (HPEC), 2024

Singer, Bryan 

2005 

  1. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2002 

  1. Bryan Singer and Manuela Veloso
    Automating the Modeling and Optimization of the Performance of Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
  2. Bryan Singer and Manuela Veloso
    Learning to Construct Fast Signal Processing Implementations
    Journal of Machine Learning Research, special issue on ``the Eighteenth International Conference on Machine Learning (ICML 2001)'', Vol. 3, pp. 887-919, 2002

2001 

  1. Bryan Singer
    Automating the Modeling and Optimization of the Performance of Signal Processing Algorithms
    PhD. thesis, Computer Science, Carnegie Mellon University, 2001
  2. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
  3. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001
  4. Bryan Singer and Manuela Veloso
    Learning to Generate Fast Signal Processing Implementations
    Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
  5. Bryan Singer and Manuela Veloso
    Stochastic Search for Signal Processing Algorithm Optimization
    Proc. Supercomputing (SC), pp. 22, 2001

2000 

  1. Bryan Singer and Manuela Veloso
    Learning to Predict Performance from Formula Modeling and Training Data
    Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000

Skadron, K. 

2024 

  1. T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
    Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
    Proc. High Performance Extreme Computing (HPEC), 2024

Skylaris, C-K. 

2015 

  1. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
  2. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015

Soni, D. 

2023 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  3. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Sozeau, Matthieu 

2019 

  1. Vadim Zaliva and Matthieu Sozeau
    Reification of Shallow-Embedded DSLs in Coq with Automated Verification
    Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019

Spampinato, Daniele G. 

2020 

  1. Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
    FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
    Proc. IEEE International Conference on Data Engineering (ICDE), 2020
  2. Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
    FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
    North East Database Day (NEDB), 2020, Poster
  3. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster

2019 

  1. Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
    Design and Specification of Large-scale Simulations for GPUs using FFTX
    Supercomputing, 2019
  2. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019
  3. Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
    FFTX for Micromechanical Stress-Strain Analysis
    IEEE High Performance Extreme Computing Conference (HPEC), 2019

2018 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
  2. Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
    Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
    Proc. High Performance Extreme Computing (HPEC), 2018
  3. Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
    Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
    Proc. IEEE High Performance Extreme Computing (HPEC), 2018
  4. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
  5. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Daniele G. Spampinato
    A Linear Algebra Compiler for Small Problem Sizes
    PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017

2016 

  1. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Structured Matrices
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016

2015 

  1. Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Embedded Processors
    Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015

2014 

  1. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
  2. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014

Sridhar, U. 

2018 

  1. Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
    Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
    Proc. High Performance Extreme Computing (HPEC), 2018

Stein, C. 

2015 

  1. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
    Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

2014 

  1. T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
    Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
    Proc. Materials Science & Technology, 2014
  2. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
    Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

Steinmann, Ruedi 

2014 

  1. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014

Stock, Kevin 

2013 

  1. Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
    When Polyhedral Transformations Meet SIMD Code Generation
    Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013

2011 

  1. Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
    Data Layout Transformation for Stencil Computations on Short SIMD Architectures
    Proc. International Conference on Compiler Construction (CC), 2011

Stojanov, Alen 

2014 

  1. Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
    Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
  2. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Language Support for the Construction of High Performance Code Generators
    Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014

2013 

  1. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

Sumbul, H. E. 

2015 

  1. H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
    Proc. Design Automation Conference (DAC), 2015

2014 

  1. K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
    Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014

2013 

  1. Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
  2. Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
    Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
  3. H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
    Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
    Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013

Sun, D. 

2023 

  1. D. Sun, N. Zhang and Franz Franchetti
    Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

Suter, R. 

2015 

  1. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
    Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

2014 

  1. T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
    Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
    submitted for publication

Sweeney, Joe 

2019 

  1. F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
    Proc. MICRO, 2019

2018 

  1. F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
    Proc. High Performance Extreme Computing (HPEC), 2018

Takahashi, Daisuke 

2021 

  1. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    Adaptation of A64 Scalable Vector Extension for Spiral
    Proc. High Performance Computing Conference (HPCC), Japan, 2021
  2. N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
    An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
    Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021

2020 

  1. Daisuke Takahashi and Franz Franchetti
    FFTE on SVE: SPIRAL-Generated Kernels
    Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020

Tang, E. 

2024 

  1. E. Tang, James C. Hoe and Franz Franchetti
    Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
    Proc. SRC TECHCON, 2024
  2. T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
    Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
    Proc. High Performance Extreme Computing (HPEC), 2024

2022 

  1. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022
  2. E. Tang and Franz Franchetti
    Magic Memory: A Programming Model For Big Data Analytics
    Proc. High Performance Extreme Computing (HPEC), 2022

Tang, L. 

2024 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
    A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
    Proc. Hot Chips Symposium (HCS), 2024
  2. Y. Eum, N. Zhang, L. Tang and Franz Franchetti
    Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2024

2022 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
    A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022
  2. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022
  3. J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
    LIMA: Hardware for FFT based Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

Tazzoli, A. 

2013 

  1. H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
    Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
    Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013

Telgarsky, Marek 

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

2006 

  1. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
  2. Marek Telgarsky, James C. Hoe and José M. F. Moura
    Spiral: Joint Runtime and Energy Optimization of Linear Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006

Thoma, Cory 

2013 

  1. Cory Thoma, Tao Cui and Franz Franchetti
    Privacy Preserving Smart Meter System Based Retail Level Electricity Market
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013

2012 

  1. Cory Thoma, Tao Cui and Franz Franchetti
    Secure Multiparty Computation Based Privacy Preserving Smart Metering System
    Proc. North American Power Symposium (NAPS), pp. 1-6, 2012

Tummeltshammer, Peter 

2007 

  1. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007

2004 

  1. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004

Turner, E. L. 

2012 

  1. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
    Journal of Signal Processing Systems, 2012
  2. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012

2011 

  1. Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
    Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
    Proc. High Performance Extreme Computing (HPEC), 2011

Ueberhuber, Christoph W. 

2006 

  1. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  2. F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
    Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
    Proc. Supercomputing (SC), 2006
  3. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

2005 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
    Efficient Utilization of SIMD Extensions
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
  2. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005
  3. Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
    Vectorization Techniques for the BlueGene/L Double FPU
    IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005

2004 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004

2003 

  1. Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
    SIMD Vectorization of Straight Line Code
    Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003

2002 

  1. Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
    FFT Kernels with FMA Utilization
    Proc. APLIMAT Conference, pp. 333-339, 2002
  2. Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Short Vector SIMD Code Generation for DSP Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2002

2001 

  1. Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
    Architecture Independent Short Vector FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001

Vachranukunkiet, Petya 

2008 

  1. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

Vaidyanathan, K. 

2015 

  1. H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
    Proc. Design Automation Conference (DAC), 2015

2014 

  1. K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
    Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014

2012 

  1. Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
    Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012

Van Straalen, B. 

2023 

  1. B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
    FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
    to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023

2022 

  1. H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
    ProtoX: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022

2020 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solver Libraries for Exascale: FFTX and SpectralPack
    Exascale Computing Project (ECP) Annual Meeting, 2020, Poster

2019 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFT and Solvers for Exascale: FFTX and SpectralPACK
    Exascale Computing Project (ECP) Annual Meeting, 2019

2018 

  1. Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
    FFTX and SpectralPack: A First Look
    Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018

Veloso, Manuela 

2017 

  1. Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
    High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
    IEEE Control Systems Magazine, 2017

2005 

  1. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2002 

  1. Bryan Singer and Manuela Veloso
    Automating the Modeling and Optimization of the Performance of Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
  2. Bryan Singer and Manuela Veloso
    Learning to Construct Fast Signal Processing Implementations
    Journal of Machine Learning Research, special issue on ``the Eighteenth International Conference on Machine Learning (ICML 2001)'', Vol. 3, pp. 887-919, 2002

2001 

  1. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001
  3. Bryan Singer and Manuela Veloso
    Learning to Generate Fast Signal Processing Implementations
    Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
  4. Bryan Singer and Manuela Veloso
    Stochastic Search for Signal Processing Algorithm Optimization
    Proc. Supercomputing (SC), pp. 22, 2001

2000 

  1. Bryan Singer and Manuela Veloso
    Learning to Predict Performance from Formula Modeling and Training Data
    Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

Veras, Richard 

2018 

  1. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

2017 

  1. Richard Veras and Franz Franchetti
    A Scale-free Structure for Real World Networks
    Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
  2. Richard Veras
    The Automatic Generation of High-Performance Graph Analytic Code using Spiral
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017

2016 

  1. Richard Veras, Tze-Meng Low and Franz Franchetti
    A Scale-Free Structure for Power-Law Graphs
    Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
  2. Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
    Compilers, Hands-Off My Hands-On Optimizations
    Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016

2013 

  1. Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
    A Stencil Compiler for Short-Vector SIMD Architectures
    Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
  2. Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
    When Polyhedral Transformations Meet SIMD Code Generation
    Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013

Vetter, J. 

2023 

  1. Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
    FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
    Proc. Supercomputing (SC), 2023

Volk, Anja 

2017 

  1. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
    Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
  2. H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
    Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
    Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017

Voronenko, Yevgen 

2012 

  1. Franz Franchetti, Yevgen Voronenko and G. Almasi
    Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
    Proc. High Performance Computing for Computational Science (VECPAR), 2012

2011 

  1. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
  2. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011

2010 

  1. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010
  2. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
  3. Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
    Spiral-Generated Modular FFT Algorithms
    Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010

2009 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  2. Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
    Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
  3. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
  4. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  5. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009

2008 

  1. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Adaptive Libraries for Matrix-Multiplication
    Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
  2. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
    Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
  3. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  4. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  5. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
  6. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008
  7. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008

2007 

  1. Yevgen Voronenko and Markus Püschel
    Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
    IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
  2. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007

2006 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  3. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  4. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
  5. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Spiral: Generating Signal Processing Kernels for New Commodity Architectures
    Proc. EDGE Workshop, pp. D49-D50, 2006

2005 

  1. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
  3. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
    Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2004
  2. Yevgen Voronenko and Markus Püschel
    Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004

Włodarczyk, Tomasz Wiktor 

2008 

  1. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

Watts, Philip M. 

2010 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Weng, Yang 

2013 

  1. Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
    An Information-Theoretic Approach to PMU Placement in Electric Power Systems
    IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013

Wilkinson, K. 

2015 

  1. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
  2. Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
    Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015

Wong, H.-S. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Wu, T. F. 

2015 

  1. M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
    Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
    Computer, Vol. 48, No. 12, pp. 24-33, 2015

Wunderlich, Roland 

2005 

  1. Roland Wunderlich, Markus Püschel and James C. Hoe
    Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
    Proc. High Performance Extreme Computing (HPEC), 2005

Wurzinger, Peter 

2004 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004

Xie, Yuzhen 

2010 

  1. Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
    Spiral-Generated Modular FFT Algorithms
    Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010

Xiong, Jianxin 

2005 

  1. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

2004 

  1. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2001 

  1. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  2. Jianxin Xiong
    Automatic Optimization of DSP Algorithms
    PhD. thesis, Computer Science, University of Illinois at Urbana-Champaign, 2001, Also as Technical Report UIUCDCS-R-2001-224, University of Illinois
  3. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001
  4. Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
    SPL: A Language and Compiler for DSP Algorithms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001

Xu, G. 

2024 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
    A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
    Proc. Hot Chips Symposium (HCS), 2024

2023 

  1. G. Xu, Franz Franchetti and James C. Hoe
    An Approach to Generating Customized Load-Store Architectures
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023

2022 

  1. L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
    A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
    Proc. High Performance Extreme Computing (HPEC), 2022

2017 

  1. G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
    Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
    High Performance Extreme Computing Conference (HPEC), 2017

2014 

  1. Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
    3D-Stacked Memory-Side Acceleration: Accelerator and System Design
    Proc. Workshop on Near Data Processing (WONDP), 2014

Xu, Xu 

2007 

  1. Jeremy Johnson and Xu Xu
    Generating Symmetric DFTs and Equivariant FFT Algorithms
    Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007

2003 

  1. Xu Xu
    A Recursive Implementation of the Dimensionless FFT
    Master thesis, Computer Science, Drexel University, 2003
  2. Jeremy Johnson and Xu Xu
    A Recursive Implementation of the Dimensionless FFT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003

Yang, C. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Yang, R. 

2014 

  1. Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
    Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
    Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014

Yates, R. K. 

2005 

  1. F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
    Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
    Proc. Supercomputing (SC), 2005

Yoon, Irene 

2021 

  1. Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
    Modular, Compositional, and Executable Formal Semantics for LLVMIR
    Proc. International Conference on Functional Programming (ICFP), 2021

Yu, Qian 

2008 

  1. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

Yu, W. 

2012 

  1. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Highly Efficient Performance Portable Tracking of Evolving Surfaces
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012

2011 

  1. W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
    Performance Portable Tracking of Evolving Surfaces
    Proc. High Performance Extreme Computing (HPEC), 2011

2010 

  1. W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
    Fast and Robust Active Contours for Image Segmentation
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  2. W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
    Fast Bilateral Filtering By Adapting Block Size
    Proc. IEEE International Conference on Image Processing (ICIP), 2010
  3. W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
    High Performance Stereo Vision Designed for Massively Data Parallel Platforms
    IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
  4. W. Yu
    Performance Portable Tracking of Evolving Surfaces
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010

Zaichuk, Ilia 

2021 

  1. Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
    Modular, Compositional, and Executable Formal Semantics for LLVMIR
    Proc. International Conference on Functional Programming (ICFP), 2021

2020 

  1. Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
    Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
    Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020

Zakowski, Yannick 

2021 

  1. Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
    Modular, Compositional, and Executable Formal Semantics for LLVMIR
    Proc. International Conference on Functional Programming (ICFP), 2021

Zalewski, M. 

2016 

  1. J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
    Mathematical Foundations of the GraphBLAS
    Proc. High Performance Extreme Computing (HPEC), 2016

Zaliva, Vadim 

2021 

  1. Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
    Modular, Compositional, and Executable Formal Semantics for LLVMIR
    Proc. International Conference on Functional Programming (ICFP), 2021

2020 

  1. Vadim Zaliva
    HELIX: From Math to Verified Code
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
  2. Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
    Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
    Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020

2019 

  1. Vadim Zaliva and Matthieu Sozeau
    Reification of Shallow-Embedded DSLs in Coq with Automated Verification
    Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019

2018 

  1. Vadim Zaliva and Franz Franchetti
    HELIX: A Case Study of a Formal Verification of High Performance Program Generation
    Proc. Workshop on Functional High Performance Computing (FHPC), 2018

2014 

  1. Vadim Zaliva and Franz Franchetti
    Barometric and GPS Altitude Sensor Fusion
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014

Zdancewic, Steve 

2021 

  1. Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
    Modular, Compositional, and Executable Formal Semantics for LLVMIR
    Proc. International Conference on Functional Programming (ICFP), 2021

Zelinski, Adam C. 

2004 

  1. Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
    Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
  2. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004

2003 

  1. Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
    Custom Reduction of Arithmetic in Linear DSP Transforms
    Proc. High Performance Extreme Computing (HPEC), 2003

Zhang, B. 

2023 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Zhang, Jiyuan 

2020 

  1. Jiyuan Zhang
    Accelerating the Motifs of Machine Learning Applications on Modern Processors
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
  2. Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
    FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
    Proc. IEEE International Conference on Data Engineering (ICDE), 2020
  3. Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
    FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
    North East Database Day (NEDB), 2020, Poster

2018 

  1. Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
    High Performance Zero-Memory Overhead Direct Convolutions
    Proc. International Conference on Machine Learning (ICML), 2018
  2. Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
    Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
    Proc. IEEE High Performance Extreme Computing (HPEC), 2018

2015 

  1. Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
    A 3D-Stacked Memory Manycore Stencil Accelerator System
    Proc. Workshop on Near Data Processing (WONDP), 2015

Zhang, N. 

2024 

  1. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
  2. Y. Eum, N. Zhang, L. Tang and Franz Franchetti
    Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2024

2023 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. N. Zhang and Franz Franchetti
    Generating Number Theoretic Transforms for Multi-Word Integer Data Types
    Proc. International Symposium on Code Generation and Optimization (CGO), 2023
  3. D. Sun, N. Zhang and Franz Franchetti
    Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  4. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  5. D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring-Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
  6. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023
  7. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

2022 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Zhang, T. 

2024 

  1. T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
    Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
    Proc. High Performance Extreme Computing (HPEC), 2024

Zhu, N. 

2022 

  1. Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
    Interval Arithmetic-based FFT for Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022

Zhu, Qiuling 

2015 

  1. H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
    Proc. Design Automation Conference (DAC), 2015

2014 

  1. K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
    Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
    Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014

2013 

  1. Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
  2. Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
    Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
    Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
  3. Qiuling Zhu
    Application Specific Logic in Memory
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
  4. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    A Smart Memory Accelerated Computed Tomography Parallel Backprojection
    in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013

2012 

  1. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
    Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
  2. Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
    Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
    Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
  3. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
    Journal of Signal Processing Systems, 2012
  4. Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
    Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
  5. Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
    Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
    Proc. SRC TECHCON, 2012

2011 

  1. Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
    Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
    Proc. High Performance Extreme Computing (HPEC), 2011

Zuluaga, Marcela 

2016 

  1. Marcela Zuluaga, Andreas Krause and Markus Püschel
    e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
    Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
  2. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Streaming Sorting Networks
    ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016

2013 

  1. Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
    Active Learning for Multi-Objective Optimization
    Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
  2. Marcela Zuluaga, Andreas Krause and Markus Püschel
    Multi-Objective Optimization for High-Level Synthesis
    Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013

2012 

  1. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Computer Generation of Streaming Sorting Networks
    Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
  2. Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
    "Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
    Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012

Zuo, Z. 

2022 

  1. J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
    LIMA: Hardware for FFT based Large Integer Multiplication
    Proc. High Performance Extreme Computing (HPEC), 2022
Publication interface designed and implemented by Patra Pantupat, Aliaksei Sandryhaila, and Markus Püschel
Electrical and Computer Engineering, Carnegie Mellon University, 2007