Publications
Filtered as:
all types
- all years
- all authors
- all keywords
Sorted by:
author - year
Corresponding
bibtex list
Aaltonen, P.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Ahmad, Mohammad
2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
Akin, Berkin
2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
2015
- Berkin Akin
A Formal Approach to Memory Access Optimization: Data Layout, Reorganization, and Near-Data Processing
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architectur (ISCA), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
Journal of Signal Processing Systems, 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
Alachiotis, N.
2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
Al Badawi, A.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Almasi, G.
2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
Andrews, Michael
2008
- Jeremy Johnson and Michael Andrews
Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
2007
- Jeremy Johnson and Michael Andrews
Performance Analysis of a Family of WHT Algorithms
Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium , IEEE, pp. 450, 2007
Angelopoulos, C.
2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
2010
- C. Angelopoulos, Franz Franchetti and Markus Püschel
DFT Transform on the Fermi (GTX480): Automatic Program Generation
NVIDIA Research Summit at the GPU Technology Conference, 2010
Arbatov, Volodymyr
2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
Asheghi, M.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Austel, V.
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
Bader, D.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Balog, Mike
2002
- Mike Balog
A Flexible Framework for Implementing FFT Processors
Master thesis, Electrical and Computer Engineering, Drexel University, 2002
Beck, Calvin
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Becker, Peter
2001
- Peter Becker
A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
Master thesis, Electrical and Computer Engineering, Drexel University, 2001
Benlachtar, Yannis
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Berger, Christian
2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
Beth, Thomas
2002
- Markus Püschel, Sebastian Egner and Thomas Beth
AREP
in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002
Bientinesi, Paolo
2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
Blanco, Mark
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
Bokor, J.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Bolten, Matthias
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
Bondalapati, Kiran
2000
- Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
Dynamic Data Layouts for Cache-conscious Factorization of DFT
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
Bonelli, Andreas
2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
Bouziane, Rachid
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Breitzman, Anthony F.
2004
- Anthony F. Breitzman and Jeremy Johnson
Automatic Derivation and Implementation of Fast Convolution Algorithms
Journal of High Performance Computing and Applications, special issue on ``Computer Algebra and Signal Processing'', Vol. 37, No. 2, pp. 157-186, 2004
2003
- Anthony F. Breitzman
Automatic Derivation and Implementation of Fast Convolution Algorithms
PhD. thesis, Computer Science, Drexel University, 2003
Brinich, P.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Broderick, P.
2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Brouwer, Paul
2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
Buluc, A.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Cai, M.
2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Canida, K.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Canning, A.
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
Caparrós Cabezas, Victoria
2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
- Victoria Caparrós Cabezas and Markus Püschel
Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
Cartolano, Anthony
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Chagnon, Tim
2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
Chang, Lawrence C.
2004
- Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2004
Chang, Y.-J.
2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
Chellappa, Srinivas
2010
- Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Efficient Software Viterbi Decoders
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
How To Write Fast Numerical Code: A Small Introduction
Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
Chen, Kang
2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Jeremy Johnson and Kang Chen
A Self-Adapting Distributed Memory Package for Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
2002
- Kang Chen
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Master thesis, Computer Science, Drexel University, 2002
- Kang Chen and Jeremy Johnson
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
Chen, S.
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
Chen, Tianshi
2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
Chen, Tsuhan
2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
2003
- Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
Proc. Design Automation Conference (DAC), pp. 496-501, 2003
Chen, Y.
2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
Cheng, Z.
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Chuangsuwanich, Ekapol
2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
Colella, P.
2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
Cousins, D. B.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Cui, Tao
2014
- Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
- Tao Cui and Franz Franchetti
A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Tao Cui
Power System Probabilistic and Security Analysis Using Commodity High Performance Computing Systems
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
2011
- Tao Cui and Franz Franchetti
A Monte Carlo Framework for Probabilistic Distribution Power Flow
Carnegie Mellon Conference on the Electricity Industry, 2011
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
- Tao Cui and Franz Franchetti
Autotuning a Random Walk Boolean Satisfiability Solver
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011
2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
D'Alberto, Paolo
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Paolo D'Alberto, Markus Püschel and Franz Franchetti
Performance/Energy Optimization of DSP Transforms on the XScale Processor
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
de Mesmay, Frédéric
2010
- Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Efficient Software Viterbi Decoders
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
Deri, Joya
2016
- Joya Deri, Franz Franchetti and José M. F. Moura
Big Data Computation of Taxi Movement in New York City
Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016
de Supinski, B. R.
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
Dongarra, Jack
2005
- José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
Draeger, E. W.
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
Duff, B.
2014
- B. Duff, J. Larkin, M. Franusich and Franz Franchetti
Automatic Generation of 3-D FFTs
submitted for publication
Ebel, A.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Egner, Sebastian
2002
- Markus Püschel, Sebastian Egner and Thomas Beth
AREP
in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002
2001
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- Sebastian Egner and Markus Püschel
Automatic Generation of Fast Discrete Signal Transforms
IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001
Eum, Y.
2024
- Y. Eum, N. Zhang, L. Tang and Franz Franchetti
Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2024
Fabregat-Traver, Diego
2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
Fahringer, T.
2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
Fang, Fang
2003
- Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
Proc. Design Automation Conference (DAC), pp. 496-501, 2003
2002
- Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
Proc. High Performance Extreme Computing (HPEC), 2002
Feeder, G.
2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
Fraguela, Basilio B.
2009
- Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
Franchetti, Franz
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
- Y. Eum, N. Zhang, L. Tang and Franz Franchetti
Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2023
- G. Xu, Franz Franchetti and James C. Hoe
An Approach to Generating Customized Load-Store Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- N. Zhang and Franz Franchetti
Generating Number Theoretic Transforms for Multi-Word Integer Data Types
Proc. International Symposium on Code Generation and Optimization (CGO), 2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
2022
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- P. Oostema and Franz Franchetti
Approximating Manifolds and Geodesics with Curved Surfaces
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- E. Tang and Franz Franchetti
Magic Memory: A Programming Model For Big Data Analytics
Proc. High Performance Extreme Computing (HPEC), 2022
- P. Oostema and Franz Franchetti
On Manifold Graph Embedding
Proc. High Performance Extreme Computing (HPEC), 2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
- Joao Rivera, Franz Franchetti and Markus Püschel
An Interval Compiler for Sound Floating Point Computations
Proc. International Symposium on Code Generation and Optimization (CGO), 2021
- P. Oostema and Franz Franchetti
Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Multi-Dimensional DFTs
SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
- Daisuke Takahashi and Franz Franchetti
FFTE on SVE: SPIRAL-Generated Kernels
Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
2019
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
FFTX for Micromechanical Stress-Strain Analysis
IEEE High Performance Extreme Computing Conference (HPEC), 2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
- Vit Ruzicka and Franz Franchetti
Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Vadim Zaliva and Franz Franchetti
HELIX: A Case Study of a Formal Verification of High Performance Program Generation
Proc. Workshop on Functional High Performance Computing (FHPC), 2018
- Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
High Performance Zero-Memory Overhead Direct Convolutions
Proc. International Conference on Machine Learning (ICML), 2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Richard Veras and Franz Franchetti
A Scale-free Structure for Real World Networks
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
- Tze-Meng Low and Franz Franchetti
High Assurance Code Generation for Cyber-Physical Systems
Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
- Thom Popovici, Franz Franchetti and Tze-Meng Low
Mixed Data Layout Kernels for Vectorized Complex Arithmetic
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
2016
- F. Sadi, Lawrence Pileggi and Franz Franchetti
3D DRAM Based Application Specific Hardware Accelerator for SpMV
High Performance Extreme Computing Conference (HPEC), 2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
- Richard Veras, Tze-Meng Low and Franz Franchetti
A Scale-Free Structure for Power-Law Graphs
Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
- Joya Deri, Franz Franchetti and José M. F. Moura
Big Data Computation of Taxi Movement in New York City
Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016
- Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
Compilers, Hands-Off My Hands-On Optimizations
Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
2015
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- H. V. Koops and Franz Franchetti
An Ensemble Technique for Estimating Vehicle Speed and Gear Position from Acoustic Data
Proc. International Conference on Digital Signal Processing (DSP), 2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architectur (ISCA), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
Journal of Signal Processing Systems, 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- B. Duff, J. Larkin, M. Franusich and Franz Franchetti
Automatic Generation of 3-D FFTs
submitted for publication
- Vadim Zaliva and Franz Franchetti
Barometric and GPS Altitude Sensor Fusion
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
High Assurance SPIRAL
Proc. SPIE, Proceedings of SPIE 2014, 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
- Tao Cui and Franz Franchetti
A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
2011
- Tao Cui and Franz Franchetti
A Monte Carlo Framework for Probabilistic Distribution Power Flow
Carnegie Mellon Conference on the Electricity Industry, 2011
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
- Tao Cui and Franz Franchetti
Autotuning a Random Walk Boolean Satisfiability Solver
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
- Franz Franchetti and Markus Püschel
Fast Fourier Transform
in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Efficient Software Viterbi Decoders
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
- C. Angelopoulos, Franz Franchetti and Markus Püschel
DFT Transform on the Fermi (GTX480): Automatic Program Generation
NVIDIA Research Summit at the GPU Technology Conference, 2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
How To Write Fast Numerical Code: A Small Introduction
Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Fast Fourier Transform on FPGA: Design Choices and Evaluation
Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Paolo D'Alberto, Markus Püschel and Franz Franchetti
Performance/Energy Optimization of DSP Transforms on the XScale Processor
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
- Franz Franchetti and Markus Püschel
SIMD Vectorization of Non-Two-Power Sized FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Sung-Chul Han, Franz Franchetti and Markus Püschel
Program Generation for the All-Pairs Shortest Path Problem
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Spiral: Generating Signal Processing Kernels for New Commodity Architectures
Proc. EDGE Workshop, pp. D49-D50, 2006
2005
- Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
Efficient Utilization of SIMD Extensions
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
- Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
Vectorization Techniques for the BlueGene/L Double FPU
IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005
2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
2003
- Franz Franchetti
A Portable Short Vector Version of FFTW
Proc. IMACS Symposium on Mathematical Modelling (MATHMOD), Vol. 2, pp. 1539-1548, 2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
- Franz Franchetti
Performance Portable Short Vector Transforms
PhD. thesis, Vienna University of Technology, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation and Adaptation for DSP Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation for the Discrete Fourier Transform
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
- Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
SIMD Vectorization of Straight Line Code
Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
- Franz Franchetti
Top Performance in Signal Processing
Proc. International Workshop on Numerical and Symbolic Scientific Computing, 2003
2002
- Franz Franchetti and Markus Püschel
A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
- Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
FFT Kernels with FMA Utilization
Proc. APLIMAT Conference, pp. 333-339, 2002
- Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Short Vector SIMD Code Generation for DSP Algorithms
Proc. High Performance Extreme Computing (HPEC), 2002
2001
- Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
Architecture Independent Short Vector FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001
2000
- Franz Franchetti
Short Vector FFTs
Master thesis, Vienna University of Technology, 2000
Franchetti, Yoko
2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
Franusich, M.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
2014
- B. Duff, J. Larkin, M. Franusich and Franz Franchetti
Automatic Generation of 3-D FFTs
submitted for publication
French, M.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Fu, S.
2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
Furis, Mihai
2005
- Mihai Furis, Pawel Hitczenko and Jeremy Johnson
Cache Miss Analysis of WHT Algorithms
Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
2003
- Mihai Furis
Cache Miss Analysis of Walsh-Hadamard Transform Algorithms
Master thesis, Computer Science, Drexel University, 2003
Gacic, Aca
2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Aca Gacic, Markus Püschel and José M. F. Moura
Automatically Generated High-Performance Code for Discrete Wavelet Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
- Aca Gacic
Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004
2003
- Aca Gacic, Markus Püschel and José M. F. Moura
Fast Automatic Implementations of FIR Filters
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
Gamil, H.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Gao, M.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Garg, Kashish
2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
Garzarán, María J.
2005
- Xiaoming Li, María J. Garzarán and David Padua
Optimizing Sorting with Genetic Algorithm
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
2004
- Xiaoming Li, María J. Garzarán and David Padua
A Dynamically Tuned Sorting Library
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
Geissler, M.
2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
Gilbert, J.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Glick, Madeleine
2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Gong, Z.
2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Goodson, K. E.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Gross, Thomas
2014
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
Gui, Liangyan
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Gunnels, J. A.
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
Guo, Qi
2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
2015
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
Gygi, F.
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
Haentjens, Gavin
2000
- Gavin Haentjens
An Investigation of Recursive FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
Han, Sung-Chul
2007
- Sung-Chul Han
A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007
2006
- Sung-Chul Han, Franz Franchetti and Markus Püschel
Program Generation for the All-Pairs Shortest Path Problem
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
Harisrikanth, K.
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
Henretty, Tom
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
Hess, Benjamin
2014
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
2013
- Benjamin Hess
Automatic Refactoring: Locality Friendly Interface Enhancements for Numerical Functions
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
Hess, Benjamin
2014
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
2013
- Benjamin Hess
Automatic Refactoring: Locality Friendly Interface Enhancements for Numerical Functions
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
Heyman, B.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
Hills, G.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Hitczenko, Pawel
2006
- Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
Theoretical Computer Science, Vol. 352, pp. 8-30, 2006
2005
- Mihai Furis, Pawel Hitczenko and Jeremy Johnson
Cache Miss Analysis of WHT Algorithms
Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
Hoe, James C.
2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
2023
- G. Xu, Franz Franchetti and James C. Hoe
An Approach to Generating Customized Load-Store Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architectur (ISCA), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
Journal of Signal Processing Systems, 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
- Markus Püschel, Peter A. Milder and James C. Hoe
Permuting Streaming Data Using RAMs
Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Fast Fourier Transform on FPGA: Design Choices and Evaluation
Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
- Marek Telgarsky, James C. Hoe and José M. F. Moura
Spiral: Joint Runtime and Energy Optimization of Linear Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
2005
- Roland Wunderlich, Markus Püschel and James C. Hoe
Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
Proc. High Performance Extreme Computing (HPEC), 2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
2004
- Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
- Markus Püschel, Adam C. Zelinski and James C. Hoe
Custom-Optimized Multiplierless Implementations of DSP Algorithms
Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
- Grace Nordin, James C. Hoe and Markus Püschel
Discrete Fourier Transform IP Generator
Proc. High Performance Extreme Computing (HPEC), 2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
2003
- Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
Custom Reduction of Arithmetic in Linear DSP Transforms
Proc. High Performance Extreme Computing (HPEC), 2003
2002
- Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
Proc. High Performance Extreme Computing (HPEC), 2002
Hong, Inpyo
2004
- Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2004
Horowitz, M.
2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
Huang, Hung-Jen
2006
- Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
Theoretical Computer Science, Vol. 352, pp. 8-30, 2006
2002
- Hung-Jen Huang
Performance Analysis of an Adaptive Algorithm for the Walsh-Hadamard Transform
Master thesis, Computer Science, Drexel University, 2002
Hug, Gabriela
2014
- Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
Hutchison, D.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Ilic, Marija
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
Jacob, A.
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Jaiswal, A.
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Johnson, Jeremy
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
2014
- Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
High Assurance SPIRAL
Proc. SPIE, Proceedings of SPIE 2014, 2014
- Lingchuan Meng and Jeremy Johnson
Towards parallel general-size library generation for polynomial multiplication
ACM Communications in Computer Algebra, 2014
2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Jeremy Johnson and Michael Andrews
Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Jeremy Johnson and Xu Xu
Generating Symmetric DFTs and Equivariant FFT Algorithms
Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
- Jeremy Johnson and Michael Andrews
Performance Analysis of a Family of WHT Algorithms
Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium , IEEE, pp. 450, 2007
2006
- Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
Theoretical Computer Science, Vol. 352, pp. 8-30, 2006
2005
- Mihai Furis, Pawel Hitczenko and Jeremy Johnson
Cache Miss Analysis of WHT Algorithms
Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Jeremy Johnson and Kang Chen
A Self-Adapting Distributed Memory Package for Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
- Anthony F. Breitzman and Jeremy Johnson
Automatic Derivation and Implementation of Fast Convolution Algorithms
Journal of High Performance Computing and Applications, special issue on ``Computer Algebra and Signal Processing'', Vol. 37, No. 2, pp. 157-186, 2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2003
- Jeremy Johnson and Xu Xu
A Recursive Implementation of the Dimensionless FFT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003
2002
- Kang Chen and Jeremy Johnson
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
2001
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
- Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
SPL: A Language and Compiler for DSP Algorithms
Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
- Jeremy Johnson and Markus Püschel
In Search of the Optimal Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Johnson, Robert W.
2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
- Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
SPL: A Language and Compiler for DSP Algorithms
Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Jones, Douglas F.
2010
- Douglas F. Jones
Data Pump Architecture Simulator and Performance Model
Master thesis, Computer Science, Drexel University, 2010
Kaltenberger, F.
2002
- Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
FFT Kernels with FMA Utilization
Proc. APLIMAT Conference, pp. 333-339, 2002
Kang, Dongsoo
2000
- Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
Dynamic Data Layouts for Cache-conscious Factorization of DFT
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
Kar, Soummya
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Karner, H.
2001
- Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
Architecture Independent Short Vector FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001
Katagiri, T.
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
Kelly, P. H. J.
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
Kepner, J.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Killey, Robert I.
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Kim, Kyungjoo
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
Kim, Munsung (Bill)
2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
Kitai, N.
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
Kong, Martin
2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
Koops, H. V.
2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
2015
- H. V. Koops and Franz Franchetti
An Ensemble Technique for Estimating Vehicle Speed and Gear Position from Acoustic Data
Proc. International Conference on Digital Signal Processing (DSP), 2015
Koutsoyannis, Robert
2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Kovacevic, Jelena
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
Kozyrakis, C.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Kral, Stefan
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
2005
- Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
Efficient Utilization of SIMD Extensions
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
- Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
Vectorization Techniques for the BlueGene/L Double FPU
IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005
2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
2003
- Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
SIMD Vectorization of Straight Line Code
Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
2001
- Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
Architecture Independent Short Vector FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001
Krause, Andreas
2016
- Marcela Zuluaga, Andreas Krause and Markus Püschel
e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
2013
- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
Active Learning for Multi-Objective Optimization
Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
- Marcela Zuluaga, Andreas Krause and Markus Püschel
Multi-Objective Optimization for High-Level Synthesis
Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
Kulkarni, Anuva
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
FFTX for Micromechanical Stress-Strain Analysis
IEEE High Performance Extreme Computing Conference (HPEC), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
Kumar, M.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Kumhom, Pinit
2001
- Pinit Kumhom
Design, Optimization, and Implementation of a Universal FFT Processor
PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001
2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
Kutuluru, A.
2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
Kyrtatas, Nikolaos
2015
- Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Embedded Processors
Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015
2014
- Nikolaos Kyrtatas
A Basic Linear Algebra Compiler for Embedded Processors
Master thesis, Computer Science, ETH Zurich, Switzerland, 2014
Larkin, J.
2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
2014
- B. Duff, J. Larkin, M. Franusich and Franz Franchetti
Automatic Generation of 3-D FFTs
submitted for publication
Lee, C.-S.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Lee, Matthew
2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
Lee, Yu-Chiang
2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
Lengauer, Christian
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
Li, Jonathan
2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
Li, Q.
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
Li, Xiaoming
2005
- Xiaoming Li, María J. Garzarán and David Padua
Optimizing Sorting with Genetic Algorithm
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
2004
- Xiaoming Li, María J. Garzarán and David Padua
A Dynamically Tuned Sorting Library
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
Liu, R.
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
Lorenz, Juergen
2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
2005
- Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
Efficient Utilization of SIMD Extensions
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
- Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
Vectorization Techniques for the BlueGene/L Double FPU
IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005
2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
2003
- Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
SIMD Vectorization of Straight Line Code
Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
Low, Tze-Meng
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Multi-Dimensional DFTs
SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
2019
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
High Performance Zero-Memory Overhead Direct Convolutions
Proc. International Conference on Machine Learning (ICML), 2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
- Tze-Meng Low and Franz Franchetti
High Assurance Code Generation for Cyber-Physical Systems
Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
- Thom Popovici, Franz Franchetti and Tze-Meng Low
Mixed Data Layout Kernels for Vectorized Complex Arithmetic
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
2016
- Richard Veras, Tze-Meng Low and Franz Franchetti
A Scale-Free Structure for Power-Law Graphs
Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
- Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
Compilers, Hands-Off My Hands-On Optimizations
Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
2015
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
Lu, Yi
2020
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
Lumsdaine, A.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Madsen, G.
2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
Mai, K.
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Maniatakos, M.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Mankad, H.
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
Markov, I.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Mathew, C.
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Mattson, T.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Maza, Marc Moreno
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
McCorquodale, P.
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
McFarlin, Daniel
2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
McMillan, S.
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Mendoza, Juan Pablo
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Meng, Lingchuan
2014
- Lingchuan Meng and Jeremy Johnson
Towards parallel general-size library generation for polynomial multiplication
ACM Communications in Computer Algebra, 2014
2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
Meyerhenke, H.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Milder, Peter A.
2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Computer Generation of Streaming Sorting Networks
Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
- Markus Püschel, Peter A. Milder and James C. Hoe
Permuting Streaming Data Using RAMs
Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Fast Fourier Transform on FPGA: Design Choices and Evaluation
Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
Mionis, Scott
2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
Misra, Smarahara
2004
- Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
2003
- Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
Custom Reduction of Arithmetic in Linear DSP Transforms
Proc. High Performance Extreme Computing (HPEC), 2003
- Smarahara Misra
Custom Reduction of Arithmetic in Multiplierless implementations of DSP Transforms
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2003
2002
- Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
Proc. High Performance Extreme Computing (HPEC), 2002
Mitra, S.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Mitsch, Stefan
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Mohr, Marcus
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
Monil, M. A. H.
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
Moopan, M. N. T.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
Moreira, J.
2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Moritsch, H.
2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
Moura, José M. F.
2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
2016
- Joya Deri, Franz Franchetti and José M. F. Moura
Big Data Computation of Taxi Movement in New York City
Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016
2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
2008
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Marek Telgarsky, James C. Hoe and José M. F. Moura
Spiral: Joint Runtime and Energy Optimization of Linear Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
- José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Aca Gacic, Markus Püschel and José M. F. Moura
Automatically Generated High-Performance Code for Discrete Wavelet Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2003
- Aca Gacic, Markus Püschel and José M. F. Moura
Fast Automatic Implementations of FIR Filters
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
- Markus Püschel and José M. F. Moura
SPIRAL: An Overview
Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
2002
- Markus Püschel and José M. F. Moura
Generation and Manipulation of DSP Transform Algorithms
Proc. Digital Signal Processing Workshop, 2002
- Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Short Vector SIMD Code Generation for DSP Algorithms
Proc. High Performance Extreme Computing (HPEC), 2002
2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Nabeel, M.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
Nagai, T.
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
Nagvajara, Prawat
2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
Neda, N.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Neda, N.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Negi, Rohit
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
Ngaw, M.
2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Nguyen, J.
2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Nolin, Thomas
2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
Nordin, Grace
2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
2004
- Grace Nordin, James C. Hoe and Markus Püschel
Discrete Fourier Transform IP Generator
Proc. High Performance Extreme Computing (HPEC), 2004
Nwankpa, Chika
2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
Odersky, Martin
2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
Ofenbeck, Georg
2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
2014
- Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
Ohshima, S.
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
Olukoton, K.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Oostema, P.
2022
- P. Oostema and Franz Franchetti
Approximating Manifolds and Geodesics with Curved Surfaces
Proc. High Performance Extreme Computing (HPEC), 2022
- P. Oostema and Franz Franchetti
On Manifold Graph Embedding
Proc. High Performance Extreme Computing (HPEC), 2022
2021
- P. Oostema and Franz Franchetti
Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
Owens, J. D.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Ozturk, T.
2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Padua, David
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
2005
- Xiaoming Li, María J. Garzarán and David Padua
Optimizing Sorting with Genetic Algorithm
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
- José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Xiaoming Li, María J. Garzarán and David Padua
A Dynamically Tuned Sorting Library
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
- Nicholas Rizzolo and David Padua
HiLO: High Level Optimization of FFTs
Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2001
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
- Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
SPL: A Language and Compiler for DSP Algorithms
Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Park, Neungsoo
2001
- Neungsoo Park and Viktor K. Prasanna
Cache Conscious Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001
2000
- Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
Dynamic Data Layouts for Cache-conscious Factorization of DFT
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
Patterson, A.
2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
Pedram, M.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Peng, Ronghui
2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
Peter, Thomas
2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
Phaosawasdi, Amarin
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Piazza, G.
2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
Pileggi, Lawrence
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
2016
- F. Sadi, Lawrence Pileggi and Franz Franchetti
3D DRAM Based Application Specific Hardware Accelerator for SpMV
High Performance Extreme Computing Conference (HPEC), 2016
2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
Pipatsrisawat, Thammanit
2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
Pitner, G.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Platzer, Andre'
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Pokharel, R.
2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Polyakov, Y.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Pop, E.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Popovici, Thom
2020
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Multi-Dimensional DFTs
SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Thom Popovici
An Approach to Specifying and Automatically Optimizing Fourier Transform Based Operations
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
- Thom Popovici, Franz Franchetti and Tze-Meng Low
Mixed Data Layout Kernels for Vectorized Complex Arithmetic
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
2016
- Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
Compilers, Hands-Off My Hands-On Optimizations
Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Pouchet, Louis-Noël
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
Prakash, A.
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
Prasanna, Viktor K.
2001
- Neungsoo Park and Viktor K. Prasanna
Cache Conscious Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
2000
- Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
Dynamic Data Layouts for Cache-conscious Factorization of DFT
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Prodan, R.
2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
Püschel, Markus
2022
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
2021
- Joao Rivera, Franz Franchetti and Markus Püschel
An Interval Compiler for Sound Floating Point Computations
Proc. International Symposium on Code Generation and Optimization (CGO), 2021
2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
2016
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
- Marcela Zuluaga, Andreas Krause and Markus Püschel
e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
2015
- Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Embedded Processors
Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015
2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
- Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
- Victoria Caparrós Cabezas and Markus Püschel
Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
- Jörn Schumacher and Markus Püschel
High-performance sparse fast Fourier transforms
Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
2013
- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
Active Learning for Multi-Objective Optimization
Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
- Marcela Zuluaga, Andreas Krause and Markus Püschel
Multi-Objective Optimization for High-Level Synthesis
Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Computer Generation of Streaming Sorting Networks
Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
- Franz Franchetti and Markus Püschel
Fast Fourier Transform
in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Efficient Software Viterbi Decoders
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- C. Angelopoulos, Franz Franchetti and Markus Püschel
DFT Transform on the Fermi (GTX480): Automatic Program Generation
NVIDIA Research Summit at the GPU Technology Conference, 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
- Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
- Markus Püschel, Peter A. Milder and James C. Hoe
Permuting Streaming Data Using RAMs
Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
2008
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
How To Write Fast Numerical Code: A Small Introduction
Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Fast Fourier Transform on FPGA: Design Choices and Evaluation
Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Yevgen Voronenko and Markus Püschel
Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
- Paolo D'Alberto, Markus Püschel and Franz Franchetti
Performance/Energy Optimization of DSP Transforms on the XScale Processor
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
- Franz Franchetti and Markus Püschel
SIMD Vectorization of Non-Two-Power Sized FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
2006
- Yevgen Voronenko and Markus Püschel
Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Sung-Chul Han, Franz Franchetti and Markus Püschel
Program Generation for the All-Pairs Shortest Path Problem
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Spiral: Generating Signal Processing Kernels for New Commodity Architectures
Proc. EDGE Workshop, pp. D49-D50, 2006
2005
- Roland Wunderlich, Markus Püschel and James C. Hoe
Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
Proc. High Performance Extreme Computing (HPEC), 2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
- José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2004
- Aca Gacic, Markus Püschel and José M. F. Moura
Automatically Generated High-Performance Code for Discrete Wavelet Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
- Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
- Yevgen Voronenko and Markus Püschel
Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
- Markus Püschel, Adam C. Zelinski and James C. Hoe
Custom-Optimized Multiplierless Implementations of DSP Algorithms
Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
- Grace Nordin, James C. Hoe and Markus Püschel
Discrete Fourier Transform IP Generator
Proc. High Performance Extreme Computing (HPEC), 2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2003
- Markus Püschel
Cooley-Tukey FFT like Algorithms for the DCT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
- Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
Custom Reduction of Arithmetic in Linear DSP Transforms
Proc. High Performance Extreme Computing (HPEC), 2003
- Aca Gacic, Markus Püschel and José M. F. Moura
Fast Automatic Implementations of FIR Filters
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation and Adaptation for DSP Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation for the Discrete Fourier Transform
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
- Markus Püschel and José M. F. Moura
SPIRAL: An Overview
Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
- Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
Proc. Design Automation Conference (DAC), pp. 496-501, 2003
2002
- Markus Püschel, Sebastian Egner and Thomas Beth
AREP
in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002
- Franz Franchetti and Markus Püschel
A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
- Markus Püschel and José M. F. Moura
Generation and Manipulation of DSP Transform Algorithms
Proc. Digital Signal Processing Workshop, 2002
- Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
Proc. High Performance Extreme Computing (HPEC), 2002
- Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Short Vector SIMD Code Generation for DSP Algorithms
Proc. High Performance Extreme Computing (HPEC), 2002
2001
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- Sebastian Egner and Markus Püschel
Automatic Generation of Fast Discrete Signal Transforms
IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
2000
- Jeremy Johnson and Markus Püschel
In Search of the Optimal Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Rabaey, J.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Ramanujam, J.
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
Rangaraj, Deepak
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Rao, Sanil
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
Rao, Varun
2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
Re, C.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Reagen, B.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Reynwar, B.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Rimmel, Arpad
2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
Rivera, Joao
2022
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
2021
- Joao Rivera, Franz Franchetti and Markus Püschel
An Interval Compiler for Sound Floating Point Computations
Proc. International Symposium on Code Generation and Optimization (CGO), 2021
Rizzolo, Nicholas
2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Nicholas Rizzolo and David Padua
HiLO: High Level Optimization of FFTs
Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004
Rollett, Anthony
2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Rompf, Tiark
2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
2014
- Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
Russell, F.
2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
Rutenbar, Rob A.
2003
- Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
Proc. Design Automation Conference (DAC), pp. 496-501, 2003
Ruzicka, Vit
2018
- Vit Ruzicka and Franz Franchetti
Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
Sabry Aly, M. M.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Sadayappan, P.
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
Sadi, F.
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
2018
- F. Sadi
Accelerating Sparse Matrix Kernels with Co-optimized Architecture
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
2016
- F. Sadi, Lawrence Pileggi and Franz Franchetti
3D DRAM Based Application Specific Hardware Accelerator for SpMV
High Performance Extreme Computing Conference (HPEC), 2016
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
Sandryhaila, Aliaksei
2014
- Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
High Assurance SPIRAL
Proc. SPIE, Proceedings of SPIE 2014, 2014
2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
Schatz, Martin
2020
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Multi-Dimensional DFTs
SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
2019
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
Schmidt, A. G.
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Schulz, M.
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
Schumacher, Jörn
2014
- Jörn Schumacher and Markus Püschel
High-performance sparse fast Fourier transforms
Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014
2013
- Jörn Schumacher
High Performance Sparse Fast Fourier Transform
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
Sepiashvili, David
2000
- David Sepiashvili
Performance Models and Search Methods for Optimal FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
Sergent, Guillaume
2013
- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
Active Learning for Multi-Objective Optimization
Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
Serre, Francois
2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
Sexton, J. C.
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
Shacham, O.
2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
Shen, Hao
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
Shenoy, Pranab
2007
- Pranab Shenoy
Universal FFT Core Generator
Master thesis, Computer Science, Drexel University, 2007
Shulaker, M. M.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Siddique, F.
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
Singer, Bryan
2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2002
- Bryan Singer and Manuela Veloso
Automating the Modeling and Optimization of the Performance of Signal Transforms
IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
- Bryan Singer and Manuela Veloso
Learning to Construct Fast Signal Processing Implementations
Journal of Machine Learning Research, special issue on ``the Eighteenth International Conference on Machine Learning (ICML 2001)'', Vol. 3, pp. 887-919, 2002
2001
- Bryan Singer
Automating the Modeling and Optimization of the Performance of Signal Processing Algorithms
PhD. thesis, Computer Science, Carnegie Mellon University, 2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
- Bryan Singer and Manuela Veloso
Learning to Generate Fast Signal Processing Implementations
Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
- Bryan Singer and Manuela Veloso
Stochastic Search for Signal Processing Algorithm Optimization
Proc. Supercomputing (SC), pp. 22, 2001
2000
- Bryan Singer and Manuela Veloso
Learning to Predict Performance from Formula Modeling and Training Data
Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000
Skadron, K.
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
Skylaris, C-K.
2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
Soni, D.
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Sozeau, Matthieu
2019
- Vadim Zaliva and Matthieu Sozeau
Reification of Shallow-Embedded DSLs in Coq with Automated Verification
Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019
Spampinato, Daniele G.
2020
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
FFTX for Micromechanical Stress-Strain Analysis
IEEE High Performance Extreme Computing Conference (HPEC), 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
2016
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
2015
- Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Embedded Processors
Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015
2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
Sridhar, U.
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
Stein, C.
2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Steinmann, Ruedi
2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
Stock, Kevin
2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
Stojanov, Alen
2014
- Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
Sumbul, H. E.
2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
Sun, D.
2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Suter, R.
2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Sweeney, Joe
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
Takahashi, Daisuke
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
2020
- Daisuke Takahashi and Franz Franchetti
FFTE on SVE: SPIRAL-Generated Kernels
Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020
Tang, E.
2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- E. Tang and Franz Franchetti
Magic Memory: A Programming Model For Big Data Analytics
Proc. High Performance Extreme Computing (HPEC), 2022
Tang, L.
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
- Y. Eum, N. Zhang, L. Tang and Franz Franchetti
Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2024
2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Tazzoli, A.
2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
Telgarsky, Marek
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Marek Telgarsky, James C. Hoe and José M. F. Moura
Spiral: Joint Runtime and Energy Optimization of Linear Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
Thoma, Cory
2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
Tummeltshammer, Peter
2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
Turner, E. L.
2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
Ueberhuber, Christoph W.
2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
2005
- Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
Efficient Utilization of SIMD Extensions
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
- Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
Vectorization Techniques for the BlueGene/L Double FPU
IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005
2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
2003
- Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
SIMD Vectorization of Straight Line Code
Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
2002
- Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
FFT Kernels with FMA Utilization
Proc. APLIMAT Conference, pp. 333-339, 2002
- Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Short Vector SIMD Code Generation for DSP Algorithms
Proc. High Performance Extreme Computing (HPEC), 2002
2001
- Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
Architecture Independent Short Vector FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001
Vachranukunkiet, Petya
2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
Vaidyanathan, K.
2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
Van Straalen, B.
2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
Veloso, Manuela
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2002
- Bryan Singer and Manuela Veloso
Automating the Modeling and Optimization of the Performance of Signal Transforms
IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
- Bryan Singer and Manuela Veloso
Learning to Construct Fast Signal Processing Implementations
Journal of Machine Learning Research, special issue on ``the Eighteenth International Conference on Machine Learning (ICML 2001)'', Vol. 3, pp. 887-919, 2002
2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
- Bryan Singer and Manuela Veloso
Learning to Generate Fast Signal Processing Implementations
Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
- Bryan Singer and Manuela Veloso
Stochastic Search for Signal Processing Algorithm Optimization
Proc. Supercomputing (SC), pp. 22, 2001
2000
- Bryan Singer and Manuela Veloso
Learning to Predict Performance from Formula Modeling and Training Data
Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Veras, Richard
2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Richard Veras and Franz Franchetti
A Scale-free Structure for Real World Networks
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Richard Veras
The Automatic Generation of High-Performance Graph Analytic Code using Spiral
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017
2016
- Richard Veras, Tze-Meng Low and Franz Franchetti
A Scale-Free Structure for Power-Law Graphs
Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
- Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
Compilers, Hands-Off My Hands-On Optimizations
Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
Vetter, J.
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
Volk, Anja
2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
Voronenko, Yevgen
2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
2009
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
- Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
2007
- Yevgen Voronenko and Markus Püschel
Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
2006
- Yevgen Voronenko and Markus Püschel
Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Spiral: Generating Signal Processing Kernels for New Commodity Architectures
Proc. EDGE Workshop, pp. D49-D50, 2006
2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2004
- Yevgen Voronenko and Markus Püschel
Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
Włodarczyk, Tomasz Wiktor
2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
Watts, Philip M.
2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Weng, Yang
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
Wilkinson, K.
2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
Wong, H.-S.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Wu, T. F.
2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
Wunderlich, Roland
2005
- Roland Wunderlich, Markus Püschel and James C. Hoe
Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
Proc. High Performance Extreme Computing (HPEC), 2005
Wurzinger, Peter
2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
Xie, Yuzhen
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
Xiong, Jianxin
2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2001
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- Jianxin Xiong
Automatic Optimization of DSP Algorithms
PhD. thesis, Computer Science, University of Illinois at Urbana-Champaign, 2001, Also as Technical Report UIUCDCS-R-2001-224, University of Illinois
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
- Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
SPL: A Language and Compiler for DSP Algorithms
Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
Xu, G.
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
2023
- G. Xu, Franz Franchetti and James C. Hoe
An Approach to Generating Customized Load-Store Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
Xu, Xu
2007
- Jeremy Johnson and Xu Xu
Generating Symmetric DFTs and Equivariant FFT Algorithms
Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
2003
- Xu Xu
A Recursive Implementation of the Dimensionless FFT
Master thesis, Computer Science, Drexel University, 2003
- Jeremy Johnson and Xu Xu
A Recursive Implementation of the Dimensionless FFT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003
Yang, C.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Yang, R.
2014
- Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
Yates, R. K.
2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
Yoon, Irene
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Yu, Qian
2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
Yu, W.
2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
- W. Yu
Performance Portable Tracking of Evolving Surfaces
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
Zaichuk, Ilia
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
Zakowski, Yannick
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Zalewski, M.
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Zaliva, Vadim
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
2020
- Vadim Zaliva
HELIX: From Math to Verified Code
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
2019
- Vadim Zaliva and Matthieu Sozeau
Reification of Shallow-Embedded DSLs in Coq with Automated Verification
Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019
2018
- Vadim Zaliva and Franz Franchetti
HELIX: A Case Study of a Formal Verification of High Performance Program Generation
Proc. Workshop on Functional High Performance Computing (FHPC), 2018
2014
- Vadim Zaliva and Franz Franchetti
Barometric and GPS Altitude Sensor Fusion
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
Zdancewic, Steve
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Zelinski, Adam C.
2004
- Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
- Markus Püschel, Adam C. Zelinski and James C. Hoe
Custom-Optimized Multiplierless Implementations of DSP Algorithms
Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
2003
- Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
Custom Reduction of Arithmetic in Linear DSP Transforms
Proc. High Performance Extreme Computing (HPEC), 2003
Zhang, B.
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Zhang, Jiyuan
2020
- Jiyuan Zhang
Accelerating the Motifs of Machine Learning Applications on Modern Processors
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
2018
- Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
High Performance Zero-Memory Overhead Direct Convolutions
Proc. International Conference on Machine Learning (ICML), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
2015
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
Zhang, N.
2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
- Y. Eum, N. Zhang, L. Tang and Franz Franchetti
Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2024
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- N. Zhang and Franz Franchetti
Generating Number Theoretic Transforms for Multi-Word Integer Data Types
Proc. International Symposium on Code Generation and Optimization (CGO), 2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Zhang, T.
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
Zhu, N.
2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Zhu, Qiuling
2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
- Qiuling Zhu
Application Specific Logic in Memory
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
Zuluaga, Marcela
2016
- Marcela Zuluaga, Andreas Krause and Markus Püschel
e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
2013
- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
Active Learning for Multi-Objective Optimization
Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
- Marcela Zuluaga, Andreas Krause and Markus Püschel
Multi-Objective Optimization for High-Level Synthesis
Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
2012
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Computer Generation of Streaming Sorting Networks
Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
Zuo, Z.
2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022