Publications
Filtered as:
all types
- all years
- all authors
- all keywords
Sorted by:
keyword - year
Corresponding
bibtex list
3D-stacked
2016
- F. Sadi, Lawrence Pileggi and Franz Franchetti
3D DRAM Based Application Specific Hardware Accelerator for SpMV
High Performance Extreme Computing Conference (HPEC), 2016
2015
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architectur (ISCA), 2015
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
Acceleration
2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
2020
- Jiyuan Zhang
Accelerating the Motifs of Machine Learning Applications on Modern Processors
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
2018
- F. Sadi
Accelerating Sparse Matrix Kernels with Co-optimized Architecture
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
2016
- F. Sadi, Lawrence Pileggi and Franz Franchetti
3D DRAM Based Application Specific Hardware Accelerator for SpMV
High Performance Extreme Computing Conference (HPEC), 2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
2015
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
Adaptive processing
2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
Affine arithmetic
2022
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
Algebraic
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
2016
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
Algebraic signal processing theory
2009
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
2008
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
2006
- Yevgen Voronenko and Markus Püschel
Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
2003
- Markus Püschel
Cooley-Tukey FFT like Algorithms for the DCT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
Algorithm
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
- P. Oostema and Franz Franchetti
Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
Algorithm complexity
2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
Algorithm theory and analysis
2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
2009
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
2008
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
2007
- Jeremy Johnson and Xu Xu
Generating Symmetric DFTs and Equivariant FFT Algorithms
Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
2006
- Yevgen Voronenko and Markus Püschel
Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
- Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
Theoretical Computer Science, Vol. 352, pp. 8-30, 2006
2005
- Mihai Furis, Pawel Hitczenko and Jeremy Johnson
Cache Miss Analysis of WHT Algorithms
Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
2004
- Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2004
2003
- Anthony F. Breitzman
Automatic Derivation and Implementation of Fast Convolution Algorithms
PhD. thesis, Computer Science, Drexel University, 2003
- Mihai Furis
Cache Miss Analysis of Walsh-Hadamard Transform Algorithms
Master thesis, Computer Science, Drexel University, 2003
- Markus Püschel
Cooley-Tukey FFT like Algorithms for the DCT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
- Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
Proc. Design Automation Conference (DAC), pp. 496-501, 2003
2002
- Hung-Jen Huang
Performance Analysis of an Adaptive Algorithm for the Walsh-Hadamard Transform
Master thesis, Computer Science, Drexel University, 2002
2001
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- Sebastian Egner and Markus Püschel
Automatic Generation of Fast Discrete Signal Transforms
IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001
Application-specific accelleration
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
Architecture
2023
- G. Xu, Franz Franchetti and James C. Hoe
An Approach to Generating Customized Load-Store Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
2018
- F. Sadi
Accelerating Sparse Matrix Kernels with Co-optimized Architecture
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
Arm SVE
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
Asynchronous
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
Automatic generation
2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
- Richard Veras
The Automatic Generation of High-Performance Graph Analytic Code using Spiral
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017
2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
Autotuning
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
2011
- Tao Cui and Franz Franchetti
Autotuning a Random Walk Boolean Satisfiability Solver
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011
Bandwidth
2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
Beyond transforms
2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
2010
- Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Efficient Software Viterbi Decoders
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
Big data
2022
- E. Tang and Franz Franchetti
Magic Memory: A Programming Model For Big Data Analytics
Proc. High Performance Extreme Computing (HPEC), 2022
2016
- Joya Deri, Franz Franchetti and José M. F. Moura
Big Data Computation of Taxi Movement in New York City
Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016
Bilateral filtering
2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
Bioinformatics
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
BLAS
2025
- N. Zhang and Franz Franchetti
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
Proc. International Symposium on Code Generation and Optimization (CGO), 2025
BlueGene
2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
- Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
Vectorization Techniques for the BlueGene/L Double FPU
IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005
Boolean
2011
- Tao Cui and Franz Franchetti
Autotuning a Random Walk Boolean Satisfiability Solver
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011
Bottleneck analysis
2014
- Victoria Caparrós Cabezas and Markus Püschel
Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
Cell BE Processor
2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
Chip generator
2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
Circuit optimization
2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
Code generator
2025
- N. Zhang and Franz Franchetti
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
Proc. International Symposium on Code Generation and Optimization (CGO), 2025
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
Coding/Decoding
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2020
- Vadim Zaliva
HELIX: From Math to Verified Code
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Tze-Meng Low and Franz Franchetti
High Assurance Code Generation for Cyber-Physical Systems
Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
Compiler
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2022
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
2021
- Joao Rivera, Franz Franchetti and Markus Püschel
An Interval Compiler for Sound Floating Point Computations
Proc. International Symposium on Code Generation and Optimization (CGO), 2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2016
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
Construction of Generators
2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
Control systems
2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Convolutions
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
Coq
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
2019
- Vadim Zaliva and Matthieu Sozeau
Reification of Shallow-Embedded DSLs in Coq with Automated Verification
Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019
CPUs
2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
2020
- Jiyuan Zhang
Accelerating the Motifs of Machine Learning Applications on Modern Processors
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
2014
- Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
Cryptography
2025
- N. Zhang and Franz Franchetti
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
Proc. International Symposium on Code Generation and Optimization (CGO), 2025
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Curved surfaces
2022
- P. Oostema and Franz Franchetti
Approximating Manifolds and Geodesics with Curved Surfaces
Proc. High Performance Extreme Computing (HPEC), 2022
Cyber-physical systems
2017
- Tze-Meng Low and Franz Franchetti
High Assurance Code Generation for Cyber-Physical Systems
Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
Data pump architecture (DPA)
2010
- Douglas F. Jones
Data Pump Architecture Simulator and Performance Model
Master thesis, Computer Science, Drexel University, 2010
Data reorganization
2015
- Berkin Akin
A Formal Approach to Memory Access Optimization: Data Layout, Reorganization, and Near-Data Processing
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architectur (ISCA), 2015
Design
2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
Dimensionaless FFT
2007
- Pranab Shenoy
Universal FFT Core Generator
Master thesis, Computer Science, Drexel University, 2007
2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
Discrete/fast cosine transforms
2010
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2008
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
2003
- Markus Püschel
Cooley-Tukey FFT like Algorithms for the DCT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
Discrete/fast Fourier transform
2019
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
2013
- Jörn Schumacher
High Performance Sparse Fast Fourier Transform
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
- Franz Franchetti and Markus Püschel
Fast Fourier Transform
in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- C. Angelopoulos, Franz Franchetti and Markus Püschel
DFT Transform on the Fermi (GTX480): Automatic Program Generation
NVIDIA Research Summit at the GPU Technology Conference, 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
- Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Jeremy Johnson and Xu Xu
Generating Symmetric DFTs and Equivariant FFT Algorithms
Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
- Franz Franchetti and Markus Püschel
SIMD Vectorization of Non-Two-Power Sized FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
- Pranab Shenoy
Universal FFT Core Generator
Master thesis, Computer Science, Drexel University, 2007
2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
- Grace Nordin, James C. Hoe and Markus Püschel
Discrete Fourier Transform IP Generator
Proc. High Performance Extreme Computing (HPEC), 2004
- Nicholas Rizzolo and David Padua
HiLO: High Level Optimization of FFTs
Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004
2003
- Xu Xu
A Recursive Implementation of the Dimensionless FFT
Master thesis, Computer Science, Drexel University, 2003
- Jeremy Johnson and Xu Xu
A Recursive Implementation of the Dimensionless FFT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation for the Discrete Fourier Transform
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
2002
- Mike Balog
A Flexible Framework for Implementing FFT Processors
Master thesis, Electrical and Computer Engineering, Drexel University, 2002
2001
- Pinit Kumhom
Design, Optimization, and Implementation of a Universal FFT Processor
PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001
2000
- Gavin Haentjens
An Investigation of Recursive FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
- Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
Dynamic Data Layouts for Cache-conscious Factorization of DFT
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
- David Sepiashvili
Performance Models and Search Methods for Optimal FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
Distributed memory
2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
Distributed, parallel, and cluster computing
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
Domain-specific
2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
DRAM
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
Drug metabolism
2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
Eager K-truss
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
Edge-centric
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
Electric power systems
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
- Tao Cui and Franz Franchetti
A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
2011
- Tao Cui and Franz Franchetti
A Monte Carlo Framework for Probabilistic Distribution Power Flow
Carnegie Mellon Conference on the Electricity Industry, 2011
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
Electric vehicles
2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
Embedded DSLs
2019
- Vadim Zaliva and Matthieu Sozeau
Reification of Shallow-Embedded DSLs in Coq with Automated Verification
Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019
Embedded Processors
2014
- Nikolaos Kyrtatas
A Basic Linear Algebra Compiler for Embedded Processors
Master thesis, Computer Science, ETH Zurich, Switzerland, 2014
Embedding
2022
- P. Oostema and Franz Franchetti
On Manifold Graph Embedding
Proc. High Performance Extreme Computing (HPEC), 2022
Energy-efficient
2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
Evolving surfaces
2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
2010
- W. Yu
Performance Portable Tracking of Evolving Surfaces
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
Exascale
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
Fast algorithms
2013
- Jörn Schumacher
High Performance Sparse Fast Fourier Transform
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
2008
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
2006
- Yevgen Voronenko and Markus Püschel
Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
2003
- Markus Püschel
Cooley-Tukey FFT like Algorithms for the DCT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
Fast Fourier Transform
2025
- N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025
2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
- Thom Popovici
An Approach to Specifying and Automatically Optimizing Fourier Transform Based Operations
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
2015
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
Journal of Signal Processing Systems, 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- Jörn Schumacher and Markus Püschel
High-performance sparse fast Fourier transforms
Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014
2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
2003
- Franz Franchetti
A Portable Short Vector Version of FFTW
Proc. IMACS Symposium on Mathematical Modelling (MATHMOD), Vol. 2, pp. 1539-1548, 2003
2002
- Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
FFT Kernels with FMA Utilization
Proc. APLIMAT Conference, pp. 333-339, 2002
2001
- Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
Architecture Independent Short Vector FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001
2000
- Franz Franchetti
Short Vector FFTs
Master thesis, Vienna University of Technology, 2000
FFT
2025
- N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Thom Popovici
An Approach to Specifying and Automatically Optimizing Fourier Transform Based Operations
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
FFTE
2020
- Daisuke Takahashi and Franz Franchetti
FFTE on SVE: SPIRAL-Generated Kernels
Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020
FFTW
2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
FFTX
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
FFTX for Micromechanical Stress-Strain Analysis
IEEE High Performance Extreme Computing Conference (HPEC), 2019
FHE Hardward
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
Filtering/Convolution/Wavelet transforms
2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
2004
- Aca Gacic, Markus Püschel and José M. F. Moura
Automatically Generated High-Performance Code for Discrete Wavelet Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
- Anthony F. Breitzman and Jeremy Johnson
Automatic Derivation and Implementation of Fast Convolution Algorithms
Journal of High Performance Computing and Applications, special issue on ``Computer Algebra and Signal Processing'', Vol. 37, No. 2, pp. 157-186, 2004
- Aca Gacic
Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004
2003
- Aca Gacic, Markus Püschel and José M. F. Moura
Fast Automatic Implementations of FIR Filters
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
Floating-point arithmetic
2022
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
2021
- Joao Rivera, Franz Franchetti and Markus Püschel
An Interval Compiler for Sound Floating Point Computations
Proc. International Symposium on Code Generation and Optimization (CGO), 2021
FPGA
2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
Future quantum frameworks
2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
GBTLX
2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
General size libraries
2014
- B. Duff, J. Larkin, M. Franusich and Franz Franchetti
Automatic Generation of 3-D FFTs
submitted for publication
2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
Geodesics
2022
- P. Oostema and Franz Franchetti
Approximating Manifolds and Geodesics with Curved Surfaces
Proc. High Performance Extreme Computing (HPEC), 2022
GPUs
2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
2020
- Jiyuan Zhang
Accelerating the Motifs of Machine Learning Applications on Modern Processors
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Vit Ruzicka and Franz Franchetti
Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
2010
- C. Angelopoulos, Franz Franchetti and Markus Püschel
DFT Transform on the Fermi (GTX480): Automatic Program Generation
NVIDIA Research Summit at the GPU Technology Conference, 2010
Graph-algorithms
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
- P. Oostema and Franz Franchetti
Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
GraphBLAS
2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
2017
- Richard Veras and Franz Franchetti
A Scale-free Structure for Real World Networks
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
Graphs
2022
- P. Oostema and Franz Franchetti
On Manifold Graph Embedding
Proc. High Performance Extreme Computing (HPEC), 2022
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
2017
- Richard Veras
The Automatic Generation of High-Performance Graph Analytic Code using Spiral
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017
Greedy algorithm
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
Hardware
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
Hardware accelerator
2024
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
Hardware architecture
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
HELIX
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
2020
- Vadim Zaliva
HELIX: From Math to Verified Code
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
High assurance
2017
- Tze-Meng Low and Franz Franchetti
High Assurance Code Generation for Cyber-Physical Systems
Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
High performance
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Vadim Zaliva and Franz Franchetti
HELIX: A Case Study of a Formal Verification of High Performance Program Generation
Proc. Workshop on Functional High Performance Computing (FHPC), 2018
- Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
High Performance Zero-Memory Overhead Direct Convolutions
Proc. International Conference on Machine Learning (ICML), 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Richard Veras
The Automatic Generation of High-Performance Graph Analytic Code using Spiral
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017
2013
- Tao Cui
Power System Probabilistic and Security Analysis Using Commodity High Performance Computing Systems
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
2011
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
Homomorphic encryption
2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
Image segmentation
2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
Intellectual Property (IP)
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
IP cores for FPGA/ASIC
2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Computer Generation of Streaming Sorting Networks
Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
- Markus Püschel, Peter A. Milder and James C. Hoe
Permuting Streaming Data Using RAMs
Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
2007
- Sung-Chul Han
A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
- Pranab Shenoy
Universal FFT Core Generator
Master thesis, Computer Science, Drexel University, 2007
2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
2004
- Markus Püschel, Adam C. Zelinski and James C. Hoe
Custom-Optimized Multiplierless Implementations of DSP Algorithms
Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
- Grace Nordin, James C. Hoe and Markus Püschel
Discrete Fourier Transform IP Generator
Proc. High Performance Extreme Computing (HPEC), 2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
2002
- Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
Proc. High Performance Extreme Computing (HPEC), 2002
2001
- Peter Becker
A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
Master thesis, Electrical and Computer Engineering, Drexel University, 2001
- Pinit Kumhom
Design, Optimization, and Implementation of a Universal FFT Processor
PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001
2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
IRIS
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
Iterative
2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
Kokkos
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
K-truss
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
Large integer multiplication
2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
Large Scale
2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
LDPC codes
2007
- Sung-Chul Han
A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007
Learn the current Spiral system
2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Yevgen Voronenko and Markus Püschel
Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
- Paolo D'Alberto, Markus Püschel and Franz Franchetti
Performance/Energy Optimization of DSP Transforms on the XScale Processor
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Aca Gacic
Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004
Library
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
Linear algebra
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
2016
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
- Nikolaos Kyrtatas
A Basic Linear Algebra Compiler for Embedded Processors
Master thesis, Computer Science, ETH Zurich, Switzerland, 2014
2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
LLVM
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Load flow
2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
Load-store
2023
- G. Xu, Franz Franchetti and James C. Hoe
An Approach to Generating Customized Load-Store Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
Locality optimization
2014
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
Logic-in-memory
2013
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
- Qiuling Zhu
Application Specific Logic in Memory
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
Machine Learning
2020
- Jiyuan Zhang
Accelerating the Motifs of Machine Learning Applications on Modern Processors
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
Manifolds
2022
- P. Oostema and Franz Franchetti
Approximating Manifolds and Geodesics with Curved Surfaces
Proc. High Performance Extreme Computing (HPEC), 2022
Mass spectrometry
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
Matrix-vector code
2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
Medicine
2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
Memory
2018
- Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
High Performance Zero-Memory Overhead Direct Convolutions
Proc. International Conference on Machine Learning (ICML), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
2015
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- Berkin Akin
A Formal Approach to Memory Access Optimization: Data Layout, Reorganization, and Near-Data Processing
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architectur (ISCA), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
2014
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
Merge parallelization
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
Modeling
2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Modular
2025
- N. Zhang and Franz Franchetti
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
Proc. International Symposium on Code Generation and Optimization (CGO), 2025
Monads
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Monte Carlo
2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
2011
- Tao Cui and Franz Franchetti
A Monte Carlo Framework for Probabilistic Distribution Power Flow
Carnegie Mellon Conference on the Electricity Industry, 2011
Multibuffering
2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
Multicore Systems
2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
2011
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
Multidimensional DFT
2020
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Multi-Dimensional DFTs
SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
2019
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
2007
- Jeremy Johnson and Xu Xu
Generating Symmetric DFTs and Equivariant FFT Algorithms
Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
- Pranab Shenoy
Universal FFT Core Generator
Master thesis, Computer Science, Drexel University, 2007
2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
Multiplier block
2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
Multiplierless
2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
2004
- Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
- Markus Püschel, Adam C. Zelinski and James C. Hoe
Custom-Optimized Multiplierless Implementations of DSP Algorithms
Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
2003
- Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
Custom Reduction of Arithmetic in Linear DSP Transforms
Proc. High Performance Extreme Computing (HPEC), 2003
- Smarahara Misra
Custom Reduction of Arithmetic in Multiplierless implementations of DSP Transforms
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2003
Multithreading
2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
2004
- Jeremy Johnson and Kang Chen
A Self-Adapting Distributed Memory Package for Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
2002
- Kang Chen
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Master thesis, Computer Science, Drexel University, 2002
- Kang Chen and Jeremy Johnson
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
Multi-word arithmetic
2025
- N. Zhang and Franz Franchetti
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
Proc. International Symposium on Code Generation and Optimization (CGO), 2025
2024
- Y. Eum, N. Zhang, L. Tang and Franz Franchetti
Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2024
Multi-word integer
2023
- N. Zhang and Franz Franchetti
Generating Number Theoretic Transforms for Multi-Word Integer Data Types
Proc. International Symposium on Code Generation and Optimization (CGO), 2023
Near-data processing
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
Next-generation open-source software
2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
NTT
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
Number theoretic transforms
2025
- N. Zhang and Franz Franchetti
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
Proc. International Symposium on Code Generation and Optimization (CGO), 2025
2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- N. Zhang and Franz Franchetti
Generating Number Theoretic Transforms for Multi-Word Integer Data Types
Proc. International Symposium on Code Generation and Optimization (CGO), 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Numerical kernels we consider
2017
- Thom Popovici, Franz Franchetti and Tze-Meng Low
Mixed Data Layout Kernels for Vectorized Complex Arithmetic
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
2006
- Sung-Chul Han, Franz Franchetti and Markus Püschel
Program Generation for the All-Pairs Shortest Path Problem
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
2005
- Roland Wunderlich, Markus Püschel and James C. Hoe
Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
Proc. High Performance Extreme Computing (HPEC), 2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Xiaoming Li, María J. Garzarán and David Padua
A Dynamically Tuned Sorting Library
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
- Aca Gacic
Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004
Object detection
2018
- Vit Ruzicka and Franz Franchetti
Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
OFDM
2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
2010
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
Online decentralized charging
2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
Optimizing
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
2018
- F. Sadi
Accelerating Sparse Matrix Kernels with Co-optimized Architecture
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- Thom Popovici
An Approach to Specifying and Automatically Optimizing Fourier Transform Based Operations
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
2016
- Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
Compilers, Hands-Off My Hands-On Optimizations
Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
2015
- Berkin Akin
A Formal Approach to Memory Access Optimization: Data Layout, Reorganization, and Near-Data Processing
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
Parallel Backprojection
2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
Parallel distribution
2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
Parallel library generation
2014
- Lingchuan Meng and Jeremy Johnson
Towards parallel general-size library generation for polynomial multiplication
ACM Communications in Computer Algebra, 2014
2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
Parallel platforms
2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
Parallel processing
2019
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
2011
- Franz Franchetti and Markus Püschel
Fast Fourier Transform
in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2002
- Kang Chen
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Master thesis, Computer Science, Drexel University, 2002
Parameter estimation
2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
Performance analysis
2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
Performance metric
2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
Performance model
2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
- Victoria Caparrós Cabezas and Markus Püschel
Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
2013
- Tao Cui and Franz Franchetti
A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
2008
- Jeremy Johnson and Michael Andrews
Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
2007
- Jeremy Johnson and Michael Andrews
Performance Analysis of a Family of WHT Algorithms
Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium , IEEE, pp. 450, 2007
Performance portable
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
2010
- W. Yu
Performance Portable Tracking of Evolving Surfaces
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2003
- Franz Franchetti
Performance Portable Short Vector Transforms
PhD. thesis, Vienna University of Technology, 2003
Phasor
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
Polar format
2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
Polyhedral Model
2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
Polynomial multiplication
2014
- Lingchuan Meng and Jeremy Johnson
Towards parallel general-size library generation for polynomial multiplication
ACM Communications in Computer Algebra, 2014
Portability
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
Power10 ISA
2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
Power systems
2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Tao Cui
Power System Probabilistic and Security Analysis Using Commodity High Performance Computing Systems
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
Privacy
2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
Probabilistic Load Flow
2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
- Tao Cui and Franz Franchetti
A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
2011
- Tao Cui and Franz Franchetti
A Monte Carlo Framework for Probabilistic Distribution Power Flow
Carnegie Mellon Conference on the Electricity Industry, 2011
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
Processing-in-memory
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
Program generation
2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2018
- Vadim Zaliva and Franz Franchetti
HELIX: A Case Study of a Formal Verification of High Performance Program Generation
Proc. Workshop on Functional High Performance Computing (FHPC), 2018
2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
ProtoX
2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
Qiskit
2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Quadcopter
2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
Quantum
2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
Quantum circuit
2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
Quantum computing
2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
Radial
2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
Real time
2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
Rewrite system
2025
- N. Zhang and Franz Franchetti
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
Proc. International Symposium on Code Generation and Optimization (CGO), 2025
Ring Processing
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
Robust contours
2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
Roofline model
2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
- Victoria Caparrós Cabezas and Markus Püschel
Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
RPU
2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
Scalable
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
Scale-free
2017
- Richard Veras and Franz Franchetti
A Scale-free Structure for Real World Networks
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
Scientific computing
2025
- N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025
Search/Learning for optimization
2016
- Marcela Zuluaga, Andreas Krause and Markus Püschel
e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
2013
- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
Active Learning for Multi-Objective Optimization
Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
- Marcela Zuluaga, Andreas Krause and Markus Püschel
Multi-Objective Optimization for High-Level Synthesis
Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
2002
- Bryan Singer and Manuela Veloso
Automating the Modeling and Optimization of the Performance of Signal Transforms
IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
- Bryan Singer and Manuela Veloso
Learning to Construct Fast Signal Processing Implementations
Journal of Machine Learning Research, special issue on ``the Eighteenth International Conference on Machine Learning (ICML 2001)'', Vol. 3, pp. 887-919, 2002
2001
- Bryan Singer
Automating the Modeling and Optimization of the Performance of Signal Processing Algorithms
PhD. thesis, Computer Science, Carnegie Mellon University, 2001
- Bryan Singer and Manuela Veloso
Learning to Generate Fast Signal Processing Implementations
Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
- Bryan Singer and Manuela Veloso
Stochastic Search for Signal Processing Algorithm Optimization
Proc. Supercomputing (SC), pp. 22, 2001
2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
- Bryan Singer and Manuela Veloso
Learning to Predict Performance from Formula Modeling and Training Data
Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000
Security
2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
2013
- Tao Cui
Power System Probabilistic and Security Analysis Using Commodity High Performance Computing Systems
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
Semantics
2025
- N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Semantics lifting
2025
- N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025
Set intersection
2020
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
Shor's algorithm
2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Short vector
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
2003
- Franz Franchetti
Performance Portable Short Vector Transforms
PhD. thesis, Vienna University of Technology, 2003
2000
- Franz Franchetti
Short Vector FFTs
Master thesis, Vienna University of Technology, 2000
SIMD vectorization
2020
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
- Nikolaos Kyrtatas
A Basic Linear Algebra Compiler for Embedded Processors
Master thesis, Computer Science, ETH Zurich, Switzerland, 2014
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
- Jörn Schumacher
High Performance Sparse Fast Fourier Transform
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
- Franz Franchetti and Markus Püschel
Fast Fourier Transform
in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
- Jeremy Johnson and Michael Andrews
Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
2007
- Franz Franchetti and Markus Püschel
SIMD Vectorization of Non-Two-Power Sized FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Sung-Chul Han, Franz Franchetti and Markus Püschel
Program Generation for the All-Pairs Shortest Path Problem
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
2005
- Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
Efficient Utilization of SIMD Extensions
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation and Adaptation for DSP Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation for the Discrete Fourier Transform
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
- Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
SIMD Vectorization of Straight Line Code
Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
2002
- Franz Franchetti and Markus Püschel
A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
Simulation
2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
Smart grid
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
- Tao Cui and Franz Franchetti
A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
Smart Metering
2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
Software defined radio (SDR)
2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
Sorting
2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
2012
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Computer Generation of Streaming Sorting Networks
Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
2005
- Xiaoming Li, María J. Garzarán and David Padua
Optimizing Sorting with Genetic Algorithm
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
2004
- Xiaoming Li, María J. Garzarán and David Padua
A Dynamically Tuned Sorting Library
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
Sparse matrices
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
Spatial graph embedding
2021
- P. Oostema and Franz Franchetti
Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
Spectral library search
2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
SpectralPACK
2020
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
SPIRAL
2025
- N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025
2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2020
- Daisuke Takahashi and Franz Franchetti
FFTE on SVE: SPIRAL-Generated Kernels
Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
2017
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
- Richard Veras
The Automatic Generation of High-Performance Graph Analytic Code using Spiral
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017
2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
Spiral overview paper
2014
- Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
High Assurance SPIRAL
Proc. SPIE, Proceedings of SPIE 2014, 2014
2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
SPIRAL program generation system for transforms
2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
2007
- Paolo D'Alberto, Markus Püschel and Franz Franchetti
Performance/Energy Optimization of DSP Transforms on the XScale Processor
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Marek Telgarsky, James C. Hoe and José M. F. Moura
Spiral: Joint Runtime and Energy Optimization of Linear Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
2005
- José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
2003
- Markus Püschel and José M. F. Moura
SPIRAL: An Overview
Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
2002
- Markus Püschel and José M. F. Moura
Generation and Manipulation of DSP Transform Algorithms
Proc. Digital Signal Processing Workshop, 2002
- Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Short Vector SIMD Code Generation for DSP Algorithms
Proc. High Performance Extreme Computing (HPEC), 2002
2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
SPL compiler: Translating math into code
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Yevgen Voronenko and Markus Püschel
Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
2004
- Yevgen Voronenko and Markus Püschel
Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation and Adaptation for DSP Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
2002
- Franz Franchetti and Markus Püschel
A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
2001
- Jianxin Xiong
Automatic Optimization of DSP Algorithms
PhD. thesis, Computer Science, University of Illinois at Urbana-Champaign, 2001, Also as Technical Report UIUCDCS-R-2001-224, University of Illinois
- Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
SPL: A Language and Compiler for DSP Algorithms
Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
Split fabrication
2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
SpMV
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
SpMV Operation
2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
Staging
2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
State prediction
2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
Static analysis
2025
- N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025
Statistics
2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
Stencil computations
2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
Stereo vision
2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
Streaming permutations
2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
2010
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
2009
- Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
- Markus Püschel, Peter A. Milder and James C. Hoe
Permuting Streaming Data Using RAMs
Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
Stress-strain analysis
2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
FFTX for Micromechanical Stress-Strain Analysis
IEEE High Performance Extreme Computing Conference (HPEC), 2019
Structured matrices
2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
2016
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
Submodular functions
2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
Synthesis
2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
Synthetic aperture radar
2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
System-on-chip
2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
Three-dimensional
2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
Tomography
2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
Transform
2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
2003
- Franz Franchetti
Performance Portable Short Vector Transforms
PhD. thesis, Vienna University of Technology, 2003
Triangle counting
2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
Tutorial
2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
How To Write Fast Numerical Code: A Small Introduction
Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
Twiddle factor generation
2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Vector
2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Verification
2020
- Vadim Zaliva
HELIX: From Math to Verified Code
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
2019
- Vadim Zaliva and Matthieu Sozeau
Reification of Shallow-Embedded DSLs in Coq with Automated Verification
Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019
Verified compilation
2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
Walsh-Hadamard transform
2008
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Jeremy Johnson and Michael Andrews
Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
2007
- Jeremy Johnson and Michael Andrews
Performance Analysis of a Family of WHT Algorithms
Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium , IEEE, pp. 450, 2007
2006
- Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
Theoretical Computer Science, Vol. 352, pp. 8-30, 2006
2005
- Mihai Furis, Pawel Hitczenko and Jeremy Johnson
Cache Miss Analysis of WHT Algorithms
Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
2004
- Jeremy Johnson and Kang Chen
A Self-Adapting Distributed Memory Package for Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
2002
- Kang Chen
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Master thesis, Computer Science, Drexel University, 2002
- Kang Chen and Jeremy Johnson
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
- Bryan Singer and Manuela Veloso
Automating the Modeling and Optimization of the Performance of Signal Transforms
IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
2001
- Neungsoo Park and Viktor K. Prasanna
Cache Conscious Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001
2000
- Jeremy Johnson and Markus Püschel
In Search of the Optimal Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
Wavelet Transform
2001
- Peter Becker
A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
Master thesis, Electrical and Computer Engineering, Drexel University, 2001
(No keyword)
2016
- Richard Veras, Tze-Meng Low and Franz Franchetti
A Scale-Free Structure for Power-Law Graphs
Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
2015
- Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Embedded Processors
Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015
- H. V. Koops and Franz Franchetti
An Ensemble Technique for Estimating Vehicle Speed and Gear Position from Acoustic Data
Proc. International Conference on Digital Signal Processing (DSP), 2015
2014
- Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
- Vadim Zaliva and Franz Franchetti
Barometric and GPS Altitude Sensor Fusion
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
2013
- Benjamin Hess
Automatic Refactoring: Locality Friendly Interface Enhancements for Numerical Functions
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Fast Fourier Transform on FPGA: Design Choices and Evaluation
Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Spiral: Generating Signal Processing Kernels for New Commodity Architectures
Proc. EDGE Workshop, pp. D49-D50, 2006
2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
- Franz Franchetti
Top Performance in Signal Processing
Proc. International Workshop on Numerical and Symbolic Scientific Computing, 2003
2002
- Markus Püschel, Sebastian Egner and Thomas Beth
AREP
in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002