Publications
Filtered as:
all types
- all years
- all authors
- all keywords
Sorted by:
year - type
Corresponding
bibtex list
2024
Conference (fully reviewed)
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
Conference (abstract reviewed)
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
- Y. Eum, N. Zhang, L. Tang and Franz Franchetti
Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
2023
Conference (fully reviewed)
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Conference (abstract reviewed)
- N. Zhang and Franz Franchetti
Generating Number Theoretic Transforms for Multi-Word Integer Data Types
Proc. International Symposium on Code Generation and Optimization (CGO), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Thesis
- G. Xu, Franz Franchetti and James C. Hoe
An Approach to Generating Customized Load-Store Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
Other
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
2022
Conference (fully reviewed)
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
Conference (abstract reviewed)
- P. Oostema and Franz Franchetti
Approximating Manifolds and Geodesics with Curved Surfaces
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- E. Tang and Franz Franchetti
Magic Memory: A Programming Model For Big Data Analytics
Proc. High Performance Extreme Computing (HPEC), 2022
- P. Oostema and Franz Franchetti
On Manifold Graph Embedding
Proc. High Performance Extreme Computing (HPEC), 2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
2021
Conference (fully reviewed)
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
- Joao Rivera, Franz Franchetti and Markus Püschel
An Interval Compiler for Sound Floating Point Computations
Proc. International Symposium on Code Generation and Optimization (CGO), 2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
- P. Oostema and Franz Franchetti
Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVMIR
Proc. International Conference on Functional Programming (ICFP), 2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
2020
Journal
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Multi-Dimensional DFTs
SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
Conference (fully reviewed)
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
- Daisuke Takahashi and Franz Franchetti
FFTE on SVE: SPIRAL-Generated Kernels
Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1 - 10, 2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
Thesis
- Jiyuan Zhang
Accelerating the Motifs of Machine Learning Applications on Modern Processors
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Vadim Zaliva
HELIX: From Math to Verified Code
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
Other
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
2019
Conference (fully reviewed)
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
- Vadim Zaliva and Matthieu Sozeau
Reification of Shallow-Embedded DSLs in Coq with Automated Verification
Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019
Other
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
FFTX for Micromechanical Stress-Strain Analysis
IEEE High Performance Extreme Computing Conference (HPEC), 2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
2018
Journal
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018
Conference (fully reviewed)
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
- Vit Ruzicka and Franz Franchetti
Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Vadim Zaliva and Franz Franchetti
HELIX: A Case Study of a Formal Verification of High Performance Program Generation
Proc. Workshop on Functional High Performance Computing (FHPC), 2018
- Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
High Performance Zero-Memory Overhead Direct Convolutions
Proc. International Conference on Machine Learning (ICML), 2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
Thesis
- F. Sadi
Accelerating Sparse Matrix Kernels with Co-optimized Architecture
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- Thom Popovici
An Approach to Specifying and Automatically Optimizing Fourier Transform Based Operations
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
Other
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
2017
Journal
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, Andre' Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
Conference (fully reviewed)
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Richard Veras and Franz Franchetti
A Scale-free Structure for Real World Networks
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
- Tze-Meng Low and Franz Franchetti
High Assurance Code Generation for Cyber-Physical Systems
Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
- Thom Popovici, Franz Franchetti and Tze-Meng Low
Mixed Data Layout Kernels for Vectorized Complex Arithmetic
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
Thesis
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
- Richard Veras
The Automatic Generation of High-Performance Graph Analytic Code using Spiral
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017
Other
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
2016
Journal
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
- Marcela Zuluaga, Andreas Krause and Markus Püschel
e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
Conference (fully reviewed)
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
- Richard Veras, Tze-Meng Low and Franz Franchetti
A Scale-Free Structure for Power-Law Graphs
Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
- Joya Deri, Franz Franchetti and José M. F. Moura
Big Data Computation of Taxi Movement in New York City
Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
Conference (abstract reviewed)
- Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
Compilers, Hands-Off My Hands-On Optimizations
Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
Other
- F. Sadi, Lawrence Pileggi and Franz Franchetti
3D DRAM Based Application Specific Hardware Accelerator for SpMV
High Performance Extreme Computing Conference (HPEC), 2016
2015
Journal
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukoton, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
Journal of Signal Processing Systems, 2015
Conference (fully reviewed)
- Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Embedded Processors
Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015
- H. V. Koops and Franz Franchetti
An Ensemble Technique for Estimating Vehicle Speed and Gear Position from Acoustic Data
Proc. International Conference on Digital Signal Processing (DSP), 2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architectur (ISCA), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
Conference (abstract reviewed)
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Thesis
- Berkin Akin
A Formal Approach to Memory Access Optimization: Data Layout, Reorganization, and Near-Data Processing
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2015
2014
Journal
- Lingchuan Meng and Jeremy Johnson
Towards parallel general-size library generation for polynomial multiplication
ACM Communications in Computer Algebra, 2014
Conference (fully reviewed)
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
- Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
- Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
- Vadim Zaliva and Franz Franchetti
Barometric and GPS Altitude Sensor Fusion
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
- Victoria Caparrós Cabezas and Markus Püschel
Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
High Assurance SPIRAL
Proc. SPIE, Proceedings of SPIE 2014, 2014
- Jörn Schumacher and Markus Püschel
High-performance sparse fast Fourier transforms
Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
Conference (abstract reviewed)
- B. Duff, J. Larkin, M. Franusich and Franz Franchetti
Automatic Generation of 3-D FFTs
submitted for publication
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
Thesis
- Nikolaos Kyrtatas
A Basic Linear Algebra Compiler for Embedded Processors
Master thesis, Computer Science, ETH Zurich, Switzerland, 2014
2013
Journal
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
Book Chapter
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
Conference (fully reviewed)
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
Active Learning for Multi-Objective Optimization
Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing , pp. 13-24, 2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
Conference (abstract reviewed)
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Marcela Zuluaga, Andreas Krause and Markus Püschel
Multi-Objective Optimization for High-Level Synthesis
Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
Thesis
- Qiuling Zhu
Application Specific Logic in Memory
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Benjamin Hess
Automatic Refactoring: Locality Friendly Interface Enhancements for Numerical Functions
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
- Jörn Schumacher
High Performance Sparse Fast Fourier Transform
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
- Tao Cui
Power System Probabilistic and Security Analysis Using Commodity High Performance Computing Systems
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
Other
- Tao Cui and Franz Franchetti
A Software Performance Engineering Aproach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
2012
Journal
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
Conference (fully reviewed)
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenges Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Computer Generation of Streaming Sorting Networks
Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE Internationa Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framwork for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
Other
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
2011
Journal
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
Book Chapter
- Franz Franchetti and Markus Püschel
Fast Fourier Transform
in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
Conference (fully reviewed)
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
- Tao Cui and Franz Franchetti
Autotuning a Random Walk Boolean Satisfiability Solver
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
Conference (abstract reviewed)
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
Other
- Tao Cui and Franz Franchetti
A Monte Carlo Framework for Probabilistic Distribution Power Flow
Carnegie Mellon Conference on the Electricity Industry, 2011
2010
Journal
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
Conference (fully reviewed)
- Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Efficient Software Viterbi Decoders
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
Conference (abstract reviewed)
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
Thesis
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Douglas F. Jones
Data Pump Architecture Simulator and Performance Model
Master thesis, Computer Science, Drexel University, 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- W. Yu
Performance Portable Tracking of Evolving Surfaces
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
Other
- C. Angelopoulos, Franz Franchetti and Markus Püschel
DFT Transform on the Fermi (GTX480): Automatic Program Generation
NVIDIA Research Summit at the GPU Technology Conference, 2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
2009
Journal
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
- Markus Püschel, Peter A. Milder and James C. Hoe
Permuting Streaming Data Using RAMs
Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
Conference (fully reviewed)
- Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
- Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
Conference (abstract reviewed)
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
2008
Journal
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
Conference (fully reviewed)
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
How To Write Fast Numerical Code: A Small Introduction
Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
Conference (abstract reviewed)
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Jeremy Johnson and Michael Andrews
Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
Thesis
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
Other
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
2007
Journal
- Yevgen Voronenko and Markus Püschel
Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
Conference (fully reviewed)
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Jeremy Johnson and Xu Xu
Generating Symmetric DFTs and Equivariant FFT Algorithms
Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
- Jeremy Johnson and Michael Andrews
Performance Analysis of a Family of WHT Algorithms
Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium , IEEE, pp. 450, 2007
- Paolo D'Alberto, Markus Püschel and Franz Franchetti
Performance/Energy Optimization of DSP Transforms on the XScale Processor
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
- Franz Franchetti and Markus Püschel
SIMD Vectorization of Non-Two-Power Sized FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
Conference (abstract reviewed)
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Fast Fourier Transform on FPGA: Design Choices and Evaluation
Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
Thesis
- Sung-Chul Han
A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007
- Pranab Shenoy
Universal FFT Core Generator
Master thesis, Computer Science, Drexel University, 2007
Other
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
2006
Journal
- Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
Theoretical Computer Science, Vol. 352, pp. 8-30, 2006
Conference (fully reviewed)
- Yevgen Voronenko and Markus Püschel
Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
- Sung-Chul Han, Franz Franchetti and Markus Püschel
Program Generation for the All-Pairs Shortest Path Problem
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
- Marek Telgarsky, James C. Hoe and José M. F. Moura
Spiral: Joint Runtime and Energy Optimization of Linear Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
Conference (abstract reviewed)
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Spiral: Generating Signal Processing Kernels for New Commodity Architectures
Proc. EDGE Workshop, pp. D49-D50, 2006
2005
Journal
- Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
Efficient Utilization of SIMD Extensions
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 409-425, 2005
- José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
- Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
Vectorization Techniques for the BlueGene/L Double FPU
IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005
Conference (fully reviewed)
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
- Mihai Furis, Pawel Hitczenko and Jeremy Johnson
Cache Miss Analysis of WHT Algorithms
Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
- Xiaoming Li, María J. Garzarán and David Padua
Optimizing Sorting with Genetic Algorithm
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
Conference (abstract reviewed)
- Roland Wunderlich, Markus Püschel and James C. Hoe
Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
Proc. High Performance Extreme Computing (HPEC), 2005
2004
Journal
- Anthony F. Breitzman and Jeremy Johnson
Automatic Derivation and Implementation of Fast Convolution Algorithms
Journal of High Performance Computing and Applications, special issue on ``Computer Algebra and Signal Processing'', Vol. 37, No. 2, pp. 157-186, 2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
Conference (fully reviewed)
- Xiaoming Li, María J. Garzarán and David Padua
A Dynamically Tuned Sorting Library
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
- Jeremy Johnson and Kang Chen
A Self-Adapting Distributed Memory Package for Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
- Aca Gacic, Markus Püschel and José M. F. Moura
Automatically Generated High-Performance Code for Discrete Wavelet Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
- Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
- Yevgen Voronenko and Markus Püschel
Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
- Markus Püschel, Adam C. Zelinski and James C. Hoe
Custom-Optimized Multiplierless Implementations of DSP Algorithms
Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
- Nicholas Rizzolo and David Padua
HiLO: High Level Optimization of FFTs
Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
Conference (abstract reviewed)
- Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2004
- Grace Nordin, James C. Hoe and Markus Püschel
Discrete Fourier Transform IP Generator
Proc. High Performance Extreme Computing (HPEC), 2004
Thesis
- Aca Gacic
Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004
2003
Journal
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
Conference (fully reviewed)
- Jeremy Johnson and Xu Xu
A Recursive Implementation of the Dimensionless FFT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003
- Markus Püschel
Cooley-Tukey FFT like Algorithms for the DCT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
- Aca Gacic, Markus Püschel and José M. F. Moura
Fast Automatic Implementations of FIR Filters
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation and Adaptation for DSP Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation for the Discrete Fourier Transform
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
- Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
SIMD Vectorization of Straight Line Code
Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
- Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
Proc. Design Automation Conference (DAC), pp. 496-501, 2003
Conference (abstract reviewed)
- Franz Franchetti
A Portable Short Vector Version of FFTW
Proc. IMACS Symposium on Mathematical Modelling (MATHMOD), Vol. 2, pp. 1539-1548, 2003
- Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
Custom Reduction of Arithmetic in Linear DSP Transforms
Proc. High Performance Extreme Computing (HPEC), 2003
- Markus Püschel and José M. F. Moura
SPIRAL: An Overview
Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
- Franz Franchetti
Top Performance in Signal Processing
Proc. International Workshop on Numerical and Symbolic Scientific Computing, 2003
Thesis
- Xu Xu
A Recursive Implementation of the Dimensionless FFT
Master thesis, Computer Science, Drexel University, 2003
- Anthony F. Breitzman
Automatic Derivation and Implementation of Fast Convolution Algorithms
PhD. thesis, Computer Science, Drexel University, 2003
- Mihai Furis
Cache Miss Analysis of Walsh-Hadamard Transform Algorithms
Master thesis, Computer Science, Drexel University, 2003
- Smarahara Misra
Custom Reduction of Arithmetic in Multiplierless implementations of DSP Transforms
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2003
- Franz Franchetti
Performance Portable Short Vector Transforms
PhD. thesis, Vienna University of Technology, 2003
2002
Journal
- Bryan Singer and Manuela Veloso
Automating the Modeling and Optimization of the Performance of Signal Transforms
IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
- Bryan Singer and Manuela Veloso
Learning to Construct Fast Signal Processing Implementations
Journal of Machine Learning Research, special issue on ``the Eighteenth International Conference on Machine Learning (ICML 2001)'', Vol. 3, pp. 887-919, 2002
Book Chapter
- Markus Püschel, Sebastian Egner and Thomas Beth
AREP
in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002
Conference (fully reviewed)
- Kang Chen and Jeremy Johnson
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
- Franz Franchetti and Markus Püschel
A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
Conference (abstract reviewed)
- Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
FFT Kernels with FMA Utilization
Proc. APLIMAT Conference, pp. 333-339, 2002
- Markus Püschel and José M. F. Moura
Generation and Manipulation of DSP Transform Algorithms
Proc. Digital Signal Processing Workshop, 2002
- Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
Proc. High Performance Extreme Computing (HPEC), 2002
- Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Short Vector SIMD Code Generation for DSP Algorithms
Proc. High Performance Extreme Computing (HPEC), 2002
Thesis
- Mike Balog
A Flexible Framework for Implementing FFT Processors
Master thesis, Electrical and Computer Engineering, Drexel University, 2002
- Kang Chen
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Master thesis, Computer Science, Drexel University, 2002
- Hung-Jen Huang
Performance Analysis of an Adaptive Algorithm for the Walsh-Hadamard Transform
Master thesis, Computer Science, Drexel University, 2002
2001
Journal
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- Sebastian Egner and Markus Püschel
Automatic Generation of Fast Discrete Signal Transforms
IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001
Conference (fully reviewed)
- Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
Architecture Independent Short Vector FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001
- Neungsoo Park and Viktor K. Prasanna
Cache Conscious Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
- Bryan Singer and Manuela Veloso
Learning to Generate Fast Signal Processing Implementations
Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
- Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
SPL: A Language and Compiler for DSP Algorithms
Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
- Bryan Singer and Manuela Veloso
Stochastic Search for Signal Processing Algorithm Optimization
Proc. Supercomputing (SC), pp. 22, 2001
Conference (abstract reviewed)
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
Thesis
- Peter Becker
A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
Master thesis, Electrical and Computer Engineering, Drexel University, 2001
- Jianxin Xiong
Automatic Optimization of DSP Algorithms
PhD. thesis, Computer Science, University of Illinois at Urbana-Champaign, 2001, Also as Technical Report UIUCDCS-R-2001-224, University of Illinois
- Bryan Singer
Automating the Modeling and Optimization of the Performance of Signal Processing Algorithms
PhD. thesis, Computer Science, Carnegie Mellon University, 2001
- Pinit Kumhom
Design, Optimization, and Implementation of a Universal FFT Processor
PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001
2000
Conference (fully reviewed)
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, optimization, and implementation of a universal FFT processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
- Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
Dynamic Data Layouts for Cache-conscious Factorization of DFT
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
- Jeremy Johnson and Markus Püschel
In Search of the Optimal Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
- Bryan Singer and Manuela Veloso
Learning to Predict Performance from Formula Modeling and Training Data
Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000
Conference (abstract reviewed)
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000
Thesis
- Gavin Haentjens
An Investigation of Recursive FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
- David Sepiashvili
Performance Models and Search Methods for Optimal FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
- Franz Franchetti
Short Vector FFTs
Master thesis, Vienna University of Technology, 2000