Publications
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, Franz Franchetti and K. Mai
A 1.19GHz 9.52Gsamples/sec Radix-8 FFT Hardware Accelerator in 28nm
Proc. Hot Chips Symposium (HCS), 2024
- S. Fu, N. Zhang and Franz Franchetti
Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
- E. Tang, James C. Hoe and Franz Franchetti
Magic Memory: A Memory-Centric Declarative Programming Paradigm to Enable High Productivity on Heterogeneous Systems
Proc. SRC TECHCON, 2024
- T. Zhang, E. Tang, F. Siddique, K. Skadron and Franz Franchetti
Towards an End-to-End Processing-in-DRAM Acceleration of Spectral Library Search
Proc. High Performance Extreme Computing (HPEC), 2024
- Y. Eum, N. Zhang, L. Tang and Franz Franchetti
Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2024
- Sanil Rao, A. Prakash and Franz Franchetti
Towards LibraryX: A Framework for Cross-Library Call Optimization
Proc. High Performance Extreme Computing (HPEC), 2024
- G. Xu, Franz Franchetti and James C. Hoe
An Approach to Generating Customized Load-Store Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2023
- Sanil Rao, M. A. H. Monil, H. Mankad, J. Vetter and Franz Franchetti
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Proc. Supercomputing (SC), 2023
- B. Van Straalen, P. Colella, Franz Franchetti and P. Broderick
FFTX: Next-Generation Open-Source Software for Fast Fourier Transforms
to appear in Proc. Exascale Computing Project Annual Meeting (ECPAM), 2023
- N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- N. Zhang and Franz Franchetti
Generating Number Theoretic Transforms for Multi-Word Integer Data Types
Proc. International Symposium on Code Generation and Optimization (CGO), 2023
- D. Sun, N. Zhang and Franz Franchetti
Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring Processing Unit
Cryptology ePrint Archive, 2023
- D. Soni, N. Neda, N. Zhang, B. Reynwar, B. Heyman, M. Nabeel, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
RPU: The Ring-Processing Unit
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023
- D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
arXiv, 2023
- P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
Twiddle Factor Generation for a Vectorized Number Theoretic Transform
Proc. IEEE High Performance Extreme Computing (HPEC), 2023
- Joao Rivera, Franz Franchetti and Markus Püschel
A Compiler for Sound Floating-Point Computations using Affine Arithmetic
Proc. International Symposium on Code Generation and Optimization (CGO), 2022
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
A Framework for Low Communication Approaches for Large Scale 3D Convolution
Proc. International Conference on Parallel Processing (ICPP), 2022
- L. Tang, S. Chen, K. Harisrikanth, G. Xu, K. Mai and Franz Franchetti
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- P. Oostema and Franz Franchetti
Approximating Manifolds and Geodesics with Curved Surfaces
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, Sanil Rao, J. Moreira and Franz Franchetti
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
Proc. IEEE High Performance Extreme Computing (HPEC), 2022
- Z. Gong, N. Zhu, M. Ngaw, Joao Rivera, L. Tang, E. Tang, H. Mankad and Franz Franchetti
Interval Arithmetic-based FFT for Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- J. Nguyen, M. Cai, Z. Zuo, L. Tang, K. Mai and Franz Franchetti
LIMA: Hardware for FFT based Large Integer Multiplication
Proc. High Performance Extreme Computing (HPEC), 2022
- E. Tang and Franz Franchetti
Magic Memory: A Programming Model For Big Data Analytics
Proc. High Performance Extreme Computing (HPEC), 2022
- P. Oostema and Franz Franchetti
On Manifold Graph Embedding
Proc. High Performance Extreme Computing (HPEC), 2022
- H. Mankad, Sanil Rao, P. Colella, B. Van Straalen and Franz Franchetti
ProtoX: A First Look
Proc. High Performance Extreme Computing (HPEC), 2022
- N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Proc. High Performance Extreme Computing (HPEC), 2022
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
Adaptation of A64 Scalable Vector Extension for Spiral
Proc. High Performance Computing Conference (HPCC), Japan, 2021
- N. Kitai, Daisuke Takahashi, Franz Franchetti, T. Katagiri, S. Ohshima and T. Nagai
An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2021
- Joao Rivera, Franz Franchetti and Markus Püschel
An Interval Compiler for Sound Floating Point Computations
Proc. International Symposium on Code Generation and Optimization (CGO), 2021
- Mark Blanco, S. McMillan and Tze-Meng Low
Delayed Asynchronous Iterative Graph Algorithms
Proc. High Performance Extreme Computing (HPEC), 2021
- P. Oostema and Franz Franchetti
Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Proc. IEEE Workshop on Parallel / Distributed Combinatorics and Optimization (PDCO), 2021
- Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva and Steve Zdancewic
Modular, Compositional, and Executable Formal Semantics for LLVM IR
Proc. International Conference on Functional Programming (ICFP), 2021
- Scott Mionis, Franz Franchetti and J. Larkin
Optimized Quantum Circuit Generation with SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2021
- Jiyuan Zhang
Accelerating the Motifs of Machine Learning Applications on Modern Processors
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Multi-Dimensional DFTs
SIAM Journal on Scientific Computing (SISC), Software and High-Performance Computing, 2020
- Anuva Kulkarni
An Approach for Large-Scale Three-Dimensional FFT-Based Approximate Convolutions on GPUs
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Jiyuan Zhang, Yi Lu, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and Efficient Set Intersection Approach on Modern CPUs
Proc. IEEE International Conference on Data Engineering (ICDE), 2020
- Jiyuan Zhang, Daniele G. Spampinato and Franz Franchetti
FESIA: A Fast and SIMD-Efficient Set Intersection Approach on Modern CPUs
North East Database Day (NEDB), 2020, Poster
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solver Libraries for Exascale: FFTX and SpectralPack
Exascale Computing Project (ECP) Annual Meeting, 2020, Poster
- Daisuke Takahashi and Franz Franchetti
FFTE on SVE: SPIRAL-Generated Kernels
Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), pp. 114-122, 2020
- Sanil Rao, A. Kutuluru, Paul Brouwer, S. McMillan and Franz Franchetti
GBTLX: A First Look
Proc. IEEE High Performance Extreme Computing (HPEC), 2020
- Vadim Zaliva
HELIX: From Math to Verified Code
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2020
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Massive Scaling of MASSIF: Algorithm Development and Analysis for Simulation on GPUs
Proc. Platform for Advanced Scientific Computing (PASC), Article 13, pp. 1-10, 2020
- Scott Mionis, Franz Franchetti and J. Larkin
Quantum Circuit Optimization with SPIRAL: A First Look
Proc. Supercomputing (SC), 2020
- Mark Blanco, S. McMillan and Tze-Meng Low
Towards an Objective Metric for the Performance of Exact Triangle Count
Proc. High Performance Extreme Computing (HPEC), 2020
- Vadim Zaliva, Ilia Zaichuk and Franz Franchetti
Verified Translation Between Purely Functional and Imperative Domain Specific Languages in HELIX
Proc. Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), 2020
- Thom Popovici, Martin Schatz, Franz Franchetti and Tze-Meng Low
A Flexible Framework for Parallel Multi-Dimensional DFTs
arXiv (Technical Report), 2019
- Anuva Kulkarni, Jelena Kovacevic and Franz Franchetti
Algorithm Design at Scale: Porting Parallel FFT-based Fortran Simulations to GPUs
SIAM Conference on Computational Science and Engineering (CSE), 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
Design and Specification of Large-scale Simulations for GPUs using FFTX
Supercomputing, 2019
- F. Sadi, Joe Sweeney, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Efficient SpMV Operation for Large and Highly Sparse Matrices Using Scalable Multi-way Merge Parallelization
Proc. MICRO, 2019
- Mark Blanco, Tze-Meng Low and Kyungjoo Kim
Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU
Proc. High Performance Extreme Computing (HPEC), 2019
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Tze-Meng Low, M. Franusich, Thom Popovici, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFT and Solvers for Exascale: FFTX and SpectralPACK
Exascale Computing Project (ECP) Annual Meeting, 2019
- Anuva Kulkarni, Daniele G. Spampinato and Franz Franchetti
FFTX for Micromechanical Stress-Strain Analysis
IEEE High Performance Extreme Computing Conference (HPEC), 2019
- Vadim Zaliva and Matthieu Sozeau
Reification of Shallow-Embedded DSLs in Coq with Automated Verification
Proc. International Workshop on Coq for Programming Languages (CoqPL), 2019
- Yoko Franchetti, Thomas Nolin and Franz Franchetti
Towards Precision Medicine: Simulation Based Parameter Estimation for Drug Metabolism
SIAM Conference on Computational Science and Engineering (CSE), 2019
- F. Sadi
Accelerating Sparse Matrix Kernels with Co-optimized Architecture
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
, 2018
- Anuva Kulkarni, Franz Franchetti and Jelena Kovacevic
Algorithm Design for Large Scale Parallel FFT-Based Simulations on Heterogeneous Platforms
Proc. High Performance Extreme Computing (HPEC), 2018
- Thom Popovici
An Approach to Specifying and Automatically Optimizing Fourier Transform Based Operations
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2018
- Vit Ruzicka and Franz Franchetti
Fast and Accurate Object Detection in High Resolution 4K and 8K Video Using GPUs
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
- Franz Franchetti, Daniele G. Spampinato, Anuva Kulkarni, Thom Popovici, Tze-Meng Low, M. Franusich, A. Canning, P. McCorquodale, B. Van Straalen and P. Colella
FFTX and SpectralPack: A First Look
Proc. IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2018
- Vadim Zaliva and Franz Franchetti
HELIX: A Case Study of a Formal Verification of High Performance Program Generation
Proc. Workshop on Functional High Performance Computing (FHPC), 2018
- Jiyuan Zhang, Franz Franchetti and Tze-Meng Low
High Performance Zero-Memory Overhead Direct Convolutions
Proc. International Conference on Machine Learning (ICML), 2018
- Thom Popovici, Tze-Meng Low and Franz Franchetti
Large Bandwidth-Efficient FFTs on Multicore and Multi-Socket Systems
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2018
- Tze-Meng Low, Daniele G. Spampinato, A. Kutuluru, U. Sridhar, Thom Popovici, Franz Franchetti and S. McMillan
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices
Proc. High Performance Extreme Computing (HPEC), 2018
- F. Sadi, Joe Sweeney, S. McMillan, Tze-Meng Low, James C. Hoe, Lawrence Pileggi and Franz Franchetti
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV
Proc. High Performance Extreme Computing (HPEC), 2018
- Jiyuan Zhang, Daniele G. Spampinato, S. McMillan and Franz Franchetti
Preliminary Exploration on Large-Scale Triangle Counting in Shared-Memory Multicore System
Proc. IEEE High Performance Extreme Computing (HPEC), 2018
- Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
Program Generation for Small-Scale Linear Algebra Applications
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
- Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
SPIRAL: Extreme Performance Portability
Proceedings of the IEEE, special issue on "From High Level Specification to High Performance Code", Vol. 106, No. 11, 2018
- Matthias Bolten, Franz Franchetti, P. H. J. Kelly, Christian Lengauer and Marcus Mohr
Algebraic Description and Automatic Generation of Multigrid Methods in SPIRAL
Concurrency and Computation: Practice and Experience, 2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
Algorithm and Hardware Co-Optimized Solution for Large SpMV Problems
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Daniele G. Spampinato
A Linear Algebra Compiler for Small Problem Sizes
PhD. thesis, Computer Science, ETH Zurich, Switzerland, 2017
- Richard Veras and Franz Franchetti
A Scale-free Structure for Real World Networks
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- Tze-Meng Low, Varun Rao, Matthew Lee, Thom Popovici, Franz Franchetti and S. McMillan
First Look: Linear Algebra-based Triangle Counting Without Matrix Multiplication
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-6, 2017
- Tze-Meng Low and Franz Franchetti
High Assurance Code Generation for Cyber-Physical Systems
Proc. IEEE International Symposium on High Assurance Systems Engineering (HASE), 2017
- Franz Franchetti, Tze-Meng Low, Stefan Mitsch, Juan Pablo Mendoza, Liangyan Gui, Amarin Phaosawasdi, David Padua, Soummya Kar, José M. F. Moura, M. Franusich, Jeremy Johnson, André Platzer and Manuela Veloso
High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control
IEEE Control Systems Magazine, 2017
- Thom Popovici, Franz Franchetti and Tze-Meng Low
Mixed Data Layout Kernels for Vectorized Complex Arithmetic
Proc. High Performance Extreme Computing (HPEC), IEEE, pp. 1-7, 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Multirotor UAV State Prediction Through Multi-microphone Side-channel Fusion
Proc. Multisensor Fusion and Integration for Intelligent Systems (MFI), IEEE, pp. 15-21, 2017
- G. Xu, Tze-Meng Low, James C. Hoe and Franz Franchetti
Optimizing FFT Resource Efficiency of FPGA using High-Level Synthesis
High Performance Extreme Computing Conference (HPEC), 2017
- H. V. Koops, Kashish Garg, Munsung (Bill) Kim, Jonathan Li, Anja Volk and Franz Franchetti
Prediction of Quadcopter State through Multi-Microphone Side-Channel Fusion
Technical report UU-CS-2017-001, Dept. of Information and Computing Sciences, Utrecht University, 2017
- Georg Ofenbeck, Tiark Rompf and Markus Püschel
Staging for Generic Programming in Space and Time
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
- Richard Veras
The Automatic Generation of High-Performance Graph Analytic Code using Spiral
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2017
- F. Sadi, Lawrence Pileggi and Franz Franchetti
3D DRAM Based Application Specific Hardware Accelerator for SpMV
High Performance Extreme Computing Conference (HPEC), 2016
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Structured Matrices
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
- Qi Guo, Tianshi Chen, Y. Chen and Franz Franchetti
Accelerating Architectural Simulation Via Statistical Techniques: A Survey
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, No. 3, pp. 433-446, 2016
- Richard Veras, Tze-Meng Low and Franz Franchetti
A Scale-Free Structure for Power-Law Graphs
Proc. High Performance Extreme Computing (HPEC), pp. 1-7, 2016
- Joya Deri, Franz Franchetti and José M. F. Moura
Big Data Computation of Taxi Movement in New York City
Proc. IEEE International Conference on Big Data (Big Data), IEEE, pp. 2616-2625, 2016
- Richard Veras, Thom Popovici, Tze-Meng Low and Franz Franchetti
Compilers, Hands-Off My Hands-On Optimizations
Proc. Workshop on Programming Models for SIMD/Vector Programming (WPMVP), 2016
- Marcela Zuluaga, Andreas Krause and Markus Püschel
e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
- Berkin Akin, Franz Franchetti and James C. Hoe
HAMLeT Architecture for Parallel Data Reorganization in Memory
IEEE Micro, Vol. 36, No. 1, pp. 14-23, 2016
- J. Kepner, P. Aaltonen, D. Bader, A. Buluc, Franz Franchetti, J. Gilbert, D. Hutchison, M. Kumar, A. Lumsdaine, H. Meyerhenke, S. McMillan, J. Moreira, J. D. Owens, C. Yang, M. Zalewski and T. Mattson
Mathematical Foundations of the GraphBLAS
Proc. High Performance Extreme Computing (HPEC), 2016
- Francois Serre and Markus Püschel
Optimal Circuits for Streamed Linear Permutations using RAM
Proc. FPGA, pp. 215-223, 2016
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Streaming Sorting Networks
ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
- Jiyuan Zhang, Tze-Meng Low, Qi Guo and Franz Franchetti
A 3D-Stacked Memory Manycore Stencil Accelerator System
Proc. Workshop on Near Data Processing (WONDP), 2015
- Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler for Embedded Processors
Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015
- Berkin Akin
A Formal Approach to Memory Access Optimization: Data Layout, Reorganization, and Near-Data Processing
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2015
- H. V. Koops and Franz Franchetti
An Ensemble Technique for Estimating Vehicle Speed and Gear Position from Acoustic Data
Proc. International Conference on Digital Signal Processing (DSP), 2015
- H. E. Sumbul, K. Vaidyanathan, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
A Synthesis Methodology for Application-Specific Logic-in-Memory Designs
Proc. Design Automation Conference (DAC), 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
Data Reorganization in Memory Using 3D-stacked DRAM
Proc. International Symposium on Computer Architecture (ISCA), 2015
- Qi Guo, Tze-Meng Low, N. Alachiotis, Berkin Akin, Lawrence Pileggi, James C. Hoe and Franz Franchetti
Enabling Portable Energy Efficiency with Memory Accelerated Library
Proc. MICRO, 2015
- M. M. Sabry Aly, M. Gao, G. Hills, C.-S. Lee, G. Pitner, M. M. Shulaker, T. F. Wu, M. Asheghi, J. Bokor, Franz Franchetti, K. E. Goodson, C. Kozyrakis, I. Markov, K. Olukotun, Lawrence Pileggi, E. Pop, J. Rabaey, C. Re, H.-S. Wong and S. Mitra
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x
Computer, Vol. 48, No. 12, pp. 24-33, 2015
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts: Algorithm, Architecture and Design Automation
Journal of Signal Processing Systems, 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. Workshop on Compilers for Parallel Computing (CPC), 2015
- Thom Popovici, F. Russell, K. Wilkinson, C-K. Skylaris, P. H. J. Kelly and Franz Franchetti
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015
- Tze-Meng Low, Qi Guo and Franz Franchetti
Optimizing Space Time Adaptive Processing Through Accelerating Memory-Bounded Operations
Proc. High Performance Extreme Computing (HPEC), 2015
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, R. Suter, Franz Franchetti and Anthony Rollett
Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
- Qi Guo, N. Alachiotis, Berkin Akin, F. Sadi, G. Xu, Tze-Meng Low, Lawrence Pileggi, James C. Hoe and Franz Franchetti
3D-Stacked Memory-Side Acceleration: Accelerator and System Design
Proc. Workshop on Near Data Processing (WONDP), 2014
- Daniele G. Spampinato and Markus Püschel
A Basic Linear Algebra Compiler
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
- Nikolaos Kyrtatas
A Basic Linear Algebra Compiler for Embedded Processors
Master thesis, Computer Science, ETH Zurich, Switzerland, 2014
- Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
- Tao Cui, R. Yang, Gabriela Hug and Franz Franchetti
Accelerated AC Contingency Calculation on Commodity Multi-core SIMD CPUs
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2014
- F. Sadi, Berkin Akin, Thom Popovici, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
Applying the Roofline Model
Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76-85, 2014
- B. Duff, J. Larkin, M. Franusich and Franz Franchetti
Automatic Generation of 3-D FFTs
submitted for publication
- Benjamin Hess, Thomas Gross and Markus Püschel
Automatic Locality-Friendly Interface Extension of Numerical Functions
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
- Vadim Zaliva and Franz Franchetti
Barometric and GPS Altitude Sensor Fusion
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- K. Vaidyanathan, R. Liu, H. E. Sumbul, Qiuling Zhu, Franz Franchetti and Lawrence Pileggi
Efficient and Secure Intellectual Property (IP) Design for Split Fabrication
Proc. IEEE International Symposium on Hardware-Oriented Security and Trust (HOST), 2014
- Victoria Caparrós Cabezas and Markus Püschel
Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
- T. Ozturk, Thom Popovici, C. Stein, R. Pokharel, Franz Franchetti and Anthony Rollett
Fast Fourier Transform Based Mechanical Behavior Formulation: Optimized Implementation and Sensitivity Analysis of the Method
Proc. Materials Science & Technology, 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
FFTs with Near-Optimal Memory Access Through Block Data Layouts
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
- Berkin Akin, James C. Hoe and Franz Franchetti
HAMLeT: Hardware Accelerated Memory Layout Transform within 3D-stacked DRAM
Proc. IEEE High Performance Extreme Computing (HPEC), 2014
- Franz Franchetti, Aliaksei Sandryhaila and Jeremy Johnson
High Assurance SPIRAL
Proc. SPIE, 2014
- Jörn Schumacher and Markus Püschel
High-performance sparse fast Fourier transforms
Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Language Support for the Construction of High Performance Code Generators
Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
- T. Ozturk, C. Stein, R. Pokharel, Thom Popovici, Franz Franchetti, R. Suter and Anthony Rollett
Performance Evaluation, Algorithm Optimization and Sensitivity Analysis of the Spectral Full-Field Deformation Modeling of Polycrystalline Materials
submitted for publication
- Lingchuan Meng and Jeremy Johnson
Towards parallel general-size library generation for polynomial multiplication
ACM Communications in Computer Algebra, 2014
- Berkin Akin, Franz Franchetti and James C. Hoe
Understanding the Design Space of DRAM-optimized Hardware FFT Accelerators
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 248-255, 2014
- Qiuling Zhu, Berkin Akin, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
Proc. IEEE International 3D Systems Integration Conference (3DIC), pp. 1-7, 2013
- Qiuling Zhu, H. E. Sumbul, F. Sadi, James C. Hoe, Lawrence Pileggi and Franz Franchetti
Accelerating Sparse Matrix-Matrix Multiplication with 3D-Stacked Logic-in-Memory Hardware
Proc. High Performance Extreme Computing (HPEC), pp. 1-6, 2013
- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
Active Learning for Multi-Objective Optimization
Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
- Q. Li, Tao Cui, Yang Weng, Rohit Negi, Franz Franchetti and Marija Ilic
An Information-Theoretic Approach to PMU Placement in Electric Power Systems
IEEE Transactions on Smart Grid, Vol. 4, No. 1, pp. 446-456, 2013
- Qiuling Zhu
Application Specific Logic in Memory
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Tao Cui and Franz Franchetti
A Quasi-Monte Carlo Approach for Radial Distribution System Probabilistic Load Flow
Proc. IEEE Innovative Smart Grid Technology Conference, 2013
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
A Smart Memory Accelerated Computed Tomography Parallel Backprojection
in From Algorithms to Circuits and System-on-Chip Design (VLSI-SoC 2012), Eds. A. Berg, A. Coskun, M. Guthaus, S. Katkoori, R. Reis, pp. 21-44, Springer, Berlin, Heidelberg 2013
- Tao Cui and Franz Franchetti
A Software Performance Engineering Approach to Fast Transmission Probabilistic Load Flow
IEEE PES General Meeting, 2013
- Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam and P. Sadayappan
A Stencil Compiler for Short-Vector SIMD Architectures
Proc. ACM International Conference on Supercomputing, pp. 13-24, 2013
- Lingchuan Meng and Jeremy Johnson
Automatic Parallel Library Generation for General-Size Modular FFT Algorithms
Proc. International Workshop on Computer Algebra in Scientific Computing, Springer, pp. 243-256, 2013
- Benjamin Hess
Automatic Refactoring: Locality Friendly Interface Enhancements for Numerical Functions
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
- Jörn Schumacher
High Performance Sparse Fast Fourier Transform
Master thesis, Computer Science, ETH Zurich, Switzerland, 2013
- Marcela Zuluaga, Andreas Krause and Markus Püschel
Multi-Objective Optimization for High-Level Synthesis
Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
- Tao Cui and Franz Franchetti
Power System Probabilistic and Security Analysis on Commodity High Performance Computing Systems
Proc. High Performance Computing, Networking and Analytics for the Power Grid (HiPCNA-PG), 2013
- Tao Cui
Power System Probabilistic and Security Analysis Using Commodity High Performance Computing Systems
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2013
- Cory Thoma, Tao Cui and Franz Franchetti
Privacy Preserving Smart Meter System Based Retail Level Electricity Market
Proc. IEEE Power and Energy Society General Meeting (PES-GM), 2013
- Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
- H. E. Sumbul, A. Patterson, A. Tazzoli, G. Feeder, Franz Franchetti, G. Piazza and Lawrence Pileggi
Trusted Split-Fabrication System-on-Chip Design Technology and Methodology
Proc. Government Applications & Critical Technology Conference (GOMACTech), 2013
- Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet and P. Sadayappan
When Polyhedral Transformations Meet SIMD Code Generation
Proc. Programming Languages Design and Implementation (PLDI), Vol. 48, pp. 127-138, 2013
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Algorithm and Architecture Optimization for Large Size Two Dimensional Discrete Fourier Transform
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2012
- Tao Cui and Franz Franchetti
A Multi-Core High Performance Computing Framework for Probabilistic Solutions of Distribution Systems
Proc. IEEE Power and Energy Society General Meeting (PES-GM), pp. 1-6, 2012
- C. Angelopoulos, Franz Franchetti and Markus Püschel
Automatic Generation of FFT Libraries for GPUs
NVIDIA Research Summit at the GPU Technology Conference, 2012
- Franz Franchetti, Yevgen Voronenko and G. Almasi
Automatic Generation of the HPC Challenge's Global FFT Benchmark for BlueGene/P
Proc. High Performance Computing for Computational Science (VECPAR), 2012
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Computer Generation of Hardware for Linear Digital Signal Processing Transforms
ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
- Marcela Zuluaga, Peter A. Milder and Markus Püschel
Computer Generation of Streaming Sorting Networks
Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Cost-Effective Smart Memory Implementation for Parallel Backprojection in Computed Tomography
Proc. IFIP/IEEE International Conference on Very Large Scale Integration, pp. 111-116, 2012
- Qiuling Zhu, K. Vaidyanathan, O. Shacham, M. Horowitz, Lawrence Pileggi and Franz Franchetti
Design Automation Framework for Application-Specific Logic-in-Memory Blocks
Proc. IEEE International Conference on Application-Specific Systems, Architectures and Processors (ASAP), pp. 125-132, 2012
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Highly Efficient Performance Portable Tracking of Evolving Surfaces
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 296-307, 2012
- Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Local Interpolation-based Polar Format SAR: Algorithm, Hardware Implementation and Design Automation
Journal of Signal Processing Systems, 2012
- Berkin Akin, Peter A. Milder, Franz Franchetti and James C. Hoe
Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2012
- Tao Cui and Franz Franchetti
Optimized Parallel Distribution Load Flow Solver on Commodity Multi-core CPU
Proc. IEEE High Performance Extreme Computing (HPEC), 2012
- Qiuling Zhu, Christian Berger, E. L. Turner, Lawrence Pileggi and Franz Franchetti
Polar Format Synthetic Aperture Radar in Energy Efficient Application-Specific Logic-in-Memory
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1557-1560, 2012
- Cory Thoma, Tao Cui and Franz Franchetti
Secure Multiparty Computation Based Privacy Preserving Smart Metering System
Proc. North American Power Symposium (NAPS), pp. 1-6, 2012
- Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
"Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128, 2012
- Qiuling Zhu, Lawrence Pileggi and Franz Franchetti
Smart Memory Synthesis for Energy-Efficient Computed Tomography Reconstruction
Proc. SRC TECHCON, 2012
- Tao Cui and Franz Franchetti
A Monte Carlo Framework for Probabilistic Distribution Power Flow
Carnegie Mellon Conference on the Electricity Industry, 2011
- Tao Cui and Franz Franchetti
A Multi-core High Performance Computing Framework for Distribution Power Flow
Proc. North American Power Symposium (NAPS), 2011
- Qiuling Zhu, E. L. Turner, Christian Berger, Lawrence Pileggi and Franz Franchetti
Application-Specific Logic-in-Memory for Polar Format Synthetic Aperture Radar
Proc. High Performance Extreme Computing (HPEC), 2011
- Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
Proc. International Conference on Supercomputing (ICS), 2011
- Tao Cui and Franz Franchetti
Autotuning a Random Walk Boolean Satisfiability Solver
Proc. International Workshop on Automatic Performance Tuning (iWAPT), 2011
- Tom Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam and P. Sadayappan
Data Layout Transformation for Stencil Computations on Short SIMD Architectures
Proc. International Conference on Compiler Construction (CC), 2011
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
- Franz Franchetti and Markus Püschel
Fast Fourier Transform
in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
- W. Yu, Franz Franchetti, James C. Hoe, José M. F. Moura and Tsuhan Chen
Performance Portable Tracking of Evolving Surfaces
Proc. High Performance Extreme Computing (HPEC), 2011
- Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696, 2011
- Markus Püschel, Franz Franchetti and Yevgen Voronenko
Spiral
in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
- Peter A. Milder
A Mathematical Approach for Compiling and Optimizing Hardware Implementations of DSP Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Efficient Software Viterbi Decoders
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
- Srinivas Chellappa
Computer Generation of Fourier Transform Libraries for Distributed Memory Architectures
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
Computer Generation of Platform-Adapted Physical Layer Software
Proc. Software Defined Radio (SDR), 2010
- Douglas F. Jones
Data Pump Architecture Simulator and Performance Model
Master thesis, Computer Science, Drexel University, 2010
- Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
- C. Angelopoulos, Franz Franchetti and Markus Püschel
DFT Transform on the Fermi (GTX480): Automatic Program Generation
NVIDIA Research Summit at the GPU Technology Conference, 2010
- W. Yu, Franz Franchetti, James C. Hoe and Tsuhan Chen
Fast and Robust Active Contours for Image Segmentation
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- W. Yu, Franz Franchetti, James C. Hoe, Y.-J. Chang and Tsuhan Chen
Fast Bilateral Filtering By Adapting Block Size
Proc. IEEE International Conference on Image Processing (ICIP), 2010
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
- W. Yu, Tsuhan Chen, Franz Franchetti and James C. Hoe
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), Vol. 20, No. 11, pp. 1509-1519, 2010
- Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
Offline Library Adaptation Using Automatically Generated Heuristics
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
- Q. Li, Tao Cui, Rohit Negi, Franz Franchetti and Marija Ilic
On-line Decentralized Charging of Plug-In Electric Vehicles in Power Systems
, 2010
- Frédéric de Mesmay
On the Computer Generation of Adaptive Numerical Libraries
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
Optical OFDM for the Data Center
Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
- W. Yu
Performance Portable Tracking of Evolving Surfaces
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244, 2010
- Lingchuan Meng, Jeremy Johnson, Franz Franchetti, Yevgen Voronenko, Marc Moreno Maza and Yuzhen Xie
Spiral-Generated Modular FFT Algorithms
Proc. Parallel Symbolic Computation (PASCO), pp. 169-170, 2010
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
- Yevgen Voronenko and Markus Püschel
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
- Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
- Daniel McFarlin, Franz Franchetti and Markus Püschel
Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
Proc. High Performance Extreme Computing (HPEC), 2009
- Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
- Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
- Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
Computer Generation of General Size Linear Transform Libraries
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
- Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
IEEE Signal Processing Magazine, special issue on "Signal Processing on Platforms with Multiple Cores", Vol. 26, No. 6, pp. 90-102, 2009
- Franz Franchetti and Markus Püschel
Generating High-Performance Pruned FFT Implementations
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
- Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
High Performance Linear Transform Program Generation for the Cell BE
Proc. High Performance Extreme Computing (HPEC), 2009
- Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
- Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
Operator Language: A Program Generation Framework for Fast Kernels
Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
- Markus Püschel, Peter A. Milder and James C. Hoe
Permuting Streaming Data Using RAMs
Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
- Markus Püschel and José M. F. Moura
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Adaptive Libraries for Matrix-Multiplication
Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
- Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
Automatic Linear Transform Program Generation for the Cell BE
Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
- Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
Domain-Specific Library Generation for Parallel Software and Hardware Platforms
Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
FFT Program Generation for the Cell BE
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Formal Datapath Representation and Manipulation for Implementing DSP Transforms
Proc. Design Automation Conference (DAC), pp. 385-390, 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
Generating High-Performance General Size Linear Transform Libraries Using Spiral
Proc. High Performance Extreme Computing (HPEC), 2008
- Franz Franchetti and Markus Püschel
Generating SIMD Vectorized Permutations
Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
- Srinivas Chellappa, Franz Franchetti and Markus Püschel
How To Write Fast Numerical Code: A Small Introduction
Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008
- Yevgen Voronenko
Library Generation for Linear Transforms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Linear Transforms: From Math to Efficient Hardware
Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
- Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
Program Generation with Spiral: Beyond Transforms
Proc. High Performance Extreme Computing (HPEC), 2008
- Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
Sparse LU Decomposition using FPGA
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Jeremy Johnson and Michael Andrews
Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
- Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
Proc. Algebraic Methodology and Software Technology (AMAST), 2008
- Sung-Chul Han
A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
Fast Fourier Transform on FPGA: Design Choices and Evaluation
Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
- Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
FFT Compiler: From Math to Efficient Hardware
Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
- Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
Generating FPGA Accelerated DFT Libraries
Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
- Jeremy Johnson and Xu Xu
Generating Symmetric DFTs and Equivariant FFT Algorithms
Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
- Yevgen Voronenko and Markus Püschel
Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
- Yevgen Voronenko and Markus Püschel
Multiplierless Multiple Constant Multiplication
ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
- Jeremy Johnson and Michael Andrews
Performance Analysis of a Family of WHT Algorithms
Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium, IEEE, pp. 450, 2007
- Paolo D'Alberto, Markus Püschel and Franz Franchetti
Performance/Energy Optimization of DSP Transforms on the XScale Processor
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
- Franz Franchetti and Markus Püschel
SIMD Vectorization of Non-Two-Power Sized FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Time-Multiplexed Multiple Constant Multiplication
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
- Pranab Shenoy
Universal FFT Core Generator
Master thesis, Computer Science, Drexel University, 2007
- Yevgen Voronenko and Markus Püschel
Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
A Rewriting System for the Vectorization of Signal Transforms
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
- Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
- Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
Proc. High Performance Extreme Computing (HPEC), 2006
- Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
Theoretical Computer Science, Vol. 352, pp. 8-30, 2006
- Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
Proc. FPGA, pp. 211-220, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
FFT Program Generation for Shared Memory: SMP and Multicore
Proc. Supercomputing (SC), 2006
- F. Gygi, E. W. Draeger, M. Schulz, B. R. de Supinski, J. A. Gunnels, V. Austel, J. C. Sexton, Franz Franchetti, Stefan Kral, Christoph W. Ueberhuber and Juergen Lorenz
Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform
Proc. Supercomputing (SC), 2006
- Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Parallelism in Spiral
Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
- Sung-Chul Han, Franz Franchetti and Markus Püschel
Program Generation for the All-Pairs Shortest Path Problem
Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Spiral: Generating Signal Processing Kernels for New Commodity Architectures
Proc. EDGE Workshop, pp. D49-D50, 2006
- Marek Telgarsky, James C. Hoe and José M. F. Moura
Spiral: Joint Runtime and Energy Optimization of Linear Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
- Roland Wunderlich, Markus Püschel and James C. Hoe
Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
Proc. High Performance Extreme Computing (HPEC), 2005
- Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
Automatic Generation of Customized Discrete Fourier Transform IPs
Proc. Design Automation Conference (DAC), pp. 471-474, 2005
- Mihai Furis, Pawel Hitczenko and Jeremy Johnson
Cache Miss Analysis of WHT Algorithms
Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
- Franz Franchetti, Stefan Kral, Juergen Lorenz and Christoph W. Ueberhuber
Efficient Utilization of SIMD Extensions
Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", Vol. 93, No. 2, pp. 409-425, 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Formal Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
- F. Gygi, E. W. Draeger, B. R. de Supinski, R. K. Yates, Franz Franchetti, Stefan Kral, Juergen Lorenz, Christoph W. Ueberhuber, J. A. Gunnels and J. C. Sexton
Large-Scale First-Principles Molecular Dynamics Simulations on the BlueGene/L Platform Using the Qbox Code
Proc. Supercomputing (SC), 2005
- Franz Franchetti, Yevgen Voronenko and Markus Püschel
Loop Merging for Signal Transforms
Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
- Xiaoming Li, María J. Garzarán and David Padua
Optimizing Sorting with Genetic Algorithm
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
- Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
- José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", Vol. 93, No. 2, pp. 211-215, 2005
- Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
SPIRAL: Code Generation for DSP Transforms
Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", Vol. 93, No. 2, pp. 232-275, 2005
- Juergen Lorenz, Stefan Kral, Franz Franchetti and Christoph W. Ueberhuber
Vectorization Techniques for the BlueGene/L Double FPU
IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 437-446, 2005
- Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
Proc. High Performance Extreme Computing (HPEC), 2004
- Xiaoming Li, María J. Garzarán and David Padua
A Dynamically Tuned Sorting Library
Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
- Jeremy Johnson and Kang Chen
A Self-Adapting Distributed Memory Package for Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
- Aca Gacic, Markus Püschel and José M. F. Moura
Automatically Generated High-Performance Code for Discrete Wavelet Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
- Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
Automatically Tuned FFTs for BlueGene/L’s Double FPU
Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
- Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
- Anthony F. Breitzman and Jeremy Johnson
Automatic Derivation and Implementation of Fast Convolution Algorithms
Journal of High Performance Computing and Applications, special issue on "Computer Algebra and Signal Processing", Vol. 37, No. 2, pp. 157-186, 2004
- Yevgen Voronenko and Markus Püschel
Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
- Aca Gacic
Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004
- Markus Püschel, Adam C. Zelinski and James C. Hoe
Custom-Optimized Multiplierless Implementations of DSP Algorithms
Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
- Grace Nordin, James C. Hoe and Markus Püschel
Discrete Fourier Transform IP Generator
Proc. High Performance Extreme Computing (HPEC), 2004
- Nicholas Rizzolo and David Padua
HiLO: High Level Optimization of FFTs
Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004
- Peter Tummeltshammer, James C. Hoe and Markus Püschel
Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
Proc. Design Automation Conference (DAC), pp. 826-829, 2004
- Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
Journal of High Performance Computing and Applications, special issue on "Automatic Performance Tuning", Vol. 18, No. 1, pp. 21-45, 2004
- Franz Franchetti
A Portable Short Vector Version of FFTW
Proc. IMACS Symposium on Mathematical Modelling (MATHMOD), Vol. 2, pp. 1539-1548, 2003
- Xu Xu
A Recursive Implementation of the Dimensionless FFT
Master thesis, Computer Science, Drexel University, 2003
- Jeremy Johnson and Xu Xu
A Recursive Implementation of the Dimensionless FFT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003
- Anthony F. Breitzman
Automatic Derivation and Implementation of Fast Convolution Algorithms
PhD. thesis, Computer Science, Drexel University, 2003
- Mihai Furis
Cache Miss Analysis of Walsh-Hadamard Transform Algorithms
Master thesis, Computer Science, Drexel University, 2003
- Markus Püschel
Cooley-Tukey FFT like Algorithms for the DCT
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
- Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
Custom Reduction of Arithmetic in Linear DSP Transforms
Proc. High Performance Extreme Computing (HPEC), 2003
- Smarahara Misra
Custom Reduction of Arithmetic in Multiplierless Implementations of DSP Transforms
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2003
- Aca Gacic, Markus Püschel and José M. F. Moura
Fast Automatic Implementations of FIR Filters
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
- T. Fahringer, Franz Franchetti, M. Geissler, G. Madsen, H. Moritsch and R. Prodan
On Using ZENTURIO for Performance and Parameter Studies on Clusters and Grids
Proc. Euromicro Conference on Parallel Distributed and Network-based Processing (Euro PDP), pp. 185-192, 2003
- Franz Franchetti
Performance Portable Short Vector Transforms
PhD. thesis, Vienna University of Technology, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation and Adaptation for DSP Algorithms
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
- Franz Franchetti and Markus Püschel
Short Vector Code Generation for the Discrete Fourier Transform
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
- Stefan Kral, Franz Franchetti, Juergen Lorenz and Christoph W. Ueberhuber
SIMD Vectorization of Straight Line Code
Proc. Euro-Par Conference on Parallel and Distributed Computing, LNCS 2985, pp. 251-260, 2003
- Markus Püschel and José M. F. Moura
SPIRAL: An Overview
Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
- Markus Püschel and José M. F. Moura
The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
SIAM Journal on Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
- Franz Franchetti
Top Performance in Signal Processing
Proc. International Workshop on Numerical and Symbolic Scientific Computing, 2003
- Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
Proc. Design Automation Conference (DAC), pp. 496-501, 2003
- Mike Balog
A Flexible Framework for Implementing FFT Processors
Master thesis, Electrical and Computer Engineering, Drexel University, 2002
- Kang Chen
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Master thesis, Computer Science, Drexel University, 2002
- Kang Chen and Jeremy Johnson
A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
- Markus Püschel, Sebastian Egner and Thomas Beth
AREP
in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, Springer 2002
- Franz Franchetti and Markus Püschel
A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002
- Bryan Singer and Manuela Veloso
Automating the Modeling and Optimization of the Performance of Signal Transforms
IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
- Franz Franchetti, F. Kaltenberger and Christoph W. Ueberhuber
FFT Kernels with FMA Utilization
Proc. APLIMAT Conference, pp. 333-339, 2002
- Markus Püschel and José M. F. Moura
Generation and Manipulation of DSP Transform Algorithms
Proc. Digital Signal Processing Workshop, 2002
- Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
Proc. High Performance Extreme Computing (HPEC), 2002
- Bryan Singer and Manuela Veloso
Learning to Construct Fast Signal Processing Implementations
Journal of Machine Learning Research, special issue on "the Eighteenth International Conference on Machine Learning (ICML 2001)", Vol. 3, pp. 887-919, 2002
- Hung-Jen Huang
Performance Analysis of an Adaptive Algorithm for the Walsh-Hadamard Transform
Master thesis, Computer Science, Drexel University, 2002
- Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
Short Vector SIMD Code Generation for DSP Algorithms
Proc. High Performance Extreme Computing (HPEC), 2002
- Peter Becker
A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
Master thesis, Electrical and Computer Engineering, Drexel University, 2001
- Franz Franchetti, H. Karner, Stefan Kral and Christoph W. Ueberhuber
Architecture Independent Short Vector FFTs
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 1109-1112, 2001
- Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
Automatic Derivation and Implementation of Signal Processing Algorithms
ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
- Sebastian Egner and Markus Püschel
Automatic Generation of Fast Discrete Signal Transforms
IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001
- Jianxin Xiong
Automatic Optimization of DSP Algorithms
PhD. thesis, Computer Science, University of Illinois at Urbana-Champaign, 2001, Also as Technical Report UIUCDCS-R-2001-224, University of Illinois
- Bryan Singer
Automating the Modeling and Optimization of the Performance of Signal Processing Algorithms
PhD. thesis, Computer Science, Carnegie Mellon University, 2001
- Neungsoo Park and Viktor K. Prasanna
Cache Conscious Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001
- Pinit Kumhom
Design, Optimization, and Implementation of a Universal FFT Processor
PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001
- Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
Fast Automatic Generation of DSP Algorithms
Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
Generating Platform-Adapted DSP Libraries using SPIRAL
Proc. High Performance Extreme Computing (HPEC), 2001
- Bryan Singer and Manuela Veloso
Learning to Generate Fast Signal Processing Implementations
Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
- Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
SPL: A Language and Compiler for DSP Algorithms
Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
- Bryan Singer and Manuela Veloso
Stochastic Search for Signal Processing Algorithm Optimization
Proc. Supercomputing (SC), pp. 22, 2001
- Gavin Haentjens
An Investigation of Recursive FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
- Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
Design, Optimization, and Implementation of a Universal FFT Processor
Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
- Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
Dynamic Data Layouts for Cache-conscious Factorization of DFT
Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
- Jeremy Johnson and Markus Püschel
In Search of the Optimal Walsh-Hadamard Transform
Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
- Bryan Singer and Manuela Veloso
Learning to Predict Performance from Formula Modeling and Training Data
Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000
- David Sepiashvili
Performance Models and Search Methods for Optimal FFT Implementations
Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
- Franz Franchetti
Short Vector FFTs
Master thesis, Vienna University of Technology, 2000
- José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
SPIRAL: Automatic Implementation of Signal Processing Algorithms
Proc. High Performance Extreme Computing (HPEC), 2000