Publications

Filtered as: all types - all years - all authors - all keywords
Sorted by: keyword - year

Algebraic signal processing theory 

2009 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009

2006 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006

Algorithm theory and analysis 

2009 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on "Signal Processing on Platforms with Multiple Cores", Vol. 26, No. 6, pp. 90-102, 2009

2007 

  1. Jeremy Johnson and Xu Xu
    Generating Symmetric DFTs and Equivariant FFT Algorithms
    Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007

2006 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
  2. Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
    Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
    Theoretical Computer Science, Vol. 352, pp. 8-30, 2006

2005 

  1. Mihai Furis, Pawel Hitczenko and Jeremy Johnson
    Cache Miss Analysis of WHT Algorithms
    Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005

2004 

  1. Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
    Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
    Proc. High Performance Embedded Computing (HPEC), 2004

2003 

  1. Anthony F. Breitzman
    Automatic Derivation and Implementation of Fast Convolution Algorithms
    PhD. thesis, Computer Science, Drexel University, 2003
  2. Mihai Furis
    Cache Miss Analysis of Walsh-Hadamard Transform Algorithms
    Master thesis, Computer Science, Drexel University, 2003
  3. Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
    Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
    Proc. Design Automation Conference (DAC), pp. 496-501, 2003

2002 

  1. Hung-Jen Huang
    Performance Analysis of an Adaptive Algorithm for the Walsh-Hadamard Transform
    Master thesis, Computer Science, Drexel University, 2002

2001 

  1. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  2. Sebastian Egner and Markus Püschel
    Automatic Generation of Fast Discrete Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001

Beyond transforms 

2010 

  1. Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Efficient Software Viterbi Decoders
    to appear in Proc. International Conference on High Performance Embedded Architectures & Compilers (HiPEAC), 2010

2009 

  1. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  2. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2008 

  1. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Adaptive Libraries for Matrix-Multiplication
    Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
  2. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Embedded Computing (HPEC), 2008
  3. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

Cell BE Processor 

2009 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Embedded Computing (HPEC), 2009

2008 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Supercomputing (SC), 2008, Poster (Abstract reviewed)
  2. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

Dimensionless FFT

2007 

  1. Pranab Shenoy
    Universal FFT Core Generator
    Master thesis, Computer Science, Drexel University, 2007

2000 

  1. Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
    Design, optimization, and implementation of a universal FFT processor
    Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000

Discrete/fast Fourier transform 

2010 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    submitted for publication
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    submitted for publication

2009 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  2. Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
    Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  4. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  5. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on "Signal Processing on Platforms with Multiple Cores", Vol. 26, No. 6, pp. 90-102, 2009
  6. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  7. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
  8. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Embedded Computing (HPEC), 2009

2008 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Supercomputing (SC), 2008, Poster (Abstract reviewed)
  2. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  4. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  5. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Embedded Computing (HPEC), 2008
  6. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  7. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
  8. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  9. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008

2007 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  2. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  3. Jeremy Johnson and Xu Xu
    Generating Symmetric DFTs and Equivariant FFT Algorithms
    Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
  4. Franz Franchetti and Markus Püschel
    SIMD Vectorization of Non-Two-Power Sized FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
  5. Pranab Shenoy
    Universal FFT Core Generator
    Master thesis, Computer Science, Drexel University, 2007

2006 

  1. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes in Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  2. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Embedded Computing (HPEC), 2006
  3. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006
  4. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006

2005 

  1. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005

2004 

  1. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
  2. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Embedded Computing (HPEC), 2004
  3. Nicholas Rizzolo and David Padua
    HiLO: High Level Optimization of FFTs
    Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004

2003 

  1. Xu Xu
    A Recursive Implementation of the Dimensionless FFT
    Master thesis, Computer Science, Drexel University, 2003
  2. Jeremy Johnson and Xu Xu
    A Recursive Implementation of the Dimensionless FFT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003
  3. Franz Franchetti and Markus Püschel
    Short Vector Code Generation for the Discrete Fourier Transform
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), 2003

2002 

  1. Mike Balog
    A Flexible Framework for Implementing FFT Processors
    Master thesis, Electrical and Computer Engineering, Drexel University, 2002

2001 

  1. Pinit Kumhom
    Design, Optimization, and Implementation of a Universal FFT Processor
    PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001

2000 

  1. Gavin Haentjens
    An Investigation of Recursive FFT Implementations
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
  2. Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
    Dynamic Data Layouts for Cache-conscious Factorization of DFT
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
  3. David Sepiashvili
    Performance Models and Search Methods for Optimal FFT Implementations
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000

Distributed Memory 

2009 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Embedded Computing (HPEC), 2009

Fast algorithms 

2010 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    submitted for publication

2009 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on "Signal Processing on Platforms with Multiple Cores", Vol. 26, No. 6, pp. 90-102, 2009
  3. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009

2006 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006

Filtering/Convolution/Wavelet transforms 

2008 

  1. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008

2004 

  1. Aca Gacic, Markus Püschel and José M. F. Moura
    Automatically Generated High-Performance Code for Discrete Wavelet Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
  2. Anthony F. Breitzman and Jeremy Johnson
    Automatic Derivation and Implementation of Fast Convolution Algorithms
    Journal of High Performance Computing and Applications, special issue on "Computer Algebra and Signal Processing", Vol. 37, No. 2, pp. 157-186, 2004
  3. Aca Gacic
    Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004

2003 

  1. Aca Gacic, Markus Püschel and José M. F. Moura
    Fast Automatic Implementations of FIR Filters
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003

General size libraries 

2010 

  1. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    submitted for publication

2009 

  1. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
  2. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009

2008 

  1. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Embedded Computing (HPEC), 2008
  2. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
  3. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008

IP cores for FPGA/ASIC 

2010 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    submitted for publication

2009 

  1. Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
    Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
  3. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  4. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

2007 

  1. Sung-Chul Han
    A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  4. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  5. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  6. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
  7. Pranab Shenoy
    Universal FFT Core Generator
    Master thesis, Computer Science, Drexel University, 2007

2006 

  1. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006

2005 

  1. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005

2004 

  1. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
  2. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Embedded Computing (HPEC), 2004
  3. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004

2002 

  1. Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
    Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
    Proc. High Performance Embedded Computing (HPEC), 2002

2001 

  1. Peter Becker
    A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
    Master thesis, Electrical and Computer Engineering, Drexel University, 2001
  2. Pinit Kumhom
    Design, Optimization, and Implementation of a Universal FFT Processor
    PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001

2000 

  1. Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
    Design, optimization, and implementation of a universal FFT processor
    Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000

LDPC codes 

2007 

  1. Sung-Chul Han
    A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007

Learn the current Spiral system 

2009 

  1. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on "Signal Processing on Platforms with Multiple Cores", Vol. 26, No. 6, pp. 90-102, 2009
  3. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  3. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Embedded Computing (HPEC), 2008
  4. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  5. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
  6. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  7. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Embedded Computing (HPEC), 2008

2007 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  2. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  3. Yevgen Voronenko and Markus Püschel
    Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
    IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
  4. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures & Compilers (HiPEAC), 2007

2006 

  1. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  2. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes in Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  3. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006

2005 

  1. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005
  3. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", Vol. 93, No. 2, pp. 232-275, 2005

2004 

  1. Aca Gacic
    Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004

Multibuffering 

2009 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Embedded Computing (HPEC), 2009

Multidimensional DFT 

2007 

  1. Jeremy Johnson and Xu Xu
    Generating Symmetric DFTs and Equivariant FFT Algorithms
    Proc. ACM International Symposium on Symbolic and Algebraic Computation (ISSAC), ACM, pp. 195-202, 2007
  2. Pranab Shenoy
    Universal FFT Core Generator
    Master thesis, Computer Science, Drexel University, 2007

2000 

  1. Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
    Design, optimization, and implementation of a universal FFT processor
    Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000

Multiplier block 

2007 

  1. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  2. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007

2004 

  1. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004

Multiplierless 

2007 

  1. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  2. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007

2004 

  1. Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
    Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
  2. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
  3. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004

2003 

  1. Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
    Custom Reduction of Arithmetic in Linear DSP Transforms
    Proc. High Performance Embedded Computing (HPEC), 2003
  2. Smarahara Misra
    Custom Reduction of Arithmetic in Multiplierless implementations of DSP Transforms
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2003

Multithreading 

2009 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  3. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  4. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  3. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008

2006 

  1. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes in Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006

2004 

  1. Jeremy Johnson and Kang Chen
    A Self-Adapting Distributed Memory Package for Fast Signal Transforms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004

2002 

  1. Kang Chen
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Master thesis, Computer Science, Drexel University, 2002
  2. Kang Chen and Jeremy Johnson
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002

Numerical kernels we consider 

2010 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    submitted for publication

2009 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Embedded Computing (HPEC), 2009
  2. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  3. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2007 

  1. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007

2006 

  1. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006

2005 

  1. Roland Wunderlich, Markus Püschel and James C. Hoe
    Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
    Proc. High Performance Embedded Computing (HPEC), 2005
  2. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
  3. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232-275, 2005

2004 

  1. Xiaoming Li, María J. Garzarán and David Padua
    A Dynamically Tuned Sorting Library
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
  2. Aca Gacic
    Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
    PhD thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004

OFDM 

2009 

  1. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    to appear in Proc. European Conference on Optical Communication (ECOC), 2009
  2. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Parallel Processing 

2009 

  1. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Embedded Computing (HPEC), 2009
  3. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  4. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2002 

  1. Kang Chen
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Master thesis, Computer Science, Drexel University, 2002

Performance Model 

2009 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Embedded Computing (HPEC), 2009
  2. Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
    Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009

2008 

  1. Jeremy Johnson and Michael Andrews
    Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

2007 

  1. Jeremy Johnson and Michael Andrews
    Performance Analysis of a Family of WHT Algorithms
    Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium, IEEE, pp. 450, 2007

Search/Learning for optimization 

2010 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    submitted for publication
  2. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    submitted for publication

2009 

  1. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009

2002 

  1. Bryan Singer and Manuela Veloso
    Automating the Modeling and Optimization of the Performance of Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
  2. Bryan Singer and Manuela Veloso
    Learning to Construct Fast Signal Processing Implementations
    Journal of Machine Learning Research, special issue on ``the Eighteenth International Conference on Machine Learning (ICML 2001)'', Vol. 3, pp. 887-919, 2002

2001 

  1. Bryan Singer
    Automating the Modeling and Optimization of the Performance of Signal Processing Algorithms
    PhD thesis, Computer Science, Carnegie Mellon University, 2001
  2. Bryan Singer and Manuela Veloso
    Learning to Generate Fast Signal Processing Implementations
    Proc. International Conference on Machine Learning (ICML), pp. 529-536, 2001
  3. Bryan Singer and Manuela Veloso
    Stochastic Search for Signal Processing Algorithm Optimization
    Proc. Supercomputing (SC), pp. 22, 2001

2000 

  1. Pinit Kumhom, Jeremy Johnson and Prawat Nagvajara
    Design, optimization, and implementation of a universal FFT processor
    Proc. IEEE ASIC/SOC Conference, IEEE, pp. 182-186, 2000
  2. Bryan Singer and Manuela Veloso
    Learning to Predict Performance from Formula Modeling and Training Data
    Proc. International Conference on Machine Learning (ICML), pp. 887-894, 2000

SIMD vectorization 

2010 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    submitted for publication

2009 

  1. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Embedded Computing (HPEC), 2009
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  3. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  4. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  5. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  3. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008
  4. Jeremy Johnson and Michael Andrews
    Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

2007 

  1. Franz Franchetti and Markus Püschel
    SIMD Vectorization of Non-Two-Power Sized FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007

2006 

  1. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  2. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006

2003 

  1. Franz Franchetti and Markus Püschel
    Short Vector Code Generation and Adaptation for DSP Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
  2. Franz Franchetti and Markus Püschel
    Short Vector Code Generation for the Discrete Fourier Transform
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), 2003

2002 

  1. Franz Franchetti and Markus Püschel
    A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002

Sorting 

2005 

  1. Xiaoming Li, María J. Garzarán and David Padua
    Optimizing Sorting with Genetic Algorithm
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005

2004 

  1. Xiaoming Li, María J. Garzarán and David Padua
    A Dynamically Tuned Sorting Library
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004

Sparse Linear Algebra 

2008 

  1. Jeremy Johnson, Tim Chagnon, Petya Vachranukunkiet, Prawat Nagvajara and Chika Nwankpa
    Sparse LU Decomposition using FPGA
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

Spiral overview paper 

2009 

  1. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  2. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Embedded Computing (HPEC), 2008
  3. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Embedded Computing (HPEC), 2008

2005 

  1. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232-275, 2005

2004 

  1. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2001 

  1. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes in Computer Science, Springer, Vol. 2073, pp. 97-106, 2001

SPIRAL program generation system for transforms 

2009 

  1. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  3. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicore
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  4. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  3. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008

2007 

  1. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures & Compilers (HiPEAC), 2007

2006 

  1. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  3. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
  4. Marek Telgarsky, James C. Hoe and José M. F. Moura
    Spiral: Joint Runtime and Energy Optimization of Linear Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006

2005 

  1. José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
    Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
  2. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232-275, 2005

2004 

  1. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004

2003 

  1. Markus Püschel and José M. F. Moura
    SPIRAL: An Overview
    Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003

2002 

  1. Markus Püschel and José M. F. Moura
    Generation and Manipulation of DSP Transform Algorithms
    Proc. Digital Signal Processing Workshop, 2002
  2. Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Short Vector SIMD Code Generation for DSP Algorithms
    Proc. High Performance Embedded Computing (HPEC), 2002

2001 

  1. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes in Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
  2. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Embedded Computing (HPEC), 2001

2000 

  1. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Embedded Computing (HPEC), 2000

SPL compiler: Translating math into code 

2008 

  1. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008

2007 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  2. Yevgen Voronenko and Markus Püschel
    Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
    IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007

2005 

  1. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005

2004 

  1. Yevgen Voronenko and Markus Püschel
    Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004

2003 

  1. Franz Franchetti and Markus Püschel
    Short Vector Code Generation and Adaptation for DSP Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003

2002 

  1. Franz Franchetti and Markus Püschel
    A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002

2001 

  1. Jianxin Xiong
    Automatic Optimization of DSP Algorithms
    PhD thesis, Computer Science, University of Illinois at Urbana-Champaign, 2001; also available as Technical Report UIUCDCS-R-2001-224, University of Illinois
  2. Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
    SPL: A Language and Compiler for DSP Algorithms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001

Streaming permutations 

2009 

  1. Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
    Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
  2. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009

Tutorial 

2008 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    How To Write Fast Numerical Code: A Small Introduction
    Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008

Walsh-Hadamard transform 

2008 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Jeremy Johnson and Michael Andrews
    Statistical Evaluation of a Self-Tuning Vectorized Library for the Walsh-Hadamard Transform
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

2007 

  1. Jeremy Johnson and Michael Andrews
    Performance Analysis of a Family of WHT Algorithms
    Proc. Workshop on Performance Optimization for High-Level Languages and Libraries (POHLL), 2007 IEEE International Parallel and Distributed Processing Symposium, IEEE, pp. 450, 2007

2006 

  1. Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
    Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
    Theoretical Computer Science, Vol. 352, pp. 8-30, 2006

2005 

  1. Mihai Furis, Pawel Hitczenko and Jeremy Johnson
    Cache Miss Analysis of WHT Algorithms
    Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005

2004 

  1. Jeremy Johnson and Kang Chen
    A Self-Adapting Distributed Memory Package for Fast Signal Transforms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004

2002 

  1. Kang Chen
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Master thesis, Computer Science, Drexel University, 2002
  2. Kang Chen and Jeremy Johnson
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
  3. Bryan Singer and Manuela Veloso
    Automating the Modeling and Optimization of the Performance of Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002

2001 

  1. Neungsoo Park and Viktor K. Prasanna
    Cache Conscious Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001

2000 

  1. Jeremy Johnson and Markus Püschel
    In Search of the Optimal Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000

Wavelet Transform 

2001 

  1. Peter Becker
    A High Speed VLSI Architecture for the Discrete Haar Wavelet Transform
    Master thesis, Electrical and Computer Engineering, Drexel University, 2001

(No keyword) 

2002 

  1. Markus Püschel, Sebastian Egner and Thomas Beth
    AREP
    in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, Springer, 2002
Publication interface designed and implemented by Patra Pantupat, Aliaksei Sandryhaila, and Markus Püschel
Electrical and Computer Engineering, Carnegie Mellon University, 2007