Publications

Filtered as: all types - all years - author: Püschel, Markus - all keywords
Sorted by: keyword
Corresponding
bibtex list 

Affine arithmetic 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    A Compiler for Sound Floating-Point Computations using Affine Arithmetic
    Proc. International Symposium on Code Generation and Optimization (CGO), 2022

Algebraic 

  1. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Structured Matrices
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016

Algebraic signal processing theory 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  2. Markus Püschel and José M. F. Moura
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
    IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
  3. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
  4. Markus Püschel
    Cooley-Tukey FFT like Algorithms for the DCT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
  5. Markus Püschel and José M. F. Moura
    The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
    SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003

Algorithm theory and analysis 

  1. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Streaming Sorting Networks
    ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
  2. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  3. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  4. Markus Püschel and José M. F. Moura
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
    IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
  5. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
  6. Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
    Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2004
  7. Markus Püschel
    Cooley-Tukey FFT like Algorithms for the DCT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
  8. Markus Püschel and José M. F. Moura
    The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
    SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003
  9. Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
    Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
    Proc. Design Automation Conference (DAC), pp. 496-501, 2003
  10. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  11. Sebastian Egner and Markus Püschel
    Automatic Generation of Fast Discrete Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001

Automatic generation 

  1. C. Angelopoulos, Franz Franchetti and Markus Püschel
    Automatic Generation of FFT Libraries for GPUs
    NVIDIA Research Summit at the GPU Technology Conference, 2012

Beyond transforms 

  1. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
  2. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
  3. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
  4. Frédéric de Mesmay, Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Efficient Software Viterbi Decoders
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 5952, pp. 353-368, 2010
  5. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010
  6. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  7. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
  8. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Adaptive Libraries for Matrix-Multiplication
    Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
  9. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008

Bottleneck analysis 

  1. Victoria Caparrós Cabezas and Markus Püschel
    Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
    Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014

Cell BE Processor 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
  4. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  5. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008

Coding/Decoding 

  1. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

Compiler 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    A Compiler for Sound Floating-Point Computations using Affine Arithmetic
    Proc. International Symposium on Code Generation and Optimization (CGO), 2022
  2. Joao Rivera, Franz Franchetti and Markus Püschel
    An Interval Compiler for Sound Floating Point Computations
    Proc. International Symposium on Code Generation and Optimization (CGO), 2021
  3. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Structured Matrices
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016

Construction of Generators 

  1. Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Staging for Generic Programming in Space and Time
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
  2. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Language Support for the Construction of High Performance Code Generators
    Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
  3. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

Discrete/fast cosine transforms 

  1. Markus Püschel and José M. F. Moura
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
    IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
  2. Markus Püschel
    Cooley-Tukey FFT like Algorithms for the DCT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
  3. Markus Püschel and José M. F. Moura
    The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
    SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003

Discrete/fast Fourier transform 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Computer Generation of Hardware for Linear Digital Signal Processing Transforms
    ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
  2. Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    Proc. International Conference on Supercomputing (ICS), 2011
  3. Franz Franchetti and Markus Püschel
    Fast Fourier Transform
    in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
  4. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  5. C. Angelopoulos, Franz Franchetti and Markus Püschel
    DFT Transform on the Fermi (GTX480): Automatic Program Generation
    NVIDIA Research Summit at the GPU Technology Conference, 2010
  6. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
  7. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
  8. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  9. Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
    Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009
  10. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  11. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  12. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  13. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  14. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
  15. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009
  16. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
  17. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  18. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  19. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  20. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  21. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  22. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  23. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008
  24. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  25. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  26. Franz Franchetti and Markus Püschel
    SIMD Vectorization of Non-Two-Power Sized FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
  27. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  28. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Extreme Computing (HPEC), 2006
  29. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006
  30. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  31. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005
  32. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
  33. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
  34. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Extreme Computing (HPEC), 2004
  35. Franz Franchetti and Markus Püschel
    Short Vector Code Generation for the Discrete Fourier Transform
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003

Distributed memory 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009

Fast algorithms 

  1. Yevgen Voronenko and Markus Püschel
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Real DFTs
    IEEE Transactions on Signal Processing, Vol. 57, No. 1, pp. 205-222, 2009
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  3. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  4. Markus Püschel and José M. F. Moura
    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
    IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008
  5. Yevgen Voronenko and Markus Püschel
    Algebraic Derivation of General Radix Cooley-Tukey Algorithms for the Real Discrete Fourier Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006
  6. Markus Püschel
    Cooley-Tukey FFT like Algorithms for the DCT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 501-504, 2003
  7. Markus Püschel and José M. F. Moura
    The Algebraic Approach to the Discrete Cosine and Sine Transforms and their Fast Algorithms
    SIAM Journal of Computing, Vol. 32, No. 5, pp. 1280-1316, 2003

Fast Fourier Transform 

  1. Francois Serre and Markus Püschel
    Optimal Circuits for Streamed Linear Permutations using RAM
    Proc. FPGA, pp. 215-223, 2016
  2. Jörn Schumacher and Markus Püschel
    High-performance sparse fast Fourier transforms
    Proc. IEEE Workshop on Signal Processing Systems (SIPS), pp. 1-6, 2014
  3. C. Angelopoulos, Franz Franchetti and Markus Püschel
    Automatic Generation of FFT Libraries for GPUs
    NVIDIA Research Summit at the GPU Technology Conference, 2012

Filtering/Convolution/Wavelet transforms 

  1. Aca Gacic, Markus Püschel and José M. F. Moura
    Automatically Generated High-Performance Code for Discrete Wavelet Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
  2. Aca Gacic, Markus Püschel and José M. F. Moura
    Fast Automatic Implementations of FIR Filters
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003

Floating-point arithmetic 

  1. Joao Rivera, Franz Franchetti and Markus Püschel
    A Compiler for Sound Floating-Point Computations using Affine Arithmetic
    Proc. International Symposium on Code Generation and Optimization (CGO), 2022
  2. Joao Rivera, Franz Franchetti and Markus Püschel
    An Interval Compiler for Sound Floating Point Computations
    Proc. International Symposium on Code Generation and Optimization (CGO), 2021

General size libraries 

  1. C. Angelopoulos, Franz Franchetti and Markus Püschel
    Automatic Generation of FFT Libraries for GPUs
    NVIDIA Research Summit at the GPU Technology Conference, 2012
  2. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  3. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
  4. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
  5. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  6. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  7. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    Proc. Algebraic Methodology and Software Technology (AMAST), 2008

GPUs 

  1. C. Angelopoulos, Franz Franchetti and Markus Püschel
    Automatic Generation of FFT Libraries for GPUs
    NVIDIA Research Summit at the GPU Technology Conference, 2012
  2. C. Angelopoulos, Franz Franchetti and Markus Püschel
    DFT Transform on the Fermi (GTX480): Automatic Program Generation
    NVIDIA Research Summit at the GPU Technology Conference, 2010

Hardware 

  1. Francois Serre and Markus Püschel
    Optimal Circuits for Streamed Linear Permutations using RAM
    Proc. FPGA, pp. 215-223, 2016

High performance 

  1. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

IP cores for FPGA/ASIC 

  1. Francois Serre and Markus Püschel
    Optimal Circuits for Streamed Linear Permutations using RAM
    Proc. FPGA, pp. 215-223, 2016
  2. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Streaming Sorting Networks
    ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
  3. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Computer Generation of Hardware for Linear Digital Signal Processing Transforms
    ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
  4. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Computer Generation of Streaming Sorting Networks
    Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012
  5. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
  6. Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
    "Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
    Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
  7. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
  8. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  9. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Hardware Implementation of the Discrete Fourier Transform with Non-Power-of-Two Problem Size
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010
  10. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
  11. Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
    Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
  12. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009
  13. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009
  14. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  15. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  16. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  17. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007
  18. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  19. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  20. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  21. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
  22. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006
  23. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005
  24. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
  25. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Extreme Computing (HPEC), 2004
  26. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004
  27. Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
    Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
    Proc. High Performance Extreme Computing (HPEC), 2002

Learn the current Spiral system 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Computer Generation of Hardware for Linear Digital Signal Processing Transforms
    ACM Transactions on Design Automation of Electronic Systems, Vol. 17, No. 2, 2012
  2. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  3. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
  4. Yevgen Voronenko, Frédéric de Mesmay and Markus Püschel
    Computer Generation of General Size Linear Transform Libraries
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 102-113, 2009
  5. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  6. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
  7. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  8. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), pp. 385-390, 2008
  9. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  10. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  11. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  12. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008
  13. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  14. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173-184, 2007
  15. Yevgen Voronenko and Markus Püschel
    Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
    IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
  16. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
  17. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  18. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  19. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  20. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005
  21. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
  22. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

Linear algebra 

  1. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
  2. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Structured Matrices
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016
  3. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
  4. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
  5. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Adaptive Libraries for Matrix-Multiplication
    Parallel Matrix Algorithms and Applications (PMAA), 2008, Presentation (Abstract reviewed)
  6. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)

Locality optimization 

  1. Benjamin Hess, Thomas Gross and Markus Püschel
    Automatic Locality-Friendly Interface Extension of Numerical Functions
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014

Multibuffering 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009

Multiplier block 

  1. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  2. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
  3. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004

Multiplierless 

  1. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  2. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007
  3. Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
    Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
  4. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
  5. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004
  6. Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
    Custom Reduction of Arithmetic in Linear DSP Transforms
    Proc. High Performance Extreme Computing (HPEC), 2003

Multithreading 

  1. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  3. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  4. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  5. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  6. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  7. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  8. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  9. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006

Numerical kernels we consider 

  1. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
  2. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Extreme Computing (HPEC), 2009
  3. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  4. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
  5. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  6. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
  7. Roland Wunderlich, Markus Püschel and James C. Hoe
    Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
    Proc. High Performance Extreme Computing (HPEC), 2005
  8. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005
  9. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005

OFDM 

  1. Robert Koutsoyannis, Peter A. Milder, Christian Berger, Madeleine Glick, James C. Hoe and Markus Püschel
    Improving Fixed-Point Accuracy of FFT Cores in O-OFDM Systems
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012
  2. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design studies for ASIC implementations of 28 GS/s optical QPSK- and 16-QAM-OFDM transceivers
    Optics Express, Vol. 19, No. 21, pp. 20857-20864, 2011
  3. Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, Yannis Benlachtar, Christian Berger, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Design Studies for an ASIC Implementation of an Optical OFDM Transceiver
    Proc. European Conference on Optical Communication (ECOC), pp. 1-3, 2010
  4. Yannis Benlachtar, Rachid Bouziane, Robert I. Killey, Christian Berger, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel and Madeleine Glick
    Optical OFDM for the Data Center
    Proc. International Conference on Transparent Optical Networks (ICTON), pp. 1-4, 2010
  5. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Real-Time Digital Signal Processing for the Generation of Optical Orthogonal Frequency Division Multiplexed Signals
    IEEE Journal of Selected Topics in Quantum Electronics, Vol. 16, No. 5, pp. 1235-1244 , 2010
  6. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    21.4 GS/s Real-Time DSP-Based Optical OFDM Signal Generation and Transmission Over 1600 km of Uncompensated Fibre
    Proc. European Conference on Optical Communication (ECOC), pp. 1-2, 2009
  7. Yannis Benlachtar, Philip M. Watts, Rachid Bouziane, Peter A. Milder, Deepak Rangaraj, Anthony Cartolano, Robert Koutsoyannis, James C. Hoe, Markus Püschel, Madeleine Glick and Robert I. Killey
    Generation of Optical OFDM Signals Using 21.4 GS/s Real Time Digital Signal Processing
    Optics Express, Vol. 17, No. 20, pp. 17658-17668, 2009

Parallel processing 

  1. Franz Franchetti and Markus Püschel
    Fast Fourier Transform
    in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
  2. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  3. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010
  4. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  5. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    High Performance Linear Transform Program Generation for the Cell BE
    Proc. High Performance Extreme Computing (HPEC), 2009
  6. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  7. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009

Performance analysis 

  1. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014

Performance model 

  1. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
  2. Victoria Caparrós Cabezas and Markus Püschel
    Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
    Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014
  3. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Extreme Computing (HPEC), 2009
  4. Basilio B. Fraguela, Yevgen Voronenko and Markus Püschel
    Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 271-280, 2009

Portability 

  1. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

Roofline model 

  1. Georg Ofenbeck, Ruedi Steinmann, Victoria Caparrós Cabezas, Daniele G. Spampinato and Markus Püschel
    Applying the Roofline Model
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 76 - 85, 2014
  2. Victoria Caparrós Cabezas and Markus Püschel
    Extending the Roofline Model: Bottleneck Analysis with Microarchitectural Constraints
    Proc. IEEE International Symposium on Workload Characterization (IISWC), pp. 222-231, 2014

Search/Learning for optimization 

  1. Marcela Zuluaga, Andreas Krause and Markus Püschel
    e-PAL: An Active Learning Approach to the Multi-Objective Optimization Problem
    Journal of Machine Learning Research, Vol. 17, No. 104, pp. 1-32, 2016
  2. Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
    Active Learning for Multi-Objective Optimization
    Proc. International Conference on Machine Learning (ICML), pp. 462-470, 2013
  3. Marcela Zuluaga, Andreas Krause and Markus Püschel
    Multi-Objective Optimization for High-Level Synthesis
    Proc. Workshop on High-Level Synthesis for High Performance Computing (HLS4HPC), 2013
  4. Marcela Zuluaga, Andreas Krause, Peter A. Milder and Markus Püschel
    "Smart" Design Space Sampling to Predict Pareto-Optimal Solutions
    Proc. Languages, Compilers, Tools and Theory for Embedded Systems (LCTES), pp. 119-128 , 2012
  5. Frédéric de Mesmay, Yevgen Voronenko and Markus Püschel
    Offline Library Adaptation Using Automatically Generated Heuristics
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-10, 2010
  6. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009

SIMD vectorization 

  1. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
  2. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
  3. Benjamin Hess, Thomas Gross and Markus Püschel
    Automatic Locality-Friendly Interface Extension of Numerical Functions
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 83-92, 2014
  4. Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti and Markus Püschel
    Automatic SIMD Vectorization of Fast Fourier Transforms for the Larrabee and AVX Instruction Sets
    Proc. International Conference on Supercomputing (ICS), 2011
  5. Franz Franchetti and Markus Püschel
    Fast Fourier Transform
    in Encyclopedia of Parallel Computing, Eds. David Padua, Springer 2011
  6. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
  7. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  8. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010
  9. Daniel McFarlin, Franz Franchetti and Markus Püschel
    Automatic Generation of Vectorized Fast Fourier Transform Libraries for the Larrabee and AVX Instruction Set Extension
    Proc. High Performance Extreme Computing (HPEC), 2009
  10. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  11. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  12. Daniel McFarlin, Franz Franchetti, José M. F. Moura and Markus Püschel
    High Performance Synthetic Aperture Radar Image Formation On Commodity Architectures
    Proc. SPIE Conference on Defense, Security, and Sensing, Proceedings of SPIE, Vol. 7337, pp. 733708, 2009
  13. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
  14. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  15. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  16. Franz Franchetti and Markus Püschel
    SIMD Vectorization of Non-Two-Power Sized FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007
  17. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  18. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT), pp. 222-232, 2006
  19. Franz Franchetti and Markus Püschel
    Short Vector Code Generation and Adaptation for DSP Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
  20. Franz Franchetti and Markus Püschel
    Short Vector Code Generation for the Discrete Fourier Transform
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2003
  21. Franz Franchetti and Markus Püschel
    A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002

Software defined radio (SDR) 

  1. Christian Berger, Volodymyr Arbatov, Yevgen Voronenko, Franz Franchetti and Markus Püschel
    Real-Time Software Implementation of an IEEE 802.11a Baseband Receiver on Intel Multicore
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1693-1696 , 2011
  2. Yevgen Voronenko, Volodymyr Arbatov, Christian Berger, Ronghui Peng, Markus Püschel and Franz Franchetti
    Computer Generation of Platform-Adapted Physical Layer Software
    Proc. Software Defined Radio (SDR), 2010

Sorting 

  1. Francois Serre and Markus Püschel
    Optimal Circuits for Streamed Linear Permutations using RAM
    Proc. FPGA, pp. 215-223, 2016
  2. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Streaming Sorting Networks
    ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
  3. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Computer Generation of Streaming Sorting Networks
    Proc. Design Automation Conference (DAC), pp. 1245-1253, 2012

SPIRAL 

  1. Franz Franchetti, Tze-Meng Low, Thom Popovici, Richard Veras, Daniele G. Spampinato, Jeremy Johnson, Markus Püschel, James C. Hoe and José M. F. Moura
    SPIRAL: Extreme Performance Portability
    Proceedings of the IEEE, special issue on ``From High Level Specification to High Performance Code'', Vol. 106, No. 11, 2018

Spiral overview paper 

  1. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  2. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  3. Franz Franchetti, Frédéric de Mesmay, Daniel McFarlin and Markus Püschel
    Operator Language: A Program Generation Framework for Fast Kernels
    Proc. IFIP Working Conference on Domain Specific Languages (DSL WC), Lecture Notes in Computer Science, Springer, Vol. 5658, pp. 385-410, 2009
  4. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  5. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    Proc. High Performance Extreme Computing (HPEC), 2008
  6. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    Proc. High Performance Extreme Computing (HPEC), 2008
  7. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
  8. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
  9. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001

SPIRAL program generation system for transforms 

  1. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  2. Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko and Markus Püschel
    Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
    Proc. International Conference on Machine Learning (ICML), pp. 729-736, 2009
  3. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Computer Generation of Fast Fourier Transforms for the Cell Broadband Engine
    Proc. International Conference on Supercomputing (ICS), pp. 26-35, 2009
  4. Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa and José M. F. Moura
    Discrete Fourier Transform on Multicores: Algorithms and Automatic Implementation
    IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 90-102, 2009
  5. Franz Franchetti and Markus Püschel
    Generating High-Performance Pruned FFT Implementations
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 549-552, 2009
  6. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  7. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  8. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  9. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC), Lecture Notes in Computer Science, Springer, Vol. 4367, pp. 201-214, 2007
  10. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  11. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  12. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006
  13. José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
    Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 211-215, 2005
  14. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on ``Program Generation, Optimization, and Adaptation'', Vol. 93, No. 2, pp. 232- 275, 2005
  15. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on ``Automatic Performance Tuning'', Vol. 18, No. 1, pp. 21-45, 2004
  16. Markus Püschel and José M. F. Moura
    SPIRAL: An Overview
    Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003
  17. Markus Püschel and José M. F. Moura
    Generation and Manipulation of DSP Transform Algorithms
    Proc. Digital Signal Processing Workshop, 2002
  18. Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Short Vector SIMD Code Generation for DSP Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2002
  19. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
  20. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Extreme Computing (HPEC), 2001
  21. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Extreme Computing (HPEC), 2000

SPL compiler: Translating math into code 

  1. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013
  2. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  3. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  4. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007
  5. Yevgen Voronenko and Markus Püschel
    Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
    IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
  6. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
  7. Yevgen Voronenko and Markus Püschel
    Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
  8. Franz Franchetti and Markus Püschel
    Short Vector Code Generation and Adaptation for DSP Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
  9. Franz Franchetti and Markus Püschel
    A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
    Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002

Staging 

  1. Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Staging for Generic Programming in Space and Time
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
  2. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Language Support for the Construction of High Performance Code Generators
    Proc. International Workshop on Adaptive Self-tuning Computing Systems (ADAPT), 2014
  3. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

Streaming permutations 

  1. Francois Serre and Markus Püschel
    Optimal Circuits for Streamed Linear Permutations using RAM
    Proc. FPGA, pp. 215-223, 2016
  2. Marcela Zuluaga, Peter A. Milder and Markus Püschel
    Streaming Sorting Networks
    ACM Transactions on Design Automation of Electronic Systems, Vol. 21, No. 4, pp. 55, 2016
  3. Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Streaming Datapaths for Arbitrary Fixed Permutations
    Proc. Design, Automation and Test in Europe (DATE), pp. 1118-1123, 2009
  4. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    Journal of the ACM, Vol. 56, No. 2, pp. 10:1-10:34, 2009

Structured matrices 

  1. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Structured Matrices
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 117-127, 2016

Synthesis 

  1. Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi and Markus Püschel
    Program Generation for Small-Scale Linear Algebra Applications
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 327-339, 2018
  2. Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Staging for Generic Programming in Space and Time
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 15-28, 2017
  3. Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 23-32, 2014
  4. Georg Ofenbeck, Tiark Rompf, Alen Stojanov, Martin Odersky and Markus Püschel
    Spiral in Scala: Towards the Systematic Construction of Generators for Performance Libraries
    Proc. International Conference on Generative Programming: Concepts & Experiences (GPCE), pp. 125-134, 2013

Transform 

  1. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    Automatic Linear Transform Program Generation for the Cell BE
    Proc. Supercomputing (SC), 2008, Poster (Abstract reviewed)
  2. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326, 2005

Tutorial 

  1. Markus Püschel, Franz Franchetti and Yevgen Voronenko
    Spiral
    in Encyclopedia of Parallel Computing, Eds. David Padua, pp. 1920-1933, Springer 2011
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    How To Write Fast Numerical Code: A Small Introduction
    Proc. Summer School on Generative and Transformational Techniques in Software Engineering (GTTSE), Lecture Notes in Computer Science, Springer, Vol. 5235, pp. 196-259, 2008

Walsh-Hadamard transform 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Jeremy Johnson and Markus Püschel
    In Search of the Optimal Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000

(No keyword) 

  1. Nikolaos Kyrtatas, Daniele G. Spampinato and Markus Püschel
    A Basic Linear Algebra Compiler for Embedded Processors
    Proc. Design, Automation and Test in Europe (DATE), pp. 1054-1059, 2015
  2. Alen Stojanov, Georg Ofenbeck, Tiark Rompf and Markus Püschel
    Abstracting Vector Architectures in Library Generators: Case Study Convolution Filters
    Proc. ACM International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY), pp. 14, 2014
  3. Frédéric de Mesmay, Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Automatic Generation of Multithreaded Vectorized Adaptive Libraries for Matrix Multiplication
    Proc. International Workshop on Parallel Matrix Algorithms and Applications (PMAA), 2008
  4. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Fast Fourier Transform on FPGA: Design Choices and Evaluation
    Proc. International Symposium on Field-Programmable Gate Arrays (FPGA), 2007
  5. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Spiral: Generating Signal Processing Kernels for New Commodity Architectures
    Proc. EDGE Workshop, pp. D49-D50, 2006
  6. Markus Püschel, Sebastian Egner and Thomas Beth
    AREP
    in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002
Publication interface designed and implemented by Patra Pantupat, Aliaksei Sandryhaila, and Markus Püschel
Electrical and Computer Engineering, Carnegie Mellon University, 2007