Publications

Filtered as: all types - all years - all authors - all keywords
Sorted by: year - type
Corresponding
bibtex list 

2008 

Journal 

  1. Markus Püschel, Peter A. Milder and James C. Hoe
    Permuting Streaming Data Using RAMs
    submitted for publication

Conference (fully reviewed) 

  1. Sung-Chul Han, Markus Püschel and Rohit Negi
    A Flexible Decoder for Quasi-Cyclic LDPC Codes
    submitted for publication
  2. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Formal Datapath Representation and Manipulation for Implementing DSP Transforms
    Proc. Design Automation Conference (DAC), 2008
  3. Franz Franchetti and Markus Püschel
    Generating SIMD Vectorized Permutations
    Proc. International Conference on Compiler Construction (CC), Lecture Notes in Computer Science, Springer, Vol. 4959, pp. 116-131, 2008
  4. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    How To Write Fast Numerical Code: A Small Introduction
    to appear in Proc. Summer School on Generative and Transformational Techniques in Software Engineering, Lecture Notes in Computer Science, Springer, 2008

Conference (abstract reviewed) 

  1. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura and Markus Püschel
    Domain-Specific Library Generation for Parallel Software and Hardware Platforms
    Proc. NSF Next Generation Software Program Workshop (NSFNGS) colocated with IPDPS, 2008
  2. Srinivas Chellappa, Franz Franchetti and Markus Püschel
    FFT Program Generation for the Cell BE
    Proc. International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), 2008
  3. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    Generating High-Performance General Size Linear Transform Libraries Using Spiral
    submitted for publication
  4. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Linear Transforms: From Math to Efficient Hardware
    Proc. Workshop on High-Level Synthesis colocated with DAC, 2008
  5. Franz Franchetti, Daniel McFarlin, Frédéric de Mesmay, Hao Shen, Tomasz Wiktor Włodarczyk, Srinivas Chellappa, Marek Telgarsky, Peter A. Milder, Yevgen Voronenko, Qian Yu, James C. Hoe, José M. F. Moura and Markus Püschel
    Program Generation with Spiral: Beyond Transforms
    to appear in Proc. High Performance Embedded Computing (HPEC), 2008
  6. Yevgen Voronenko, Franz Franchetti, Frédéric de Mesmay and Markus Püschel
    System Demonstration of Spiral: Generator for High-Performance Linear Transform Libraries
    to appear in Proc. Algebraic Methodology and Software Technology (AMAST), 2008

Thesis 

  1. Yevgen Voronenko
    Library Generation for Linear Transforms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2008

2007 

Journal 

  1. Yevgen Voronenko and Markus Püschel
    Mechanical Derivation of Fused Multiply-Add Algorithms for Linear Transforms
    IEEE Transactions on Signal Processing, Vol. 55, No. 9, pp. 4458-4473, 2007
  2. Yevgen Voronenko and Markus Püschel
    Multiplierless Multiple Constant Multiplication
    ACM Transactions on Algorithms, Vol. 3, No. 2, 2007
  3. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Time-Multiplexed Multiple Constant Multiplication
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 26, No. 9, pp. 1551-1563, 2007

Conference (fully reviewed) 

  1. Paolo D'Alberto, Peter A. Milder, Aliaksei Sandryhaila, Franz Franchetti, James C. Hoe, José M. F. Moura, Markus Püschel and Jeremy Johnson
    Generating FPGA Accelerated DFT Libraries
    Proc. IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), pp. 173 - 184, 2007
  2. Paolo D'Alberto, Markus Püschel and Franz Franchetti
    Performance/Energy Optimization of DSP Transforms on the XScale Processor
    Proc. International Conference on High Performance Embedded Architectures & Compilers (HiPEAC), 2007
  3. Franz Franchetti and Markus Püschel
    SIMD Vectorization of Non-Two-Power Sized FFTs
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-17-II-20, 2007

Conference (abstract reviewed) 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    FFT Compiler: From Math to Efficient Hardware
    Proc. IEEE International High Level Design Validation and Test Workshop (HLDVT), 2007

Thesis 

  1. Sung-Chul Han
    A Flexible Decoder and Performance Evaluation of Array-Structured LDPC Codes
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2007

Other 

  1. Peter A. Milder, Franz Franchetti, James C. Hoe and Markus Püschel
    Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware
    CSSI Technical Report #CSSI-07-01, Carnegie Mellon University, 2007

2006 

Journal 

  1. Pawel Hitczenko, Jeremy Johnson and Hung-Jen Huang
    Distribution of a Class of Divide and Conquer Recurrences arising from the Computation of the Walsh-Hadamard Transform
    Theoretical Computer Science, Vol. 352, pp. 8-30, 2006

Conference (fully reviewed) 

  1. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    A Rewriting System for the Vectorization of Signal Transforms
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 4395, pp. 363-377, 2006
  2. Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel and Christoph W. Ueberhuber
    Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers
    Proc. International Symposium on Parallel and Distributed Processing and Application (ISPA), Lecture Notes In Computer Science, Springer, Vol. 4330, pp. 818-832, 2006
  3. Peter A. Milder, Mohammad Ahmad, James C. Hoe and Markus Püschel
    Fast and Accurate Resource Estimation of Automatically Generated Custom DFT IP Cores
    Proc. FPGA, pp. 211-220, 2006
  4. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    FFT Program Generation for Shared Memory: SMP and Multicore
    Proc. Supercomputing (SC), 2006
  5. Sung-Chul Han, Franz Franchetti and Markus Püschel
    Program Generation for the All-Pairs Shortest Path Problem
    Proc. Parallel Architectures and Compilation Techniques (PACT) , pp. 222-232, 2006
  6. Marek Telgarsky, James C. Hoe and José M. F. Moura
    Spiral: Joint Runtime and Energy Optimization of Linear Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 3, 2006

Conference (abstract reviewed) 

  1. Paolo D'Alberto, Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Püschel and José M. F. Moura
    Discrete Fourier Transform Compiler for FPGA and CPU/FPGA Partitioned Implementations
    Proc. High Performance Embedded Computing (HPEC), 2006
  2. Franz Franchetti, Andreas Bonelli, Ekapol Chuangsuwanich, Yu-Chiang Lee, Juergen Lorenz, Thomas Peter, Hao Shen, Marek Telgarsky, Yevgen Voronenko, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Parallelism in Spiral
    Proc. Workshop on Programming Models for Ubiquitous Parallelism (PMUP), 2006

2005 

Journal 

  1. José M. F. Moura, Markus Püschel, David Padua and Jack Dongarra
    Scanning the Issue: Special Issue on Program Generation, Optimization, and Platform Adaptation
    Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", Vol. 93, No. 2, pp. 211-215, 2005
  2. Markus Püschel, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso, Bryan Singer, Jianxin Xiong, Franz Franchetti, Aca Gacic, Yevgen Voronenko, Kang Chen, Robert W. Johnson and Nicholas Rizzolo
    SPIRAL: Code Generation for DSP Transforms
    Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", Vol. 93, No. 2, pp. 232- 275, 2005

Conference (fully reviewed) 

  1. Grace Nordin, Peter A. Milder, James C. Hoe and Markus Püschel
    Automatic Generation of Customized Discrete Fourier Transform IPs
    Proc. Design Automation Conference (DAC), pp. 471-474, 2005
  2. Mihai Furis, Pawel Hitczenko and Jeremy Johnson
    Cache Miss Analysis of WHT Algorithms
    Proc. International Conference on Analysis of Algorithms, Discrete Mathematics and Theoretical Computer Science, pp. 115-124, 2005
  3. Franz Franchetti, Yevgen Voronenko and Markus Püschel
    Formal Loop Merging for Signal Transforms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 315-326 , 2005
  4. Xiaoming Li, María J. Garzarán and David Padua
    Optimizing Sorting with Genetic Algorithm
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 99-110, 2005
  5. Thammanit Pipatsrisawat, Aca Gacic, Franz Franchetti, Markus Püschel and José M. F. Moura
    Performance Analysis of the Filtered Backprojection Image Reconstruction Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 153-156, 2005

Conference (abstract reviewed) 

  1. Roland Wunderlich, Markus Püschel and James C. Hoe
    Accelerating Blocked Matrix-Matrix Multiplication using a Software-Managed Memory Hierarchy with DMA
    Proc. High Performance Embedded Computing (HPEC), 2005

2004 

Journal 

  1. Anthony F. Breitzman and Jeremy Johnson
    Automatic Derivation and Implementation of Fast Convolution Algorithms
    Journal of High Performance Computing and Applications, special issue on "Computer Algebra and Signal Processing", Vol. 37, No. 2, pp. 157-186, 2004
  2. Markus Püschel, Bryan Singer, Jianxin Xiong, José M. F. Moura, Jeremy Johnson, David Padua, Manuela Veloso and Robert W. Johnson
    SPIRAL: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms
    Journal of High Performance Computing and Applications, special issue on "Automatic Performance Tuning", Vol. 18, No. 1, pp. 21-45, 2004

Conference (fully reviewed) 

  1. Xiaoming Li, María J. Garzarán and David Padua
    A Dynamically Tuned Sorting Library
    Proc. International Symposium on Code Generation and Optimization (CGO), pp. 111-124, 2004
  2. Jeremy Johnson and Kang Chen
    A Self-Adapting Distributed Memory Package for Fast Signal Transforms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 44-, 2004
  3. Aca Gacic, Markus Püschel and José M. F. Moura
    Automatically Generated High-Performance Code for Discrete Wavelet Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-69-V-72, 2004
  4. Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber and Peter Wurzinger
    Automatically Tuned FFTs for BlueGene/L’s Double FPU
    Proc. High Performance Computing for Computational Science (VECPAR), Lecture Notes in Computer Science, Springer, Vol. 3402, pp. 23-36, 2004
  5. Adam C. Zelinski, Markus Püschel, Smarahara Misra and James C. Hoe
    Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-221-V-224, 2004
  6. Yevgen Voronenko and Markus Püschel
    Automatic Generation of Implementations for DSP Transforms on Fused Multiply-Add Architectures
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. V-101-V-104, 2004
  7. Markus Püschel, Adam C. Zelinski and James C. Hoe
    Custom-Optimized Multiplierless Implementations of DSP Algorithms
    Proc. International Conference on Computer-Aided Design (ICCAD), pp. 175-182, 2004
  8. Nicholas Rizzolo and David Padua
    HiLO: High Level Optimization of FFTs
    Proc. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2004
  9. Peter Tummeltshammer, James C. Hoe and Markus Püschel
    Multiple Constant Multiplication By Time-Multiplexed Mapping of Addition Chains
    Proc. Design Automation Conference (DAC), pp. 826-829, 2004

Conference (abstract reviewed) 

  1. Lawrence C. Chang, Inpyo Hong, Yevgen Voronenko and Markus Püschel
    Adaptive Mapping of Linear DSP Algorithms to Fixed-Point Arithmetic
    Proc. High Performance Embedded Computing (HPEC), 2004
  2. Grace Nordin, James C. Hoe and Markus Püschel
    Discrete Fourier Transform IP Generator
    Proc. High Performance Embedded Computing (HPEC), 2004

Thesis 

  1. Aca Gacic
    Automatic Implementation and Platform Adaptation of Discrete Filtering and Wavelet Algorithms
    PhD. thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2004

2003 

Conference (fully reviewed) 

  1. Jeremy Johnson and Xu Xu
    A Recursive Implementation of the Dimensionless FFT
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2003
  2. Aca Gacic, Markus Püschel and José M. F. Moura
    Fast Automatic Implementations of FIR Filters
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 541-544, 2003
  3. Franz Franchetti and Markus Püschel
    Short Vector Code Generation and Adaptation for DSP Algorithms
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. 537-540, 2003
  4. Franz Franchetti and Markus Püschel
    Short Vector Code Generation for the Discrete Fourier Transform
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), 2003
  5. Fang Fang, Rob A. Rutenbar, Markus Püschel and Tsuhan Chen
    Toward Efficient Static Analysis of Finite-Precision Effects in DSP Applications via Affine Arithmetic Modeling
    Proc. Design Automation Conference (DAC), pp. 496-501, 2003

Conference (abstract reviewed) 

  1. Smarahara Misra, Adam C. Zelinski, James C. Hoe and Markus Püschel
    Custom Reduction of Arithmetic in Linear DSP Transforms
    Proc. High Performance Embedded Computing (HPEC), 2003
  2. Markus Püschel and José M. F. Moura
    SPIRAL: An Overview
    Proc. Workshop on Optimizations for DSP and Embedded Systems (ODES), 2003

Thesis 

  1. Xu Xu
    A Recursive Implementation of the Dimensionless FFT
    Master thesis, Computer Science, Drexel University, 2003
  2. Anthony F. Breitzman
    Automatic Derivation and Implementation of Fast Convolution Algorithms
    PhD. thesis, Computer Science, Drexel University, 2003
  3. Mihai Furis
    Cache Miss Analysis of Walsh-Hadamard Transform Algorithms
    Master thesis, Computer Science, Drexel University, 2003
  4. Smarahara Misra
    Custom Reduction of Arithmetic in Multiplierless implementations of DSP Transforms
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2003

2002 

Journal 

  1. Bryan Singer and Manuela Veloso
    Automating the Modeling and Optimization of the Performance of Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 50, No. 8, pp. 2003-2014, 2002
  2. Bryan Singer and Manuela Veloso
    Learning to Construct Fast Signal Processing Implementations
    Journal of Machine Learning Research, special issue on "the Eighteenth International Conference on Machine Learning (ICML 2001)", Vol. 3, pp. 887-919, 2002

Book Chapter 

  1. Markus Püschel, Sebastian Egner and Thomas Beth
    AREP
    in New Reference Book on Computer Algebra, Eds. J. Grabmeier, E. Kaltofen, J. Grabmeier, E. Kaltofen, Springer 2002

Conference (fully reviewed) 

  1. Kang Chen and Jeremy Johnson
    A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 58-63, 2002
  2. Franz Franchetti and Markus Püschel
    A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 20-26, 2002

Conference (abstract reviewed) 

  1. Markus Püschel and José M. F. Moura
    Generation and Manipulation of DSP Transform Algorithms
    Proc. Digital Signal Processing Workshop, 2002
  2. Fang Fang, James C. Hoe, Markus Püschel and Smarahara Misra
    Generation of Custom DSP Transform IP Cores: Case Study Walsh-Hadamard Transform
    Proc. High Performance Embedded Computing (HPEC), 2002
  3. Franz Franchetti, Markus Püschel, José M. F. Moura and Christoph W. Ueberhuber
    Short Vector SIMD Code Generation for DSP Algorithms
    Proc. High Performance Embedded Computing (HPEC), 2002

Thesis 

  1. Mike Balog
    A Flexible Framework for Implementing FFT Processors
    Master thesis, Electrical and Computer Engineering, Drexel University, 2002
  2. Hung-Jen Huang
    Performance Analysis of an Adaptive Algorithm for the Walsh-Hadamard Transform
    Master thesis, Computer Science, Drexel University, 2002

2001 

Journal 

  1. Sebastian Egner, Jeremy Johnson, David Padua, Jianxin Xiong and Markus Püschel
    Automatic Derivation and Implementation of Signal Processing Algorithms
    ACM SIGSAM Bulletin Communications in Computer Algebra, Vol. 35, No. 2, pp. 1-19, 2001
  2. Sebastian Egner and Markus Püschel
    Automatic Generation of Fast Discrete Signal Transforms
    IEEE Transactions on Signal Processing, Vol. 49, No. 9, pp. 1992-2002, 2001

Conference (fully reviewed) 

  1. Neungsoo Park and Viktor K. Prasanna
    Cache Conscious Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 2, pp. II-1205-II-1208, 2001
  2. Markus Püschel, Bryan Singer, Manuela Veloso and José M. F. Moura
    Fast Automatic Generation of DSP Algorithms
    Proc. International Conference on Computational Science (ICCS), Lecture Notes In Computer Science, Springer, Vol. 2073, pp. 97-106, 2001
  3. Bryan Singer and Manuela Veloso
    Learning to Generate Fast Signal Processing Implementations
    Proc. International Conference on Machine Learning (ICML) , pp. 529-536, 2001
  4. Jianxin Xiong, Jeremy Johnson, Robert W. Johnson and David Padua
    SPL: A Language and Compiler for DSP Algorithms
    Proc. Programming Languages Design and Implementation (PLDI), pp. 298-308, 2001
  5. Bryan Singer and Manuela Veloso
    Stochastic Search for Signal Processing Algorithm Optimization
    Proc. Supercomputing (SC), pp. 22, 2001

Conference (abstract reviewed) 

  1. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel, Bryan Singer, Manuela Veloso and Jianxin Xiong
    Generating Platform-Adapted DSP Libraries using SPIRAL
    Proc. High Performance Embedded Computing (HPEC), 2001

Thesis 

  1. Jianxin Xiong
    Automatic Optimization of DSP Algorithms
    PhD. thesis, Computer Science, University of Illinois at Urbana-Champaign, 2001, Also as Technical Report UIUCDCS-R-2001-224, University of Illinois
  2. Bryan Singer
    Automating the Modeling and Optimization of the Performance of Signal Processing Algorithms
    PhD. thesis, Computer Science, Carnegie Mellon University, 2001
  3. Pinit Kumhom
    Design, Optimization, and Implementation of a Universal FFT Processor
    PhD. thesis, Electrical and Computer Engineering, Drexel University, 2001, Also Tech. Report DU-MCS-01-01, Drexel University, 2001

2000 

Conference (fully reviewed) 

  1. Neungsoo Park, Viktor K. Prasanna, Kiran Bondalapati and Dongsoo Kang
    Dynamic Data Layouts for Cache-conscious Factorization of DFT
    Proc. International Parallel and Distributed Processing Symposium (IPDPS), pp. 693-701, 2000
  2. Jeremy Johnson and Markus Püschel
    In Search of the Optimal Walsh-Hadamard Transform
    Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 6, pp. 3347-3350, 2000
  3. Bryan Singer and Manuela Veloso
    Learning to Predict Performance from Formula Modeling and Training Data
    Proc. International Conference on Machine Learning (ICML) , pp. 887-894, 2000

Conference (abstract reviewed) 

  1. José M. F. Moura, Jeremy Johnson, Robert W. Johnson, David Padua, Viktor K. Prasanna, Markus Püschel and Manuela Veloso
    SPIRAL: Automatic Implementation of Signal Processing Algorithms
    Proc. High Performance Embedded Computing (HPEC), 2000

Thesis 

  1. Gavin Haentjens
    An Investigation of Recursive FFT Implementations
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
  2. David Sepiashvili
    Performance Models and Search Methods for Optimal FFT Implementations
    Master thesis, Electrical and Computer Engineering, Carnegie Mellon University, 2000
Publication interface designed and implemented by Patra Pantupat, Aliaksei Sandryhaila, and Markus Püschel
Electrical and Computer Engineering, Carnegie Mellon University, 2007