Publications

Filtered as: all types - all years - author: Zhang, N. - all keywords
Sorted by: keyword
Corresponding
bibtex list 

Acceleration 

  1. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

Architecture 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

BLAS 

  1. N. Zhang and Franz Franchetti
    Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
    Proc. International Symposium on Code Generation and Optimization (CGO), 2025

Code generator 

  1. N. Zhang and Franz Franchetti
    Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
    Proc. International Symposium on Code Generation and Optimization (CGO), 2025

CPUs 

  1. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024

Cryptography 

  1. N. Zhang and Franz Franchetti
    Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
    Proc. International Symposium on Code Generation and Optimization (CGO), 2025
  2. N. Zhang, S. Fu and Franz Franchetti
    Towards Closing the Performance Gap for Cryptographic Kernels Between CPUs and Specialized Hardware
    Proc. IEEE/ACM International Symposium on Microarchitecture (MICRO), 2025
  3. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  4. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Distributed, parallel, and cluster computing 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Fast Fourier Transform 

  1. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

FFT 

  1. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

FHE Hardward 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023

Hardware accelerator 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023

Hardware architecture 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

High performance 

  1. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  2. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

High performance computing (HPC) 

  1. Y. Lan, L. Tang, N. Zhang, Y. Eum, James C. Hoe and Franz Franchetti
    A RISC-V Vector Extension for Multi-word Arithmetic
    Proc. Supercomputing (SC), 2025

Homomorphic encryption 

  1. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  3. N. Zhang, H. Gamil, P. Brinich, B. Reynwar, A. Al Badawi, N. Neda, D. Soni, K. Canida, Y. Polyakov, P. Broderick, M. Maniatakos, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen, D. B. Cousins and Franz Franchetti
    Towards Full-Stack Acceleration for Fully Homomorphic Encryption
    Proc. High Performance Extreme Computing (HPEC), 2022

ISA extension 

  1. N. Zhang, S. Fu and Franz Franchetti
    Towards Closing the Performance Gap for Cryptographic Kernels Between CPUs and Specialized Hardware
    Proc. IEEE/ACM International Symposium on Microarchitecture (MICRO), 2025

Large integer arithmetic 

  1. N. Zhang, S. Fu and Franz Franchetti
    Towards Closing the Performance Gap for Cryptographic Kernels Between CPUs and Specialized Hardware
    Proc. IEEE/ACM International Symposium on Microarchitecture (MICRO), 2025

Large language models 

  1. Q. Oschatz, N. Zhang, M. Franusich and Franz Franchetti
    Towards Automated Reasoning Chains for Verification of LLM-Generated Scientific Code
    Proc. High Performance Extreme Computing (HPEC), 2025

Modular 

  1. N. Zhang and Franz Franchetti
    Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
    Proc. International Symposium on Code Generation and Optimization (CGO), 2025

Multi-word arithmetic 

  1. Y. Lan, L. Tang, N. Zhang, Y. Eum, James C. Hoe and Franz Franchetti
    A RISC-V Vector Extension for Multi-word Arithmetic
    Proc. Supercomputing (SC), 2025
  2. N. Zhang and Franz Franchetti
    Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
    Proc. International Symposium on Code Generation and Optimization (CGO), 2025
  3. Y. Eum, N. Zhang, L. Tang and Franz Franchetti
    Towards a RISC-V Instruction Set Extension for Multi-word Arithmetic
    Proc. High Performance Extreme Computing (HPEC), 2024

Multi-word integer 

  1. N. Zhang and Franz Franchetti
    Generating Number Theoretic Transforms for Multi-Word Integer Data Types
    Proc. International Symposium on Code Generation and Optimization (CGO), 2023

NTT 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023

Number theoretic transforms 

  1. Y. Lan, L. Tang, N. Zhang, Y. Eum, James C. Hoe and Franz Franchetti
    A RISC-V Vector Extension for Multi-word Arithmetic
    Proc. Supercomputing (SC), 2025
  2. N. Zhang and Franz Franchetti
    Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
    Proc. International Symposium on Code Generation and Optimization (CGO), 2025
  3. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
  4. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  5. N. Zhang and Franz Franchetti
    Generating Number Theoretic Transforms for Multi-Word Integer Data Types
    Proc. International Symposium on Code Generation and Optimization (CGO), 2023
  6. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

Numerical analysis 

  1. Q. Oschatz, N. Zhang, M. Franusich and Franz Franchetti
    Towards Automated Reasoning Chains for Verification of LLM-Generated Scientific Code
    Proc. High Performance Extreme Computing (HPEC), 2025

Optimizing 

  1. D. Sun, N. Zhang and Franz Franchetti
    Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

Partial differential equations 

  1. Q. Oschatz, N. Zhang, M. Franusich and Franz Franchetti
    Towards Automated Reasoning Chains for Verification of LLM-Generated Scientific Code
    Proc. High Performance Extreme Computing (HPEC), 2025

Performance model 

  1. N. Zhang, S. Fu and Franz Franchetti
    Towards Closing the Performance Gap for Cryptographic Kernels Between CPUs and Specialized Hardware
    Proc. IEEE/ACM International Symposium on Microarchitecture (MICRO), 2025
  2. D. Sun, N. Zhang and Franz Franchetti
    Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

Qiskit 

  1. D. Sun, N. Zhang and Franz Franchetti
    Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

Rewrite system 

  1. N. Zhang and Franz Franchetti
    Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
    Proc. International Symposium on Code Generation and Optimization (CGO), 2025

Ring Processing 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023

RISC-V 

  1. Y. Lan, L. Tang, N. Zhang, Y. Eum, James C. Hoe and Franz Franchetti
    A RISC-V Vector Extension for Multi-word Arithmetic
    Proc. Supercomputing (SC), 2025

RPU 

  1. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Cryptology ePrint Archive, 2023
  2. D. Soni, N. Neda, N. Zhang, B. Reynwar, H. Gamil, B. Heyman, M. N. T. Moopan, A. Al Badawi, Y. Polyakov, K. Canida, M. Pedram, M. Maniatakos, D. B. Cousins, Franz Franchetti, M. French, A. G. Schmidt and B. Reagen
    RPU: The Ring Processing Unit
    Proc. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023

Scientific computing 

  1. Q. Oschatz, N. Zhang, M. Franusich and Franz Franchetti
    Towards Automated Reasoning Chains for Verification of LLM-Generated Scientific Code
    Proc. High Performance Extreme Computing (HPEC), 2025
  2. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

Scientific computing 

  1. Q. Oschatz, N. Zhang, M. Franusich and Franz Franchetti
    Towards Automated Reasoning Chains for Verification of LLM-Generated Scientific Code
    Proc. High Performance Extreme Computing (HPEC), 2025
  2. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

Security 

  1. D. B. Cousins, Y. Polyakov, A. Al Badawi, M. French, A. G. Schmidt, A. Jacob, B. Reynwar, K. Canida, A. Jaiswal, C. Mathew, H. Gamil, N. Neda, D. Soni, M. Maniatakos, B. Reagen, N. Zhang, Franz Franchetti, P. Brinich, Jeremy Johnson, P. Broderick, M. Franusich, B. Zhang, Z. Cheng and M. Pedram
    TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
    arXiv, 2023

Semantics 

  1. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

Semantics lifting 

  1. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

Shor's algorithm 

  1. D. Sun, N. Zhang and Franz Franchetti
    Optimization and Performance Analysis of Shor’s Algorithm in Qiskit
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

SIMD 

  1. N. Zhang, S. Fu and Franz Franchetti
    Towards Closing the Performance Gap for Cryptographic Kernels Between CPUs and Specialized Hardware
    Proc. IEEE/ACM International Symposium on Microarchitecture (MICRO), 2025

SPIRAL 

  1. Q. Oschatz, N. Zhang, M. Franusich and Franz Franchetti
    Towards Automated Reasoning Chains for Verification of LLM-Generated Scientific Code
    Proc. High Performance Extreme Computing (HPEC), 2025
  2. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

Static analysis 

  1. N. Zhang, Sanil Rao, M. Franusich and Franz Franchetti
    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT
    Proc. ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

Transform 

  1. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024

Twiddle factor generation 

  1. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023

Vector 

  1. S. Fu, N. Zhang and Franz Franchetti
    Accelerating High-Precision Number Theoretic Transforms using Intel AVX-512
    Proc. Parallel Architectures and Compilation Techniques (PACT), 2024
  2. N. Zhang, A. Ebel, N. Neda, P. Brinich, B. Reynwar, A. G. Schmidt, M. Franusich, Jeremy Johnson, B. Reagen and Franz Franchetti
    Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
  3. P. Brinich, N. Zhang, A. Ebel, Franz Franchetti and Jeremy Johnson
    Twiddle Factor Generation for a Vectorized Number Theoretic Transform
    Proc. IEEE High Performance Extreme Computing (HPEC), 2023
Publication interface designed and implemented by Patra Pantupat, Aliaksei Sandryhaila, and Markus Püschel
Electrical and Computer Engineering, Carnegie Mellon University, 2007