2021-10
SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator
GPU acceleration of finite state machine input execution: Improving scale and performance
2021-08
Generating high performance code for irregular data structures using dependent types
Proceedings of the 9th ACM SIGPLAN International Workshop on Functional High-Performance and Numerical Computing
(2021-08-22)
dblp.uni-trier.dePDF2021-05
Code Generation for Room Acoustics Simulations with Complex Boundary Conditions
2021-04
Fast Optimisation of Convolutional Neural Network Inference using System Performance Models
2021-02
Central Bank Digital Currency with Asymmetric Privacy
2020-11
DelayRepay: delayed execution for kernel fusion in Python
2020-10
Optimising the Performance of Convolutional Neural Networks across Computing Systems using Transfer Learning.
2020-07
Binary Ostensibly‐Implicit Trees for Fast Collision Detection
2020-02
Automatic generation of specialized direct convolutions for mobile GPUs
High-level hardware feature extraction for GPU performance prediction of stencils
Generating fast sparse matrix vector multiplication from a high level generic functional IR
Replication Packager for 'Generating Fast Sparse Matrix Vector Multiplicationfrom a High Level Generic Functional IR'
2019-12
Tiling Optimizations for Stencil Computations Using Rewrite Rules in Lift
Publications collected and formatted using Paperoni