Kernel Tuner
Kernel Tuner greatly simplifies the development of highly-optimized and auto-tuned CUDA, OpenCL, and C code, supporting many advanced use-cases and optimization strategies that speed up the auto-tuning process.
Kernel Tuner greatly simplifies the development of highly-optimized and auto-tuned CUDA, OpenCL, and C code, supporting many advanced use-cases and optimization strategies that speed up the auto-tuning process.
A real-time pipeline to search for Fast Radio Bursts and other transient radio sources.
Fast, memory efficient and GPU accelerated radio interferometric calibration program
openPSTD is a Python-based research software that allows efficient and detailed calculation of sound propagation in a 2D built environment. It is especially useful as a reference tool for other developed software on acoustic sound propagation.
The cudawrappers library is a C++ wrapper for the Nvidia C libraries such as the CUDA driver, NVRTC, and cuFFT.
PowerSensor is a low-cost, custom-built device that measures the instantaneous power consumption of GPUs and other devices at a high time resolution.
FLAME GPU is a GPU accelerated simulator for domain independent complex systems simulations. FLAME GPU provides a mapping between a simple description of an agent and its interactions into optimised GPU code. The software abstracts the details and complexity of the GPU away from modellers.
Dynamically compile CUDA kernels and launch them type-safely using C++ magic. Tight integration with Kernel Tuner results in blazing fast GPU code.
Offload Eigen matrix-matrix multiplications to an Nvidia GPU
Lightning: Fast data processing using GPUs on distributed platforms
Use and design neural network ansatz wave function for real-space quantum Monte Carlo simulations of molecular systems.
Rocket is a framework for efficient execution of all-pair applications on heterogeneous platforms.