Drive faster breakthroughs through faster code: Get more results on your hardware today and carry your code forward to the future with code modernization.
Develop with the Intel® Rendering Toolkit for high fidelity and high performance ray tracing applications. Improve your games with ray tracing experiences.
SNAP https://github.com/UoB-HPC/SNAP-OpenCL , serves as a proxy application to model modern discrete ordinates neutral particle transport application. In this project we leverage the existing OpenCL implementation of SNAP and evaluate the performance and productivity of MetaCL version of SNAP.
A novel Greedy Incremental Alignment-based algorithm called nGIA was proposed for sequence clustering with high efficiency and precision. The nGIA consists of a pre-filter, a modified short word filter, a new data packing strategy, a modified greedy incremental method, and is parallelized via GPU.
The TAU Performance System® supports profiling and tracing of programs written using the Intel OneAPI. Intel OneAPI provides two interfaces for programming - OpenCL and DPC++/SYCL for CPUs and GPUs. TAU supports both - the OpenCL profiling interface and Intel Level Zero API to observe performance.
We propose the tuning and migration of a CUDA-based RTM to a DPC++ application by applying DPC++ Compatibility Tool. We aim to demonstrate the versatility of OneAPI to build unified code capable of being executed in different processing units such as CPUs and GPUs with low implementation cost.
A tool for designing reproducible Machine learning experiments. It’s an end to end functional style data transformation distributed pipeline built on top of Intel OneAPI and Google JAX.
Spatter is a new benchmark tool for assessing memory system architectures in the context of a specific category of indexed accesses known as gather and scatter. OneAPI and DevCloud are used to develop support for a OneAPI backend for Spatter that can be targeted to Intel FPGAs.
Another alternative of the classical “stable” three-way quicksort performance optimization using Nvidia CUDA Development Toolkit, OpenMP 4.5/5.0 and Intel’s Open-Source Clang/LLVM compiler distribution.
This work focuses on exploring the architecture of Intel CPUs and Integrated Graphics and their heterogeneous computing potential to boost performance and energy-efficiency of epistasis detection. This will be achieved making use of OpenCL, Data Parallel C++ and OpenMP programming models.
Use the tool MetaCL to port Breadth First search application. MetaCL is a tool that autogenerates the OpenCL host code from OpenCL kernels. https://github.com/vtsynergy/MetaMorph
Markov decision processes provide a formal framework for a computer to make decisions autonomously and intelligently when the effects of its actions are not deterministic. To solve this computationally complex problem, we experiment with scheduling on a low-power heterogeneous CPU+GPU SoC platform.