
In this session, we present a DPC++ code walk-through of a simple matrix multiplication example and show how to optimize it for performance using DPC++ features such as ND-range kernels and shared local memory. We then compare the execution times of the optimized versions.
Agenda
- DPC++ code walk-through of a simple matrix multiplication example - 20 min
- Optimize the code for performance using an ND-range kernel - 20 min
- Optimize the code for performance using shared local memory - 20 min
- Compare the execution times of the various optimizations - 20 min
This module is a part of the Aurora Learning Paths Series.
About the Presenter
Rakshith Krishnappa is a developer evangelist at Intel, focused on oneAPI, DPC++, and High-Performance Computing. For the last 16 years, he has worked on various Intel products including CPUs, Graphics, GPUs, HPC products, and Software solutions.
