Skip to content

Latest commit

 

History

History
132 lines (87 loc) · 6.24 KB

README.md

File metadata and controls

132 lines (87 loc) · 6.24 KB

Data Parallel C++ Book Source Samples

This repository accompanies Data Parallel C++: Programming Accelerated Systems using C++ and SYCL by James Reinders, Ben Ashbaugh, James Brodman, Michael Kinsner, John Pennycook, Xinmin Tian second edition (Apress, 2023), and the first edition (Apress, 2020).

Cover 2nd Edition Cover 1st Edition

Purpose of this branch (main)

This branch (main) contains source code expanded from the Second Edition of the DPC++ book (available now!). We say 'expanded' because they include code not listed in the book, and we will update it as needed to keep it useful - as we did after the first edition was published. We welcome feedback. The sycl121_original_publication branch contains the source code published in the first edition. The first edition's book source was primarily based on the older SYCL 1.2.1 specification, and many enhancements and changes were added by the time the SYCL 2020 specification was published after our book. Since current toolchains which support SYCL are based on SYCL 2020, this main branch is intended to be compatible with recent compiler and toolchain releases.

The Second Edition of the DPC++ book, available now, is based on the updated code examples in this main branch.

Overview

Many of the samples in the book are snips from the more complete files in this repository. The full files contain supporting code, such as header inclusions, which are not shown in every listing within the book. The complete listings are intended to compile and be modifiable for experimentation.

Samples in this repository are updated to align with the most recent changes to the language and toolchains, and are more current than captured in the book text due to lag between finalization and actual publication of a print book. If experimenting with the code samples, start with the versions in this repository. DPC++ and SYCL are evolving to be more powerful and easier to use, and updates to the sample code in this repository are a good sign of forward progress!

Download the files as a zip using the green button, or clone the repository to your machine using Git.

How to Build the Samples

The samples in this repository are intended to compile with any modern C++ with SYCL compiler. We have tested it with the open source DPC++ project toolchain linked below, and with the 2023.2 release and newer of the oneAPI prebuilt icpx compilers based on the DPC++ open source project. If you have an older toolchain installed, you may encounter compilation errors due to evolution of the features and extensions. Recent testing verified that AdaptiveCpp (previously HipSYCL), with a few rare exceptions that should be resolved soon, is able to support all these examples as well. We will welcome any feedback regarding compatibility with any C++ compiler that has SYCL support.

Prerequisites

  1. An installed SYCL toolchain. See below for details on the tested DPC++ toolchain
  2. CMake 3.10 or newer (Linux) or CMake 3.25 or newer (Windows)
  3. Ninja or Make - to use the build steps described below

To build and use these examples, you will need an installed DPC++ toolchain. For one such toolchain, please visit:

https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-compiler.html

Alternatively, much of the toolchain can be built directly from:

https://github.com/intel/llvm

Some of the samples require other dependencies. To disable samples requiring these dependencies use the CMake variables described below.

Setting Up an Environment to Build the Samples

Setup environment variables if using a oneAPI / DPC++ implementation:

On Windows:

\path\to\inteloneapi\setvars.bat

On Linux:

source /path/to/inteloneapi/setvars.sh

Building the Samples:

Note: CMake supports different generators to create build files for different build systems. Some popular generators are Unix Makefiles or Ninja when building from the command line, and Visual Studio-based generators when building from a Windows IDE. The examples below generate build files for Unix Makefiles, but feel free to substitute a different generator, if preferred.

  1. Create build files using CMake. For example:

    mkdir build && cd build
    cmake -G "Unix Makefiles" ..
  2. Build with the generated build files:

    cmake --build . --target install --parallel

    Or, use the generated Makefiles directly:

    make install -j8

If your SYCL compiler is not detected automatically, or to explicitly specify a different SYCL compiler, use the CMAKE_CXX_COMPILER variable. For example:

cmake -G "Unix Makefiles" -DCMAKE_CXX_COMPILER=/path/to/your/sycl/compiler ..

CMake Variables:

The following CMake variables are supported. To specify one of these variables via the command line generator, use the CMake syntax -D<option name>=<value>. See your CMake documentation for more details.

Variable Type Description
NODPL BOOL Disable samples that require the oneAPI DPC++ Library (oneDPL). Default: FALSE
NODPCT BOOL Disable samples that require the DPC++ Compatibility Tool (dpct). Default: FALSE
NOL0 BOOL Disable samples that require the oneAPI Level Zero Headers and Loader. Default: TRUE
WITHCUDA BOOL Enable CUDA device support for the samples. Default: FALSE
WITHROCM BOOL Enable ROCm device support for the samples. Default: FALSE

NVIDIA/ROCm Device Support

Note: Currently, fig_17_31_inter_kernel_pipe is disabled for NVIDIA/ROCm devices, as they are not supported.

You can set the WITHCUDA or WITHROCM variable to build for CUDA or ROCM devices. For example, for a CUDA device:

cmake -G "Unix Makefiles" .. -DWITHCUDA=ON

or for a ROCM device:

cmake -G "Unix Makefiles" .. -DWITHROCM=ON

You can also set the target GPU architecture (which is necessary for ROCm devices and set to gfx1100 by default) like:

cmake -G "Unix Makefiles" .. -DWITHROCM=ON -DROCM_GPU_ARCH=gfx1030