Version 1.3.0
This release includes a significant number of changes and new developments:
- An OpenMP-offload variant (
dwarf-cloudsc-gpu-omp-scc-hoist
), derived from the OpenACC SCC-hoist variant, developed by L. Lucido (Atos). - A new CUDA Fortran implementation (
dwarf-cloudsc-gpu-scc-cuf
), implementing the SCC loop layout. This requires the--with-cuda
flag to build. - A new Python implementation, based on GT4Py, capable of generating CPU and GPU code (using cupy and, optionally, DaCe)
- Deprecation of the
dwarf-cloudsc-gpu-claw
variant, which no longer works correctly on recent NVIDIA software stacks. Building this variant requires adding an explicit--with-claw
flag to the build command. - With Loki publicly available, the references to the Loki repository have been updated and testing of the source-to-source translation variants is now possible.
- A JUBE benchmark configuration has been added to ease testing across various platforms.
- Clean-up of the CMake scripts
- New arch files for:
- HPC2020, the ECMWF Atos system in Bologna
- LUMI
- MeluXina
- Additional Isambard partitions