Skip to content

Version 1.3.0

Compare
Choose a tag to compare
@reuterbal reuterbal released this 20 Jan 13:49
· 288 commits to main since this release
f4a90b6

This release includes a significant number of changes and new developments:

  • An OpenMP-offload variant (dwarf-cloudsc-gpu-omp-scc-hoist), derived from the OpenACC SCC-hoist variant, developed by L. Lucido (Atos).
  • A new CUDA Fortran implementation (dwarf-cloudsc-gpu-scc-cuf), implementing the SCC loop layout. This requires the --with-cuda flag to build.
  • A new Python implementation, based on GT4Py, capable of generating CPU and GPU code (using cupy and, optionally, DaCe)
  • Deprecation of the dwarf-cloudsc-gpu-claw variant, which no longer works correctly on recent NVIDIA software stacks. Building this variant requires adding an explicit --with-claw flag to the build command.
  • With Loki publicly available, the references to the Loki repository have been updated and testing of the source-to-source translation variants is now possible.
  • A JUBE benchmark configuration has been added to ease testing across various platforms.
  • Clean-up of the CMake scripts
  • New arch files for:
    • HPC2020, the ECMWF Atos system in Bologna
    • LUMI
    • MeluXina
    • Additional Isambard partitions