Support OpenCL PSy layer for GOcean 1.0 #174

arporter · 2018-05-24T10:24:47Z

In this issue we will extend PSyclone to support the (optional) generation of OpenCL code in the PSy layer. We will use the clfortran module so that we can stick with Fortran.
Note that transforming existing kernels into OpenCL will be the subject of a separate issue.

We will target the GOcean 1.0 API first since that is what is required in EuroEXA.

The text was updated successfully, but these errors were encountered:

arporter · 2018-05-24T10:36:32Z

Instead of calling kernel subroutines we must call clEnqueueNDRangeKernel with a kernel object as argument. We need some way of obtaining that kernel object in the generated PSy layer. Obviously we have the name of the kernel from its meta-data. We could then use this to make a call into (some) infrastructure that returns the associated kernel object.

arporter · 2018-06-04T10:03:32Z

I've had a change of plan and decided to go with using a Transformation to toggle whether or not to generate OpenCL. This allows it to be applied on a per-Schedule/Invoke basis rather than globally for an Algorithm file. Currently the transformation just sets Node._opencl = True for the Schedule. Children of the Schedule must currently first look-up their owning Schedule in order to determine whether or not OpenCL is enabled. Alternatively, I could change the setter for Node._opencl to cascade the setting to all child nodes.
When OpenCL is enabled, Loop.gen_code() should do nothing and Kernel.gen_code() should generate a clEnqueueNDRangeKernel call.

arporter · 2018-06-04T10:11:19Z

More importantly, I need to work out how we are going to set the kernel arguments which, in OpenCL, is done via (many) API calls. Happily, I think I can generate the necessary code using the kernel meta-data and Algorithm argument list. I think the generated code should then be called within the if(first_time) block at the start of the Invoke. Possibly we may need to ensure this is only ever done once for each kernel (but then again, it may not matter).

arporter · 2018-06-08T15:27:44Z

For reference, the code to set kernel arguments looks like this (in Fortran):

arg_idx = 0
ierr = clSetKernelArg(kernel, arg_idx, sizeof(nx), C_LOC(nx))
call check_status("clSetKernelArg", ierr)
arg_idx = arg_idx + 1
ierr = clSetKernelArg(kernel, arg_idx, sizeof(ssha_device), &
     C_LOC(ssha_device))
call check_status("clSetKernelArg", ierr)
arg_idx = arg_idx + 1

The scalar argument (nx) is just the local copy whereas for fields it is the pointer to the buffer on the device that is required. There's no mention of types - just addresses and sizes.

arporter · 2018-06-08T15:38:08Z

We will need to create a routine to set the kernel arguments for each kernel that we come across. Given that we run PSyclone separately on each Algorithm file, there's currently no way to avoid creating such a routine several times. That's probably not a big deal though. I think the logical place to do this is after we've created the Algorithm and PSy code since at that point we know what all our kernels are.

arporter · 2018-06-15T10:17:03Z

I've put the necessary code into PSy.gen_code() as that enables me to prevent producing the same routine several times (if a PSy layer makes multiple uses of the same kernel). There's nothing to stop a single invoke from calling the same kernel multiple times with different arguments - we must therefore call the set-kernel args routine before each kernel is launched. We could subsequently improve on this by checking the argument lists of each kernel call.

…ls() to be called multiple times.

…Sy module

arporter · 2018-06-15T16:14:04Z

I think I have all the basics covered now with the exception of the creation of buffers on the device. At the moment I assume the infrastructure has set-up field%device_ptr but I don't think that has to be the case. Although I could in theory do this from PSyclone (although making sure the same field was associated with the same bit of device memory between Algorithm files would be tricky), there would still be the issue of output. Currently PSyclone has no knowledge of when data is required on the CPU and therefore it makes sense to leave that to the infrastructure. That being so, I need to re-factor the dl_esm_inf code to use the newly-extracted opencl code (that is now in lib/opencl).

arporter · 2018-06-21T16:24:37Z

I don't want dl_esm_inf to have to depend on PSyclone though so I think this means we nead YAR (yet another repository) containing the Fortran->OpenCL interface code. In a spirit of collaboration I've done a google for such a code and found hiCL but that is based on C/C++ with a Fortran wrapper. It also hasn't been updated for two years.

arporter · 2018-06-22T10:02:53Z

I've created https://github.com/stfc/FortCL and moved my OpenCL wrapper code into there. This has the advantage that it's pure Fortran. dl_esm_inf now has FortCL as a submodule.

arporter · 2018-07-11T17:07:43Z

Have now brought branch up-to-date with master and used the new CharDeclGen to declare my list of kernel names correctly. Generated OpenCL code does not quite compile just yet.

arporter · 2018-07-12T10:49:08Z

In order to reduce code duplication I've added an is_literal argument to psyGen.args_filter(). I can then use this in several places in gocean1p0 when dealing with scalar arguments. Scalar arguments are now passed into the kernel-argument-setting routines as required.
Generated OpenCL version now compiles!

rupertford · 2018-07-12T13:23:11Z

Whoop!

arporter · 2018-07-20T10:33:20Z

Generated (PSycloneBench) code now runs as far as the point where I need to set a kernel argument that is a grid property. I now realise that I haven't created the buffers on the device for grid properties...

…buffer size [skip ci]

arporter · 2018-07-20T13:17:39Z

Code now runs through but some kernels fail because arguments aren't set. On closer inspection I realise that in Fortran, the momentum kernels have use model_mod, only: rdt, cbfr, visc. When I ported this kernel to OpenCL I made those scalars into kernel arguments (which is the way it has to be in OpenCL).

arporter · 2018-07-20T13:30:18Z

When we have a tool for converting Fortran kernels to OpenCL then we will be able to capture information on any such conversion of module variables into kernel arguments. In fact, we could do it now as part of the meta-data parsing (since we parse the kernel source anyway). I can break that out as a separate Issue.
I think the only other problem left to solve is that NEMOLite2D uses a fake 'built-in' to copy fields. For OpenCL we need to do a clEnqueueCopyBuffer. However, the gocean1.0 API does not actually support built-ins- for now I've simply commented-out the field-copy kernels from the invoke.

rupertford · 2019-01-31T15:14:52Z

PR #216 has been merged to master. Closing this issue.

arporter self-assigned this May 24, 2018

arporter added a commit that referenced this issue May 24, 2018

#174 add opencl option and generate use statements in psy module

a53bb95

arporter added a commit that referenced this issue Jun 4, 2018

#174 add first OCL test for gocean

5b67df5

arporter added a commit that referenced this issue Jun 4, 2018

#174 add new OCLTrans() transformation

e66212e

arporter added a commit that referenced this issue Jun 4, 2018

#174 rm opencl from constructor and have as member of Schedule

0883b24

arporter added a commit that referenced this issue Jun 15, 2018

#174 generate code to set kernel args

b484c58

arporter added a commit that referenced this issue Jun 15, 2018

#174 add opencl option and generate use statements in psy module

bb0a42a

arporter added a commit that referenced this issue Jun 15, 2018

#174 add first OCL test for gocean

dd3309c

arporter added a commit that referenced this issue Jun 15, 2018

#174 add new OCLTrans() transformation

585eafb

arporter added a commit that referenced this issue Jun 15, 2018

#174 rm opencl from constructor and have as member of Schedule

6c0a570

arporter added a commit that referenced this issue Jun 15, 2018

#174 generate code to set kernel args

9f161c0

arporter added a commit that referenced this issue Jun 15, 2018

#174 no explicit loops for ocl

e71012f

arporter added a commit that referenced this issue Jun 15, 2018

#174 add call to set kernel args before each kernel call

b540c75

arporter added a commit that referenced this issue Jun 15, 2018

#174 add Fortran OpenCLn utility code

c998f56

arporter added a commit that referenced this issue Jun 15, 2018

#174 add option to read kernel filename from env var. Allow add_kerne…

aeb3563

…ls() to be called multiple times.

arporter added a commit that referenced this issue Jun 15, 2018

#174 add the target attribute to DeclGen

1de9969

arporter added a commit that referenced this issue Jun 15, 2018

#174 add if(first_time) section to psy

7b776ff

arporter added a commit that referenced this issue Jun 15, 2018

#174 add calls to copy fields to device

26331cb

arporter added a commit that referenced this issue Jun 15, 2018

#174 add expected arg-setting code to test [skip ci]

26b074b

arporter added a commit that referenced this issue Jun 15, 2018

#174 add new gen_ocl_init() to create ocl-initialisation routine in P…

0ccd814

…Sy module

arporter added a commit that referenced this issue Jun 15, 2018

#174 add x-failing test for psy_init()

646f8f0

arporter added the in progress label Jun 27, 2018

arporter added a commit that referenced this issue Jun 27, 2018

#174 use c_sizeof instead of sizeof. Avoid setting scalar args [skip ci]

5838fc4

arporter added a commit that referenced this issue Jun 27, 2018

#174 WIP updating declarations of quantities in PSy layer [skip ci]

7fa79e5

arporter mentioned this issue Jun 27, 2018

Re-factor DeclGen and TypeDeclGen #181

Closed

arporter added a commit that referenced this issue Jul 11, 2018

#174 bring branch up-to-date with master (after declgen refactor)

03e1c2f

arporter added a commit that referenced this issue Jul 11, 2018

#174 rm duplicate target attribute after merge [skip ci]

125f173

arporter added a commit that referenced this issue Jul 11, 2018

#174 correct type of kernel-name list [skip ci]

17e0b55

arporter added a commit that referenced this issue Jul 12, 2018

#174 mv gen of set-kern-args into gocean1p0 [skip ci]

9d86ace

arporter added a commit that referenced this issue Jul 17, 2018

#174 ensure ocl-init routine only executed once [skip ci]

594fdbe

arporter added a commit that referenced this issue Jul 20, 2018

#174 create device buffer for each field

91ec86c

arporter added a commit that referenced this issue Jul 20, 2018

#174 add arg index and name to setkernelarg message [skip ci]

3dc8fee

arporter added a commit that referenced this issue Jul 20, 2018

#174 add code to set kernel argument for nx [skip ci]

1fd2f24

arporter added a commit that referenced this issue Jul 20, 2018

#174 add code to set-up grid property arrays on device [skip ci]

d3fd8d5

arporter added a commit that referenced this issue Jul 20, 2018

#174 correct kind of globalsize and use c_sizeof to calculate device …

92b3319

…buffer size [skip ci]

arporter mentioned this issue Jul 20, 2018

Identify module variables used by kernels #190

Closed

arporter added a commit that referenced this issue Jul 20, 2018

#174 change to generate code for new FortCL interface [skip ci]

5b1b707

arporter added a commit that referenced this issue Sep 20, 2018

#174 bring tests up-to-date now that we set the nx argument [skip ci]

ab91844

arporter added a commit that referenced this issue Sep 20, 2018

#174 bring branch up-to-date with master

dd26b4d

arporter mentioned this issue Sep 20, 2018

Initial support for generating an OpenCL PSy layer #216

Merged

rupertford closed this as completed Jan 31, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support OpenCL PSy layer for GOcean 1.0 #174

Support OpenCL PSy layer for GOcean 1.0 #174

arporter commented May 24, 2018

arporter commented May 24, 2018

arporter commented Jun 4, 2018

arporter commented Jun 4, 2018

arporter commented Jun 8, 2018

arporter commented Jun 8, 2018

arporter commented Jun 15, 2018

arporter commented Jun 15, 2018 •

edited

Loading

arporter commented Jun 21, 2018

arporter commented Jun 22, 2018

arporter commented Jul 11, 2018

arporter commented Jul 12, 2018

rupertford commented Jul 12, 2018

arporter commented Jul 20, 2018

arporter commented Jul 20, 2018

arporter commented Jul 20, 2018

rupertford commented Jan 31, 2019

Support OpenCL PSy layer for GOcean 1.0 #174

Support OpenCL PSy layer for GOcean 1.0 #174

Comments

arporter commented May 24, 2018

arporter commented May 24, 2018

arporter commented Jun 4, 2018

arporter commented Jun 4, 2018

arporter commented Jun 8, 2018

arporter commented Jun 8, 2018

arporter commented Jun 15, 2018

arporter commented Jun 15, 2018 • edited Loading

arporter commented Jun 21, 2018

arporter commented Jun 22, 2018

arporter commented Jul 11, 2018

arporter commented Jul 12, 2018

rupertford commented Jul 12, 2018

arporter commented Jul 20, 2018

arporter commented Jul 20, 2018

arporter commented Jul 20, 2018

rupertford commented Jan 31, 2019

arporter commented Jun 15, 2018 •

edited

Loading