WIP: Task definitions as json strings #145

ngc92 · 2025-01-21T01:29:40Z

Store generic task definitions as json string in the database (abusing the existing reference_code field for now).
This allows us to easily define multi-file tasks, have different eval scripts for different tasks (even though its possible, in general we should avoid that, I think)
Currently, I've copied (and then adapted) the file, but once we have the tasks repo, we can just symlink the eval.cu master copy into the individual tasks so we won't have to maintain multiple copies (with the option to make breaking changes for newer tasks without having to worry about existing ones, yay)

The identity exampe shows roughly how I envision task definitions to look like. In particular, we have a task.h that just defines the interface, and then we won't have to include the submission in the main file, and instead can compile it separately. We could also separate out the reference code, not sure if we want to, though.

To facilitate development and testing, I've added a command that creates a leaderboard from a local directory. It can overwrite an existing leaderboard, so you can iterate quickly.

I've added some translation code that attempts to take leaderboards that are still in their current format and map them to the new one. That code is pretty much untested.

msaroufim · 2025-01-21T02:11:21Z

src/discord-cluster-manager/cogs/leaderboard_cog.py

+        task: LeaderboardTask,
+    ) -> bool:
+        # Ask the user to select GPUs
+        view = GPUSelectionView([gpu.name for gpu in GitHubGPU] + [gpu.name for gpu in ModalGPU])


not feedback specific to this PR but one idea is we make the default view aggregate both the scheduler and the kernel on the same leaderboard. It'll be similar to F1 racing where the winner is a combination of the best car + best driver

When announcing winners we could then give prizes per scheduler

@msaroufim Do we have a sense for what other schedulers there will be? Our current schedulers (Modal vs. GH runners) don't really have overlap on devices in the first place.

msaroufim · 2025-01-21T02:12:08Z

src/discord-cluster-manager/leaderboard_db.py

+                task = LeaderboardTask.from_str(res[3])
+            except json.JSONDecodeError:
+                logging.error("json decoding error in LB %s. Legacy task?", leaderboard_name)
+                task = build_from_legacy_reference(res[3])


happy with BC breakages until we officially launch

msaroufim · 2025-01-21T02:13:02Z

src/discord-cluster-manager/task.py

@@ -0,0 +1,122 @@
+import copy


msaroufim · 2025-01-21T02:36:40Z

examples/identity_cuda/eval.cu

@@ -0,0 +1,138 @@
+#include <chrono>


should this file be in this folder? or are you envisioning that people copy paste some version of this per kernel

for this "demo", yes. for the reference-kernels repo, we'd have one master copy, and then symlink that into the individual task directories. If we make minor fixes, they automatically propagate to existing tasks, if we want to do breaking changes, we can make a copy and symlink that into any new task, leaving existing stuff working.

ngc92 requested review from msaroufim, alexzhang13 and S1ro1 and removed request for alexzhang13 January 21, 2025 01:30

ngc92 changed the title ~~Task definitions as json strings~~ WIP: Task definitions as json strings Jan 21, 2025

msaroufim reviewed Jan 21, 2025

View reviewed changes

ngc92 added 4 commits January 21, 2025 12:04

generalized leaderboard to allow arbitrary tasks

a4d2405

extended capabilities for customization of compile_cuda_script

0b00fe8

utility command for creating a leaderboard from a local directory

623ab1d

working on task definitions for identity example

2029e9f

ngc92 force-pushed the ngc92/generic-task branch 2 times, most recently from 34bb1d5 to e7da7c9 Compare January 21, 2025 12:46

update ci tests

7aaa97e

ngc92 force-pushed the ngc92/generic-task branch from e7da7c9 to 7aaa97e Compare January 21, 2025 12:56

Feat: identity example for python

f4aaa24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Task definitions as json strings #145

WIP: Task definitions as json strings #145

ngc92 commented Jan 21, 2025

msaroufim Jan 21, 2025

alexzhang13 Jan 21, 2025

msaroufim Jan 21, 2025

msaroufim Jan 21, 2025

msaroufim Jan 21, 2025

ngc92 Jan 21, 2025

WIP: Task definitions as json strings #145

Are you sure you want to change the base?

WIP: Task definitions as json strings #145

Conversation

ngc92 commented Jan 21, 2025

msaroufim Jan 21, 2025

Choose a reason for hiding this comment

alexzhang13 Jan 21, 2025

Choose a reason for hiding this comment

msaroufim Jan 21, 2025

Choose a reason for hiding this comment

msaroufim Jan 21, 2025

Choose a reason for hiding this comment

msaroufim Jan 21, 2025

Choose a reason for hiding this comment

ngc92 Jan 21, 2025

Choose a reason for hiding this comment