Tasks Modified

Add `--examples` Argument for Fine-Grained Task Evaluation in `lm-evaluation-harness`. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #4018

Re-run triggered January 15, 2025 18:19

baberabb

#2520

mirianfsilva:examples-arg

Status Success

Total duration 1m 31s

Artifacts –

new_tasks.yml

on: pull_request

Scan for changed tasks

Annotations

1 warning

Scan for changed tasks

ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636