Skip to content

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #4018

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #4018

Re-run triggered January 15, 2025 18:19
Status Success
Total duration 1m 31s
Artifacts

new_tasks.yml

on: pull_request
Scan for changed tasks
1m 23s
Scan for changed tasks
Fit to window
Zoom out
Zoom in

Annotations

1 warning
Scan for changed tasks
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636