Actions: EleutherAI/lm-evaluation-harness
Actions
Showing runs from all workflows
5,643 workflow runs
5,643 workflow runs
--examples
Argument for Fine-Grained Task Evaluation in lm-evaluation-harness
. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Tasks Modified
#4023:
Pull request #2520
synchronize
by
mirianfsilva
--examples
Argument for Fine-Grained Task Evaluation in lm-evaluation-harness
. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Unit Tests
#3995:
Pull request #2520
synchronize
by
mirianfsilva
--examples
Argument for Fine-Grained Task Evaluation in lm-evaluation-harness
. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Tasks Modified
#4018:
Pull request #2520
synchronize
by
mirianfsilva
--examples
Argument for Fine-Grained Task Evaluation in lm-evaluation-harness
. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Unit Tests
#3990:
Pull request #2520
synchronize
by
mirianfsilva